Skip to content

Commit cba2e4e

Browse files
committed
warm up more tokens
1 parent b73985c commit cba2e4e

File tree

1 file changed

+12
-0
lines changed

1 file changed

+12
-0
lines changed

app.py

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -176,6 +176,18 @@ def load_model():
176176
print("Warming up (40 tokens)…")
177177
with torch.inference_mode():
178178
_ = model.generate(**dummy, max_new_tokens=40)
179+
print("Warming up (100 tokens)…")
180+
with torch.inference_mode():
181+
_ = model.generate(**dummy, max_new_tokens=100)
182+
print("Warming up (200 tokens)…")
183+
with torch.inference_mode():
184+
_ = model.generate(**dummy, max_new_tokens=200)
185+
print("Warming up (400 tokens)…")
186+
with torch.inference_mode():
187+
_ = model.generate(**dummy, max_new_tokens=400)
188+
print("Warming up (800 tokens)…")
189+
with torch.inference_mode():
190+
_ = model.generate(**dummy, max_new_tokens=800)
179191
print("Warm-up complete.")
180192

181193

0 commit comments

Comments
 (0)