Skip to content

Commit 1a298a8

Browse files
committed
Generator: Fix job enqueue when max_new_tokens == 1
1 parent 03f3d9b commit 1a298a8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

exllamav3/generator/job.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -191,7 +191,7 @@ def __init__(
191191
self.sequences.append(seq)
192192

193193
# Generation parameters
194-
self.max_new_tokens = max_new_tokens - 1 or 100
194+
self.max_new_tokens = max_new_tokens - 1 or 1
195195
self.min_new_tokens = min_new_tokens
196196
self.new_tokens = 0 if self.prefix_token is None else -1
197197
self.sampler = sampler

0 commit comments

Comments
 (0)