Skip to content

Commit f8f8f06

Browse files
committed
Fix test
1 parent b74e2c3 commit f8f8f06

File tree

1 file changed

+1
-4
lines changed

1 file changed

+1
-4
lines changed

examples/models/llama/runner/generation.py

Lines changed: 1 addition & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -100,10 +100,7 @@ def generate( # noqa: C901
100100
),
101101
)
102102

103-
if self.has_full_logits:
104-
current_token = next_token(logits[:, -1, :], temperature, top_p)
105-
else:
106-
current_token = next_token(logits, temperature, top_p)
103+
current_token = next_token(logits, temperature, top_p)
107104
print(f"{self.tokenizer.decode_token(current_token)}", end="", flush=True)
108105
tokens = prompt_tokens + [current_token]
109106

0 commit comments

Comments
 (0)