
Conversation

@nlpfollower

It seems we should yield cur_token/next_token when it is an eos or eot token, so that it is included by generator_func in chat. This also ensures start_pos is incremented to an empty position in the cache. The repro is similar to #1462, although there is no visible response degradation.

I've pushed logs from my trace (nlpfollower#2). They show how, without this yield, the first id in input_pos at the start of a round overlaps, and overwrites, the last cache entry written in the previous turn.
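To make the cache-position issue concrete, here is a minimal sketch of the generator pattern involved. The names (decode_one_token, generator_func, stop_ids) and the dummy model step are illustrative stand-ins, not the actual torchchat code:

```python
# Hedged sketch of the pattern this PR changes; names are illustrative.

def decode_one_token(cur_token: int, pos: int) -> int:
    """Dummy model step: pretend to write the KV cache at slot `pos`
    and return the next token (emits a fake eos, id 2, on step 4)."""
    return 2 if pos >= 3 else cur_token + 1

def generator_func(prompt_last_token: int, start_pos: int,
                   stop_ids=frozenset({2}), max_new_tokens=16):
    cur_token = prompt_last_token
    for _ in range(max_new_tokens):
        # This decode occupies cache slot `start_pos`.
        next_token = decode_one_token(cur_token, start_pos)
        start_pos += 1
        if next_token in stop_ids:
            # The fix: yield the eos/eot token instead of silently breaking,
            # so the caller counts it when advancing its position.
            yield next_token
            return
        yield next_token
        cur_token = next_token

# The chat loop advances its running position by one per yielded token.
# If the eos/eot token is never yielded, the next round's first input_pos
# points at the cache slot that token already filled, and the new turn's
# first write overwrites it.
pos = 0
for tok in generator_func(prompt_last_token=10, start_pos=pos):
    pos += 1
print(pos)  # with the yield, pos now points at the first empty cache slot
```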

@pytorch-bot commented on Jan 23, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1474

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit aef0b8b with merge base 42c52bf:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Jan 23, 2025
@Jack-Khuu
Contributor

Thanks for another fix; I'll try to verify over the weekend.

@Jack-Khuu (Contributor) left a comment:

My gut says this is legit; I'll give it another look.

@Jack-Khuu merged commit 53a1004 into pytorch:main on Feb 7, 2025
62 checks passed