
Conversation

@kylo5aby
Contributor

maybe fix: #9933

Comment on lines 246 to 247
} else if (global_params.n_predict == -2) {
n_remaining = global_params.n_ctx - n_decoded;
Member


This is not precise. It should use the slot's context size rather than the global one, and n_past instead of n_decoded. Writing a server test to verify the implementation would be useful.

@aviallon
Contributor

aviallon commented Apr 6, 2025

I'm interested in this PR!



Development

Successfully merging this pull request may close these issues.

Bug: Unexpected output length (Only one token response!) when set configs "-n -2 -c 256" for llama-server

3 participants