Conversation

@VJHack
Contributor

@VJHack VJHack commented Sep 19, 2024

This enables the --no-context-shift argument to be passed to the server.
The server will generate n_predict tokens such that n_predict <= n_ctx - n_tokens_prompt.
If n_tokens_prompt > n_ctx, an error is returned and the slot is discarded.

Implements feature request #9390
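The token budget described above can be sketched as follows (a minimal illustration, not the PR's actual code; the function name `max_n_predict` is mine):

```cpp
#include <algorithm>

// Hypothetical helper (not from the PR): with context shift disabled,
// generation is capped so that n_predict <= n_ctx - n_tokens_prompt.
// A prompt that fills or exceeds the context leaves no room to generate.
int max_n_predict(int n_ctx, int n_tokens_prompt) {
    return std::max(0, n_ctx - n_tokens_prompt);
}
```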

@VJHack VJHack changed the title from "allow disable context shift for sever" to "server: disable context shift" Sep 19, 2024
@VJHack VJHack marked this pull request as draft September 19, 2024 02:06
// context shift is disabled and prompt is too large - discard it
if (!params.ctx_shift && slot.n_prompt_tokens > slot.n_ctx) {
slot.release();
send_error(slot, "Input is too large to process. Either disable context shift or increase context length. ", ERROR_TYPE_SERVER);

Shouldn't the error say enable context shift, since it's already disabled?

Contributor Author

Yes, I made the change. Thanks for the correction.

@ExtReMLapin
Contributor

n_ctx divided by slots available, right?

@VJHack VJHack marked this pull request as ready for review September 20, 2024 19:58
@VJHack VJHack requested a review from eskeletor97 September 20, 2024 20:11
@VJHack
Contributor Author

VJHack commented Sep 20, 2024

n_ctx divided by slots available, right?

slot.n_ctx for each slot is allocated as n_ctx divided by the total number of slots.
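To make the per-slot split concrete, here is a sketch with a hypothetical helper (the name `slot_n_ctx` is mine; integer division follows from "n_ctx divided by total number of slots" above, and remainder handling is an assumption of this sketch):

```cpp
// Hypothetical illustration of the per-slot allocation described above:
// each parallel slot receives n_ctx divided by the number of slots.
int slot_n_ctx(int n_ctx, int n_slots) {
    // Integer division; any remainder is simply dropped in this sketch.
    return n_ctx / n_slots;
}
```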

@ExtReMLapin
Contributor

ExtReMLapin commented Sep 21, 2024

According to your commit message it returns 200; 413 would be better.

IMHO, it's always better when a tool is clear without reading any documentation; 200 with null is just confusing.

@VJHack
Contributor Author

VJHack commented Sep 21, 2024

@ExtReMLapin Sorry if the commit message wasn't clear. In the commit you're referring to, it initially returned a 200 null response, but I fixed it so it returns 500 with the error message "Input is too large to process. Either enable context shift or increase the context length."

But now that you mention it, I think 400 is more appropriate because it matches what OpenAI uses in their API response. Just updated it to 400.

@ExtReMLapin
Contributor

Thanks for the fix, I was about to open a PR to add 413, but if that's what OpenAI is doing, fair enough!

continue;
}
// context shift is disabled and prompt is too large - discard it
if (!params.ctx_shift && (slot.n_prompt_tokens > slot.n_ctx)) {

I don't know if this leads to correct behavior. I found that if we do:

slot.n_prompt_tokens > slot.n_ctx

Then, it's possible to fall through the check down to prompt truncation, which might be confusing for the user. Maybe we should change it to:

slot.n_prompt_tokens >= slot.n_ctx

Maybe someone more knowledgeable could chime in.
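The boundary case under discussion can be sketched like this (illustrative only, not the PR's code; the helper names are mine):

```cpp
// With the strict check, a prompt that exactly fills the context
// (n_prompt == n_ctx) is NOT discarded, even though zero tokens of
// room remain for generation, so control can fall through to the
// prompt-truncation path. The inclusive check discards it instead.
bool discard_strict(int n_prompt, int n_ctx)    { return n_prompt >  n_ctx; }
bool discard_inclusive(int n_prompt, int n_ctx) { return n_prompt >= n_ctx; }
```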

@ngxson
Copy link
Collaborator

ngxson commented Sep 23, 2024

As I'm merging #9607 , I'll close this PR. Feel free to discuss on the other PR if you want to change something.

@ngxson ngxson closed this Sep 23, 2024