server : disable context shift by default #15416

ggerganov · 2025-08-19T07:11:10Z

Context shift was a useful feature in the past with pre-trained models and the raw /completions API. But today, it is causing a lot of confusion, so it is better to disable it by default. Can be re-enabled with --context-shift CLI arg.

ggml-ci

GuillaumeBruand · 2025-08-19T08:02:54Z

@ggerganov I'm looking for ressources about the behaviour when context overflows. I was planning to conduct experiments using this --context-shift along with --keep N option (still not sure if this one is relevant) and --ctx-size smaller than training context.

What should I get from this change ? Is there a link with attention sink recently supported in llama.cpp ? Is this --context-shift option unrelevant for instruct fine-tuned model ?

ngxson · 2025-08-19T08:29:16Z

What should I get from this change ?

This only changes the default behavior, instead of having context shift on by default, it's now off by default.

You can manually enable it.

ggerganov · 2025-08-19T08:51:47Z

tools/server/tests/unit/test_ctx_shift.py

+    server.enable_ctx_shift = True
    server.start()
+    server.enable_ctx_shift = False


@ngxson I noticed that the server parameters are stateful - i.e. if we change a parameter in one test, it will remain changed for the rest of the tests. This is the reason I do it like this here.

Is there a better way to set the parameter just for the scope of the current test?

It could be possible that the scope=module is the problem. Could you try removing it? (While keeping auto_use)

I was a bit confused about the notion of scope in pytest

Thanks - this seems to work.

ggerganov · 2025-08-19T13:52:30Z

@GuillaumeBruand The context shift is difficult to handle with formatted endpoints such as /chat/completions because it can destroy the structure of the chat template, degrading the quality. So strongly recommend against using it in such cases.

GuillaumeBruand · 2025-08-19T13:58:57Z

Thanks for the insight, I'll go on with this PR and let it disabled for my experiments.

DamonFool · 2025-08-20T09:49:01Z

Hi @ggerganov , the help msg about --context-shift seems incorrect?
Please see #15448 .
Thanks.

server : disable context shift by default

0876d42

ggml-ci

ggerganov requested a review from ngxson as a code owner August 19, 2025 07:11

github-actions bot added examples python python script changes server labels Aug 19, 2025

ngxson approved these changes Aug 19, 2025

View reviewed changes

ggerganov commented Aug 19, 2025

View reviewed changes

server : make scopr of test parameters local

14c2d45

ngxson approved these changes Aug 19, 2025

View reviewed changes

ggerganov mentioned this pull request Aug 19, 2025

changelog : llama-server REST API #9291

Open

ggerganov merged commit d2fcd91 into master Aug 19, 2025
50 checks passed

ggerganov deleted the gg/server-disable-context-shift-default branch August 19, 2025 13:46

DamonFool mentioned this pull request Aug 20, 2025

Help msg about context shift is incorrect #15448

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

server : disable context shift by default #15416

server : disable context shift by default #15416

ggerganov commented Aug 19, 2025

Uh oh!

GuillaumeBruand commented Aug 19, 2025

Uh oh!

ngxson commented Aug 19, 2025

Uh oh!

ggerganov Aug 19, 2025

Uh oh!

ngxson Aug 19, 2025

Uh oh!

ggerganov Aug 19, 2025

Uh oh!

Uh oh!

ggerganov commented Aug 19, 2025

Uh oh!

GuillaumeBruand commented Aug 19, 2025

Uh oh!

DamonFool commented Aug 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

server : disable context shift by default #15416

server : disable context shift by default #15416

Conversation

ggerganov commented Aug 19, 2025

Uh oh!

GuillaumeBruand commented Aug 19, 2025

Uh oh!

ngxson commented Aug 19, 2025

Uh oh!

ggerganov Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

ngxson Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

ggerganov Aug 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ggerganov commented Aug 19, 2025

Uh oh!

GuillaumeBruand commented Aug 19, 2025

Uh oh!

DamonFool commented Aug 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants