-
Actually, we do something even better: KoboldAI already has automatic context handling, and will automatically displace older context as new text exceeds the limit. The context length limit is dynamic and can be configured from the Settings panel inside the Kobold UI. So if you send a longer text, it should be trimmed appropriately to stay within the limit while keeping the newest text. Let me know if it doesn't work.
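For anyone curious what that displacement looks like, here's a minimal sketch, assuming tokens are plain integers. `trim_context` and `ctx_limit` are illustrative names, not KoboldAI's actual API:

```cpp
#include <cstddef>
#include <vector>

// Minimal sketch of automatic context displacement: when the token
// stream grows past the window, drop the oldest tokens so the newest
// ones always fit. Names are hypothetical, not KoboldAI's real code.
std::vector<int> trim_context(std::vector<int> tokens, std::size_t ctx_limit) {
    if (tokens.size() > ctx_limit) {
        // Erase from the front: oldest context is displaced first.
        tokens.erase(tokens.begin(),
                     tokens.begin() + static_cast<long>(tokens.size() - ctx_limit));
    }
    return tokens;
}
```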
-
Title mostly explains itself. llama.cpp implemented 'infinite output' via context swapping when the context size limit is reached. As part of that, an argument called `--keep` was added that lets the user decide how many tokens of the initial prompt should be kept in context after the swap occurs. As far as I can tell, this repository doesn't use or change the default value of `n_keep`, nor does it provide a command-line argument to set it. So I was just wondering whether it might be worth changing the default behavior, or allowing some way to set it without needing to hardcode the value and recompile.
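For reference, my rough reading of how the upstream swap behaves, as a hedged sketch: `swap_context` is a hypothetical helper, and the "carry over about half the remaining budget" detail mirrors my understanding of llama.cpp's behavior rather than its exact code.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Illustrative llama.cpp-style context swap: once the window is full,
// preserve the first n_keep prompt tokens, then carry over roughly half
// of the remaining budget from the most recent tokens so generation can
// continue past the context limit. Not the library's actual code.
std::vector<int> swap_context(const std::vector<int>& tokens,
                              std::size_t n_ctx, std::size_t n_keep) {
    assert(n_keep <= n_ctx);
    if (tokens.size() < n_ctx) {
        return tokens;                          // window not full yet, no swap
    }
    std::vector<int> out(tokens.begin(),
                         tokens.begin() + static_cast<long>(n_keep));
    std::size_t n_take = (n_ctx - n_keep) / 2;  // carry over half the budget
    out.insert(out.end(),
               tokens.end() - static_cast<long>(n_take), tokens.end());
    return out;
}
```

In llama.cpp itself this is exposed on the command line as, e.g., `--keep 48`, so exposing an equivalent flag here would avoid recompiling just to change the value.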