Cannot change max token output when set in librechat.yaml #6419
Unanswered
frenzybiscuit asked this question in Troubleshooting
Replies: 1 comment · 4 replies
-
This is by design: the YAML file sets things at a global/system level. Remove it from the YAML if you want users to set their own limits.
-
What happened?
When you define max_tokens under addParams in librechat.yaml, it is sent correctly to the LLM backend:
In this case, mine is set to 1024.
However, when a user changes the max output tokens (to a higher or lower value), LibreChat stops sending the parameter to the LLM backend entirely:
The LLM backend remains at the earlier 1024, but that's only because it was set to that value on the previous prompt; max_tokens vanishes completely from new prompts.
The max output tokens setting works fine when it's not predefined in librechat.yaml.
The reason I think this is a bug is that the max_tokens value from addParams does vanish from the requests; LibreChat no longer sends it at all.
I also feel that max_tokens in the YAML should only define the upper limit, and the end-user should be able to set it lower when they want to, so either way this looks like a bug to me.
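For reference, the configuration in question looks roughly like this (a sketch based on the custom-endpoints schema; the endpoint name, URL, and model name are placeholders, only addParams/max_tokens are the settings under discussion):

```yaml
endpoints:
  custom:
    - name: "koboldcpp"                    # placeholder endpoint name
      apiKey: "not-needed"                 # placeholder; local backend
      baseURL: "http://localhost:5001/v1"  # placeholder URL
      models:
        default: ["koboldcpp"]             # placeholder model name
      addParams:
        max_tokens: 1024   # injected into every request, until a user overrides the UI setting
```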
Version Information
ghcr.io/danny-avila/librechat-dev latest c83689215440 5 hours ago 882MB
ghcr.io/danny-avila/librechat-dev e4979ae60fba 40 hours ago 866MB
ghcr.io/danny-avila/librechat-rag-api-dev latest 5f0a3f475b72 12 days ago 7.79GB
ghcr.io/danny-avila/librechat-rag-api-dev-lite latest 6550e7ddf180 12 days ago 1.3GB
Steps to Reproduce
Add max_tokens: 1024 under addParams for a custom model in librechat.yaml.
Use koboldcpp to verify the output.
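If you don't have koboldcpp handy, a minimal OpenAI-compatible stub can serve the same purpose. This is a debugging sketch (not part of LibreChat or koboldcpp): point the endpoint's baseURL at it and watch whether max_tokens arrives with each request.

```python
# Minimal OpenAI-compatible stub for inspecting LibreChat's outgoing requests.
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

class LogHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = json.loads(self.rfile.read(length))
        # The key observation: with addParams set, this prints 1024; after the
        # user changes "max output tokens" in the UI, the field vanishes.
        print("max_tokens in request:", body.get("max_tokens", "<absent>"))
        reply = json.dumps(
            {"choices": [{"message": {"role": "assistant", "content": "ok"}}]}
        ).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(reply)))
        self.end_headers()
        self.wfile.write(reply)

# To listen on the port koboldcpp would normally use:
#   HTTPServer(("127.0.0.1", 5001), LogHandler).serve_forever()
```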
What browsers are you seeing the problem on?
Firefox
Relevant log output
Screenshots
No response
Code of Conduct