Add gemini 2.5 flash preview ----thinking mode and max tokens for reasoning #2746

hxhyyy · 2025-04-18T06:38:55Z

hxhyyy
Apr 18, 2025

This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens.

To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing.

Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation

cte · 2025-04-18T08:27:36Z

cte
Apr 18, 2025
Maintainer

#2752

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add gemini 2.5 flash preview ----thinking mode and max tokens for reasoning #2746

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Add gemini 2.5 flash preview ----thinking mode and max tokens for reasoning #2746

Uh oh!

hxhyyy Apr 18, 2025

Replies: 1 comment

Uh oh!

cte Apr 18, 2025 Maintainer

hxhyyy
Apr 18, 2025

cte
Apr 18, 2025
Maintainer