Skip to content
Discussion options

You must be logged in to vote

Howdy, folks. 👋 Ryan here from the Gemini CLI team. Hopefully, I can demystify the behavior.

For those devs using the free tier by logging in with your Google account, our goal is to deliver the best possible experience at the keyboard – ideally, one where you never have to stop work because you hit a limit. To do that, we have to balance model choice with capacity. Thus, the free tier uses a blend of Gemini 2.5 Pro and Flash.

For example, we might use Flash to determine the complexity of a request before routing a request to the model for the “official” response. After all, Pro is overkill for a lot of really simple steps (e.g. “start the npm server”) better routed to Flash. Pro is bette…

Replies: 11 comments 12 replies

Comment options

You must be logged in to vote
1 reply
@Domoramo
Comment options

Comment options

You must be logged in to vote
4 replies
@Divyanshu-20
Comment options

@GUSTAVOWORKW
Comment options

@JoshuaKahle
Comment options

@rutexd
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
7 replies
@Manamama
Comment options

@JoshuaKahle
Comment options

@Manamama
Comment options

@carlosglz1912
Comment options

@skaman5
Comment options

Answer selected by ryanjsalva
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet