MAJOR BILLING ISSUE!!! #4472
Replies: 10 comments 7 replies
-
See my informal tips, mentioning your issue, here #4446 (comment) |
Beta Was this translation helpful? Give feedback.
-
@Manamama Well, I thought the charges above where the worst of it... however, I woke up this morning to an email saying that my account is over the threshold with a new billing amount: On the 16th, I started using the API key - before this, I was only using the login with my google account, and it seems to have selected this account to bill. I was getting a .15 cent to .35 cent per day using the gemini-2.5-flash. After I added my API key and started using it, mostly just getting to see what it could do - it started racking up $$$ and I had no idea! Yesterday, I used it heavy because I used in on one of my projects that I built and maintain and I was making a tutorial video and in about 4 hours, I ran up a bill of $72 - and I had no idea. I was assuming I was in the free tier, because I had used about 160 requests... but 100,000,000 (yes, 100 MILLION) input tokens!!!! So - I have no idea how that happened... the CLI must send an incredible amount of information to the API... and then, unfortunately for the user - they get a huge bill. And this was only over about a 5 hour period while I was making a training tutorial about Gemini CLI!!! I am going to have to contact Google Billing and see if I can get this adjusted, because this is not exactly a fair bill for using a product and then getting hit with this sort of bill when you think you are well within the free use limits! I have removed the API key from the CLI and will be returning to using the flash model with my google account. I thought Claude was expensive! This makes Claude look like a charity case! |
Beta Was this translation helpful? Give feedback.
-
Wow. Wait some so that I fetch my graphs for all to see.... (in very short, I have not paid a cent, but then, I had never linked a credit card to this or other API keys just to avoid your situation...) Update 1: In the meantime (as very busy today with two Gemini's out of my 10 API keys or so, running on a complicated repo fix in parallel, one strategically QA checking the other plus this "thinking plus" tool that you may help you regain some sanity and maybe even $$$ in mid-term, btw... ), here is the formal advice re my "trick" how I do not pay a cent, all as Google itself suggests: https://developers.google.com/gemini-code-assist/resources/faqs :
Update 2: Checking Billing and Usage I’ve dug into my usage stats ("fetched my graphs," as it were), and the raw token counts—sometimes millions consumed in a short time—can be overwhelming and not very insightful. So, here’s my general advice as a fellow end user (not an expert, and I haven’t even fully read the docs, so take this with a grain of salt):
Screenshot references: ![]() ![]() |
Beta Was this translation helpful? Give feedback.
-
Please see my Issue Report: #4495 |
Beta Was this translation helpful? Give feedback.
-
Update 3, Grok AI hereinbelow this time, after it doing a more formal search, which even found my own replies from weeks ago about the same problems to other users here. "Sorry for it being so long, no time to shorten it properly". Also Grok is below, with my minute style fixes only:
Plus a follow up Grok's advice to you:
Which reminds me that weeks ago I did upload my screenshots, from my early usage, also inconclusive: #2896 (comment) |
Beta Was this translation helpful? Give feedback.
-
@Manamama OK - I am happy that you are providing all of this information - bottom line is this: You CANNOT access the supposedly FREE rates (1000 RPD) unless you have an API key. If you attempt it with the Google Account, you will get kicked from PRO to FLASH within a few requests, because of how many tokens the CLI is using - it blows through the allowable Input rate limits quickly, so PRO is simply not viable is logged in through the regular Google account. Of course, at the time - I didnt realize this - Hindsight is always 20/20, just as its easy and simple NOW for you or me to tell people what they should have done - I was doing what I was told through the docs - I was told that the Rate Limits have been set so high that a developer would have a hard time reaching them. Well, I am a professional developer! I am retired from Hewlett Packard as a developer and I have been developing WITHOUT AI TOOLS for many many years. These tools are game changers, and Google explained that to use the 1000 RPD, you have to use an API Key - so, I did. And within 3 days of doing so, I incurred a $142 bill!!! Why - Because I used 502 requests, and over 100 MILLION tokens in just a 5 hour window. And how did that happen? I was just doing what I normally do as a professional developer would, but I was in the process of creating a tutorial video to help other get the environment setup to use the CLI... little did I know, and had no way of knowing because there is no indication in the CLI that you are exceeding the rate limits... that I was incurring this huge bill! I am just happy that I had my threshold set to $100, and I received an email that I had exceeded the threshold and was charged for the use. This is something that EVERYONE using this CLI needs to be aware of until they fix the issue. The CLI can get into a loop where it continues to send questions to the API looking at how to do something, and basically fights with itself - or simply goes around and around with itself - while you are thinking its normal behavior, when in the background, its eating up input token usage at extremely high rates, and all you asked was a simple question to have the CLI do. I had to Escape several times from loops after they ran for 5 or 6 minutes, just thinking that the CLI got stuck, but not knowing that the whole time, in the background, it was sending an unreal amount of data to the API. I burned through 100 MILIION input tokens in about 5 hours of development. And this was just basic web stack stuff... HTML, CSS and JavaScript. I built a weather application that access the NWS API and shows the weather nicely for an area... I was doing it for a tutorial showing the workflow of the CLI... and was charged $142!!! All the time, thinking I was well withing the free use rate limits. So, I will be contacting Google Billing Monday, showing them all of these issues and discussions here and on other discussion boards of the nightmares that have been occurring... and ask for a refund due to the issues with how the CLI is working. I have several billing accounts with Google. I have had them for many years, as I maintain several enterprise level applications and I always know what I am spending and what projected costs will be. This is the first time I have ever had this type of issue where I use a product as it is meant to be used, thinking I am well within a free use tier, and then receive a huge bill. So, stay tuned - and we will see what happens from here... |
Beta Was this translation helpful? Give feedback.
-
You never check the API pricing page before using the Pro model? I host my own gemini chatbot using the API service, it's good but it can be cheaper than Gemini Pro plan if you only need to use it for planning and avoid user data collection for product improvement. |
Beta Was this translation helpful? Give feedback.
-
@tuapuikia Of course I check the API pricing page. Did you know that using the Pro version with CLI would use 100 Million Tokens in just a few hours or regular development? The API Pricing page says nothing about that. It does say that when they set stuff up, they made it so a normal developer would probably never reach its limits. Well - I didnt reach its limits - Its the way the Pro model works or is allowed to work in the CLI. You can go over the pricing structure all you want, it will not help you with what is happening here. |
Beta Was this translation helpful? Give feedback.
-
Related. A request isnt 1 request. It is several back and forth between flash and pro. |
Beta Was this translation helpful? Give feedback.
-
ok, so I havent been billed or something because I saw an email saying I had a bill and my projects would be suspended. I open up google and I have a $2+k effing bill. wtf!!!! I havent touched Gemini API in months and I have two $600 bills. What the heck is going on!!! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
OK, well - there is a potential huge issue. I am a Tier 1 user - then, I connected my API Key to Gemini CLI, because I was not able to use Gemini-2.5-pro, and could only use Flash when signed in with my Google Account. But - I wanted to be able to use Gemini-2.5-Pro to see how that was. Well, its great... however, its SUPER expensive! I have been checking my requests daily for the past 3 days when I sign out of Gemini CLI since I have been using my API key, and I have only used around a couple hundred max per day, so well within the free tier limits. I have just been using it on my projects, and seeing how well it works at looking for issues and fixing problems. Well... although I am in the "FREE TIER" portion - well under 1000 requests, I have somehow racked up a bill of $66 in just 3 days - most of this today! Just a few hours. How is this possible? Here are the screen shots:
How? What is happening where I am getting billed - while I am well under the requests - and 97 MILLION tokens in just one afternoon!!!??? HOW!!! Well, I will be removing my API key and returning to Gemini-2.5-Flash through my regular google account... because at this rate, this would be a $2000 a month bill!!! How is this even possible! Everyone needs to check there billing!!!
Beta Was this translation helpful? Give feedback.
All reactions