Skip to content

Commit 14a1234

Browse files
Added workers ai to workers pricing
1 parent 8bf72ab commit 14a1234

File tree

1 file changed

+63
-3
lines changed

1 file changed

+63
-3
lines changed

src/content/docs/workers/platform/pricing.mdx

Lines changed: 63 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -9,9 +9,9 @@ description: Workers plans and pricing information.
99

1010
import { GlossaryTooltip, Render } from "~/components";
1111

12-
By default, users have access to the Workers Free plan. The Workers Free plan includes limited usage of Workers, Pages Functions and Workers KV. Read more about the [Free plan limits](/workers/platform/limits/#worker-limits).
12+
By default, users have access to the Workers Free plan. The Workers Free plan includes limited usage of Workers, Pages Functions, Workers AI and Workers KV. Read more about the [Free plan limits](/workers/platform/limits/#worker-limits).
1313

14-
The Workers Paid plan includes Workers, Pages Functions, Workers KV, and Durable Objects usage for a minimum charge of $5 USD per month for an account. The plan includes increased initial usage allotments, with clear charges for usage that exceeds the base plan.
14+
The Workers Paid plan includes Workers, Pages Functions, Workers KV, Workers AI and Durable Objects usage for a minimum charge of $5 USD per month for an account. The plan includes increased initial usage allotments, with clear charges for usage that exceeds the base plan.
1515

1616
All included usage is on a monthly basis.
1717

@@ -101,7 +101,7 @@ If you had a Worker on the Bundled usage model prior to the migration to Standar
101101

102102
:::note
103103

104-
Some Workers Enterprise customers maintain the ability to change usage models.
104+
Some Workers Enterprise customers maintain the ability to change usage models.
105105

106106
Usage models may be changed at the individual Worker level:
107107

@@ -205,6 +205,66 @@ For example, if Worker A makes a subrequest to Worker B via a Service Binding, o
205205
If your Worker is on the deprecated [Bundled](/workers/platform/pricing/#example-pricing-bundled-usage-model) or [Unbound](/workers/platform/pricing/#example-pricing-unbound-usage-model) pricing plans, subrequests and RPC calls made via Service Bindings are charged as if they are request from the Internet. In the example above, you would be charged for two requests, one to Worker A, and one to Worker B.
206206
:::
207207

208+
## Workers AI
209+
210+
### Text Generation LLMs (incl Vision models)
211+
212+
Model size is measured in parameters.
213+
Pricing is based on blended tokens (input + output).
214+
Vision models will convert the image input into tokens for billing. Depending on size an aspect ratio, images will be charged for between 1,601 and 6,404 tokens. Most images that are more that 224 pixels wide or tall will be charged as 6,404 tokens each.
215+
216+
| Model Size | Pricing |
217+
| ----------- | ------------------------ |
218+
| \<= 3B | $0.10 per Million Tokens |
219+
| 3.1B - 8B | $0.15 per Million Tokens |
220+
| 8.1B - 20B | $0.20 per Million Tokens |
221+
| 20.1B - 40B | $0.50 per Million Tokens |
222+
| 40.1B+ | $0.75 per Million Tokens |
223+
224+
### Embeddings
225+
226+
Model size is measured in parameters.
227+
Pricing is based on input tokens.
228+
229+
| Model Size | Pricing |
230+
| ------------------- | ------------------------- |
231+
| \<= 150M parameters | $0.008 per Million Tokens |
232+
| 151M+ parameters | $0.015 per Million Tokens |
233+
234+
### Image Generation
235+
236+
Standard models are large image models such as `@cf/stabilityai/stable-diffusion-xl-base-1.0`
237+
Fast models are usually smaller image models that require fewer steps to generate an image, such as `@cf/black-forest-labs/flux-1-schnell` and `@cf/bytedance/stable-diffusion-xl-lightning`
238+
We take the maximum of the image height and width to calculate pricing. For example, and image of 1024x768 would fall under 1024x1024 pricing.
239+
240+
| Image Size | Price |
241+
| ------------ | -------------------- |
242+
| \<=256x256 | $0.00025 per 5 steps |
243+
| \<=512x512 | $0.0005 per 5 steps |
244+
| \<=1024x1024 | $0.001 per 5 steps |
245+
| \<=2048x2048 | $0.002 per 5 steps |
246+
247+
### Speech-to-text
248+
249+
Speech-to-text models like `@cf/openai/whisper` are billed on minutes of audio input.
250+
251+
| Price |
252+
| --------------------------------- |
253+
| $0.0039 per minute of audio input |
254+
255+
### Free Allocation
256+
257+
Our free allocation allows anyone to use Workers AI up to a certain limit per day. To use more than the free allocation, upgrade to the Workers Paid plan, where you will be charged on any usage above the free tier based on the pricing structure above.
258+
259+
| Model | Free tier size |
260+
| --------------------- | -------------------------------------------- |
261+
| Text Generation - LLM | 10,000 tokens a day across any model size |
262+
| Embeddings | 10,000 tokens a day across any model size |
263+
| Images | Sum of 250 steps, up to 1024x1024 resolution |
264+
| Speech-to-text | 10 minutes of audio a day |
265+
266+
All limits reset daily at 00:00 UTC. If you exceed any one of the above limits, further operations will fail with an error.
267+
208268
## Fine Print
209269

210270
Workers Paid plan is separate from any other Cloudflare plan (Free, Professional, Business) you may have. If you are an Enterprise customer, reach out to your account team to confirm pricing details.

0 commit comments

Comments
 (0)