diff --git a/src/content/docs/workers-ai/platform/limits.mdx b/src/content/docs/workers-ai/platform/limits.mdx index 8a252517a3fdecc..8f647f14db74f50 100644 --- a/src/content/docs/workers-ai/platform/limits.mdx +++ b/src/content/docs/workers-ai/platform/limits.mdx @@ -38,16 +38,16 @@ Rate limits are default per task type, with some per-model limits defined as fol * 1500 requests per minute -### [Text Classification](/workers-ai/models/#text-classification) +### [Text Classification](/workers-ai/models/) * 2000 requests per minute -### [Text Embeddings](/workers-ai/models/#text-embeddings) +### [Text Embeddings](/workers-ai/models/) * 3000 requests per minute * [@cf/baai/bge-large-en-v1.5](/workers-ai/models/bge-large-en-v1.5/) is 1500 requests per minute -### [Text Generation](/workers-ai/models/#text-generation) +### [Text Generation](/workers-ai/models/) * 300 requests per minute * [@hf/thebloke/mistral-7b-instruct-v0.1-awq](/workers-ai/models/mistral-7b-instruct-v0.1-awq/) is 400 requests per minute @@ -57,11 +57,11 @@ Rate limits are default per task type, with some per-model limits defined as fol * [@cf/qwen/qwen1.5-14b-chat-awq](/workers-ai/models/qwen1.5-14b-chat-awq/) is 150 requests per minute * [@cf/tinyllama/tinyllama-1.1b-chat-v1.0](/workers-ai/models/tinyllama-1.1b-chat-v1.0/) is 720 requests per minute -### [Text-to-Image](/workers-ai/models/#text-to-image) +### [Text-to-Image](/workers-ai/models/) * 720 requests per minute * [@cf/runwayml/stable-diffusion-v1-5-img2img](/workers-ai/models/stable-diffusion-v1-5-img2img/) is 1500 requests per minute -### [Translation](/workers-ai/models/#translation) +### [Translation](/workers-ai/models/) * 720 requests per minute