
Conversation


@devin-ai-integration bot commented Nov 2, 2025

Add 800 token output cap to Ask AI to reduce AWS Bedrock costs

This PR adds a `maxTokens: 800` parameter to the `streamText()` call in the Anthropic streaming handler to cap output token usage and reduce AWS Bedrock costs.

Context & Motivation

The current Ask AI implementation has no `maxTokens` parameter, while the Slack/Discord bots cap output at 1,000–2,000 tokens. This change adds an 800-token cap to prevent runaway costs from long responses.

Changes Made

  • Added `maxTokens: 800` to the `streamText()` call in `packages/fern-docs/search-server/ask-fern/src/ask-fern/stream-anthropic.ts:162`
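
As a sketch of what the change amounts to (the real call lives in `stream-anthropic.ts` and uses the AI SDK; here `streamText` is a local stub and the model id and messages are placeholders, not the actual implementation):

```typescript
// Hypothetical sketch of the change. `streamText` here is a stub standing in
// for the AI SDK function so the example is self-contained; only the
// `maxTokens: 800` line reflects the actual edit in this PR.
interface StreamTextOptions {
  model: string;
  messages: { role: string; content: string }[];
  maxTokens?: number; // output cap added by this PR
}

function streamText(options: StreamTextOptions): StreamTextOptions {
  // Stub: the real AI SDK function streams a completion; this just echoes
  // the options so the cap can be inspected.
  return options;
}

const result = streamText({
  model: "anthropic.claude", // placeholder model id
  messages: [{ role: "user", content: "How do I get started?" }],
  maxTokens: 800, // cap output tokens to reduce Bedrock spend
});

console.log(result.maxTokens); // 800
```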

Testing

⚠️ This change has NOT been tested yet. Please verify:

  1. The parameter name maxTokens is correct for AI SDK v5.0.0-beta.2 (not maxOutputTokens)
  2. The 800 token cap is appropriate for your use case
  3. Consider a phased rollout starting with high-cost domains (ElevenLabs)
  4. Monitor answer quality metrics after deployment

Human Review Checklist

  • Verify maxTokens is the correct parameter name for AI SDK version 5.0.0-beta.2
  • Confirm 800 tokens is an appropriate cap (not too restrictive for quality)
  • Consider if this should be rolled out to all domains at once or phased
  • Plan to monitor both cost reduction AND answer quality/satisfaction metrics post-deployment
  • Verify lint checks passed

Devin session: https://app.devin.ai/sessions/d3c90a389a754e37932c1850826ece6b
Requested by: [email protected] (@sahil485)

Add maxTokens: 800 parameter to streamText() call in stream-anthropic.ts
to cap output token usage and reduce costs. This prevents unbounded response
lengths which were contributing to high AWS Bedrock bills.

Co-Authored-By: [email protected] <[email protected]>

vercel bot commented Nov 2, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Preview | Updated (UTC) |
| --- | --- | --- | --- |
| dev.ferndocs.com | Ready | Preview | Nov 2, 2025 3:08am |
| fern-dashboard | Ready | Preview | Nov 2, 2025 3:08am |
| fern-dashboard-dev | Ready | Preview | Nov 2, 2025 3:08am |
| ferndocs.com | Ready | Preview | Nov 2, 2025 3:08am |
| preview.ferndocs.com | Ready | Preview | Nov 2, 2025 3:08am |
| prod-assets.ferndocs.com | Ready | Preview | Nov 2, 2025 3:08am |
| prod.ferndocs.com | Ready | Preview | Nov 2, 2025 3:08am |

1 Skipped Deployment

| Project | Deployment | Preview | Updated (UTC) |
| --- | --- | --- | --- |
| fern-platform | Ignored | Ignored | Nov 2, 2025 3:08am |

@devin-ai-integration

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

  • Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
  • Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

  • Disable automatic comment and CI monitoring

Fix TypeScript error by changing `maxTokens` to `maxOutputTokens`, which is
the correct parameter name for AI SDK v5.0.0-beta.2. Also apply the same
cap to `stream-cohere.ts` for consistency and add a shared `MAX_OUTPUT_TOKENS`
constant to `stream-constants.ts`.
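
The shared constant described in this commit might look like the sketch below. The file layout follows the commit message, but the helper function and its name are assumptions added for illustration; only the constant name and the v5 option name `maxOutputTokens` come from the commit itself:

```typescript
// Sketch based on the commit message above; anything beyond the constant
// name and the maxOutputTokens option name is a hypothetical illustration.

// stream-constants.ts: one shared output cap for all streaming handlers.
export const MAX_OUTPUT_TOKENS = 800;

// In stream-anthropic.ts and stream-cohere.ts, the AI SDK v5 call would then
// pass the constant under the v5 option name (maxOutputTokens, not maxTokens).
// withOutputCap is a hypothetical helper showing how both handlers could
// share the cap:
export function withOutputCap<T extends object>(options: T) {
  return { ...options, maxOutputTokens: MAX_OUTPUT_TOKENS };
}

console.log(withOutputCap({ temperature: 0 }));
// { temperature: 0, maxOutputTokens: 800 }
```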

Co-Authored-By: [email protected] <[email protected]>