Skip to content

Conversation

@diwakar-s-maurya
Copy link

@diwakar-s-maurya diwakar-s-maurya commented Sep 6, 2025

Description

Support Qwen3-Coder-480B-A35B-Instruct on Google Vertex AI


Important

Add support for qwen/qwen3-coder-480b-a35b-instruct-maas model and us-south1 region in Vertex AI configuration.

  • Models:
    • Add qwen/qwen3-coder-480b-a35b-instruct-maas to vertexModels in vertex.ts with 65,536 max tokens, 262,144 context window, input price 1, output price 4.
  • Regions:
    • Add us-south1 to VERTEX_REGIONS in vertex.ts.

This description was created by Ellipsis for 4c5d79e. You can customize this summary. It will automatically update as commits are pushed.

@dosubot dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. enhancement New feature or request labels Sep 6, 2025
Copy link
Contributor

@roomote roomote bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for your contribution! I've reviewed the changes and have some suggestions for improvement to maintain consistency with the existing codebase.

outputPrice: 1.15,
description: "Meta Llama 4 Maverick 17B Instruct model, 128K context.",
},
"qwen/qwen3-coder-480b-a35b-instruct-maas": {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I noticed the model ID uses a namespace prefix qwen/ while the other MaaS model llama-4-maverick-17b-128e-instruct-maas doesn't use any prefix. Is this intentional? Should we maintain consistency across MaaS models, or is the namespace required for Qwen models specifically?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is intentional. The model name to be passed to vertex ai is "qwen/qwen3-coder-480b-a35b-instruct-maas".

https://console.cloud.google.com/vertex-ai/publishers/qwen/model-garden/qwen3-coder-480b-a35b-instruct-maas

ENDPOINT=us-south1-aiplatform.googleapis.com
REGION=us-south1
PROJECT_ID="YOUR_PROJECT_ID"

curl \
  -X POST \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
  -H "Content-Type: application/json" https://${ENDPOINT}/v1/projects/${PROJECT_ID}/locations/${REGION}/endpoints/openapi/chat/completions \
  -d '{"model":"qwen/qwen3-coder-480b-a35b-instruct-maas", "stream":true, "messages":[{"role": "user", "content": "Write a quick sort algorithm"}]}'

supportsPromptCache: false,
inputPrice: 1,
outputPrice: 4,
description: "Qwen 3 Coder 480B Instruct",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we make the description more detailed to match the pattern of other models? Consider:

Suggested change
description: "Qwen 3 Coder 480B Instruct",
description: "Qwen 3 Coder 480B Instruct model, 262K context.",

This would be consistent with the Llama model description above which includes the context window size.

@hannesrudolph hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Sep 6, 2025
@diwakar-s-maurya
Copy link
Author

@ssweens
Copy link
Contributor

ssweens commented Sep 7, 2025

Note: There's a more comprehensive PR that includes this model and others already pending: PR #7727

@diwakar-s-maurya
Copy link
Author

diwakar-s-maurya commented Sep 7, 2025

Agree. #7727 is superset of this PR.

@mrubens
Copy link
Collaborator

mrubens commented Sep 7, 2025

Closed in favor of #7727

@mrubens mrubens closed this Sep 7, 2025
@github-project-automation github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Sep 7, 2025
@github-project-automation github-project-automation bot moved this from New to Done in Roo Code Roadmap Sep 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. size:S This PR changes 10-29 lines, ignoring generated files.

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

4 participants