Skip to content
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
55 changes: 55 additions & 0 deletions packages/types/src/providers/vertex.ts
Original file line number Diff line number Diff line change
Expand Up @@ -294,6 +294,60 @@ export const vertexModels = {
outputPrice: 1.15,
description: "Meta Llama 4 Maverick 17B Instruct model, 128K context.",
},
"gpt-oss-120b-maas": {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The "-maas" suffix pattern is interesting. Could we add a comment explaining that these are Model-as-a-Service offerings? This would help future maintainers understand the naming convention.

maxTokens: 8192,
contextWindow: 128_000,
supportsImages: false,
supportsPromptCache: false,
inputPrice: 2.0,
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the decimal format for pricing intentional here? I notice existing models use integers (15, 75) while these new models use decimals (2.0, 8.0). Would it be better to maintain consistency by using 2 instead of 2.0?

outputPrice: 8.0,
description: "GPT OSS 120B model via Model-as-a-Service.",
},
"gpt-oss-20b-maas": {
maxTokens: 8192,
contextWindow: 128_000,
supportsImages: false,
supportsPromptCache: false,
inputPrice: 0.5,
outputPrice: 2.0,
description: "GPT OSS 20B model via Model-as-a-Service.",
},
"deepseek-r1-0528-maas": {
maxTokens: 8192,
contextWindow: 128_000,
supportsImages: false,
supportsPromptCache: false,
inputPrice: 1.0,
outputPrice: 4.0,
description: "DeepSeek R1 0528 model via Model-as-a-Service.",
},
"deepseek-v3.1-maas": {
maxTokens: 8192,
contextWindow: 128_000,
supportsImages: false,
supportsPromptCache: false,
inputPrice: 0.8,
outputPrice: 3.2,
description: "DeepSeek V3.1 model via Model-as-a-Service.",
},
"qwen3-coder-480b-a35b-instruct-maas": {
maxTokens: 8192,
contextWindow: 128_000,
supportsImages: false,
supportsPromptCache: false,
inputPrice: 3.0,
outputPrice: 12.0,
description: "Qwen3 Coder 480B Instruct model via Model-as-a-Service.",
},
"qwen3-235b-a22b-instruct-2507-maas": {
maxTokens: 8192,
contextWindow: 128_000,
supportsImages: false,
supportsPromptCache: false,
inputPrice: 2.5,
outputPrice: 10.0,
description: "Qwen3 235B Instruct model via Model-as-a-Service.",
},
} as const satisfies Record<string, ModelInfo>

export const VERTEX_REGIONS = [
Expand All @@ -306,6 +360,7 @@ export const VERTEX_REGIONS = [
{ value: "us-west2", label: "us-west2" },
{ value: "us-west3", label: "us-west3" },
{ value: "us-west4", label: "us-west4" },
{ value: "us-south1", label: "us-south1" },
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice attention to detail placing this alphabetically! The ordering is correct between us-west4 and northamerica-northeast1. 👍

{ value: "northamerica-northeast1", label: "northamerica-northeast1" },
{ value: "northamerica-northeast2", label: "northamerica-northeast2" },
{ value: "southamerica-east1", label: "southamerica-east1" },
Expand Down
Loading