You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: add GLM-4.5 and OpenAI gpt-oss models to Fireworks provider
- Added GLM-4.5 (355B/32B active) and GLM-4.5-Air (106B/12B active) models from Z.ai
- Added gpt-oss-20b and gpt-oss-120b models from OpenAI
- All models configured with 128K context window
- Added comprehensive test coverage for all new models
Fixes#6753
"A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token from Deepseek. Note that fine-tuning for this model is only available through contacting fireworks at https://fireworks.ai/company/contact-us.",
60
64
},
65
+
"accounts/fireworks/models/glm-4-5": {
66
+
maxTokens: 16384,
67
+
contextWindow: 128000,
68
+
supportsImages: false,
69
+
supportsPromptCache: false,
70
+
inputPrice: 0.55,
71
+
outputPrice: 2.19,
72
+
description:
73
+
"Z.ai GLM-4.5 with 355B total parameters and 32B active parameters. Features unified reasoning, coding, and intelligent agent capabilities.",
74
+
},
75
+
"accounts/fireworks/models/glm-4-5-air": {
76
+
maxTokens: 16384,
77
+
contextWindow: 128000,
78
+
supportsImages: false,
79
+
supportsPromptCache: false,
80
+
inputPrice: 0.55,
81
+
outputPrice: 2.19,
82
+
description:
83
+
"Z.ai GLM-4.5-Air with 106B total parameters and 12B active parameters. Features unified reasoning, coding, and intelligent agent capabilities.",
84
+
},
85
+
"accounts/fireworks/models/gpt-oss-20b": {
86
+
maxTokens: 16384,
87
+
contextWindow: 128000,
88
+
supportsImages: false,
89
+
supportsPromptCache: false,
90
+
inputPrice: 0.07,
91
+
outputPrice: 0.3,
92
+
description:
93
+
"OpenAI gpt-oss-20b: Compact model for local/edge deployments. Optimized for low-latency and resource-constrained environments with chain-of-thought output, adjustable reasoning, and agentic workflows.",
94
+
},
95
+
"accounts/fireworks/models/gpt-oss-120b": {
96
+
maxTokens: 16384,
97
+
contextWindow: 128000,
98
+
supportsImages: false,
99
+
supportsPromptCache: false,
100
+
inputPrice: 0.15,
101
+
outputPrice: 0.6,
102
+
description:
103
+
"OpenAI gpt-oss-120b: Production-grade, general-purpose model that fits on a single H100 GPU. Features complex reasoning, configurable effort, full chain-of-thought transparency, and supports function calling, tool use, and structured outputs.",
0 commit comments