You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For the most up-to-date pricing, visit the [Baseten Model APIs page](https://www.baseten.co/products/model-apis/).
43
+
For the complete, up-to-date model list and pricing, see [Baseten's Model APIs page](https://www.baseten.co/products/model-apis/).
57
44
58
45
---
59
46
@@ -73,50 +60,6 @@ To use the `moonshotai/Kimi-K2-Thinking` model, you must enable native tool call
73
60
74
61
---
75
62
76
-
## Production-First Architecture
77
-
78
-
Baseten's Model APIs are built for production environments with several key advantages:
79
-
80
-
### Enterprise-Grade Reliability
81
-
82
-
-**Four nines of uptime** (99.99%) through active-active redundancy
83
-
-**Cloud-agnostic, multi-cluster autoscaling** for consistent availability
84
-
-**SOC 2 Type II certified** and **HIPAA compliant** for security requirements
85
-
86
-
### Optimized Performance
87
-
88
-
-**Pre-optimized models** shipped with the Baseten Inference Stack
89
-
-**Latest-generation GPUs** with multi-cloud infrastructure
90
-
-**Ultra-fast inference** optimized from the bottom up for production workloads
91
-
92
-
### Cost Efficiency
93
-
94
-
-**5-10x less expensive** than closed alternatives
95
-
-**Optimized multi-cloud infrastructure** for efficient resource utilization
96
-
-**Transparent pricing** with no hidden costs or rate limit surprises
97
-
98
-
### Developer Experience
99
-
100
-
-**OpenAI compatible API** - migrate by swapping a single URL
101
-
-**Drop-in replacement** for closed models with comprehensive observability and analytics
102
-
-**Seamless scaling** from Model APIs to dedicated deployments
103
-
104
-
---
105
-
106
-
## Special Features
107
-
108
-
### Function Calling & Tool Use
109
-
110
-
All Baseten models support structured outputs, function calling, and tool use as part of the Baseten Inference Stack, making them ideal for agentic applications and coding workflows.
111
-
112
-
---
113
-
114
63
## Tips and Notes
115
64
116
-
-**Static Model List:** Roo Code uses a curated list of Baseten models. The default model is `zai-org/GLM-4.6`.
117
-
118
-
-**Multi-Cloud Capacity Management (MCM):** Baseten's multi-cloud infrastructure ensures high availability and low latency globally.
119
-
120
-
-**Support:** Baseten provides dedicated support for production deployments and can work with you on dedicated resources as you scale.
121
-
122
-
-**Pricing:** Current pricing is highly competitive and transparent. Prices typically range from $0.10-$6.00 per million tokens, making Baseten significantly more cost-effective than many closed-model alternatives while providing access to state-of-the-art open-source models.
65
+
-**Pricing:** See the [Baseten Model APIs page](https://www.baseten.co/products/model-apis/) for current pricing information.
0 commit comments