1 parent b53c7c0 commit 978ef90
blog/2025-07-20-k2-large-scale-ep.md
@@ -195,8 +195,8 @@ Note: The prefill-to-decode ratio is workload-dependent. We prioritized decode n
| Metric | Value |
| --- | --- |
-| **Prefill Throughput** | **896k tokens/sec** |
-| **Decode Throughput** | **384k tokens/sec** |
+| **Prefill Throughput** | **224k tokens/sec (4 P Nodes)** |
+| **Decode Throughput** | **288k tokens/sec (12 D Nodes)** |
| **Cost per 1M Output Tokens** | **~$0.21**(**H200 $2.3/hour**) |
---
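
Review note: the arithmetic behind the updated table rows can be sanity-checked with a short sketch. It rests on two assumptions that the diff does not state: each node has 8 H200 GPUs, and the cost figure counts only the 12 decode nodes. The names `GPUS_PER_NODE`, `per_node_throughput`, and `cost_per_million_output_tokens` are hypothetical helpers introduced for illustration.

```python
# Sanity check of the table values changed in this hunk.
# ASSUMPTIONS (not stated in the diff): 8 H200 GPUs per node, and the
# cost row is computed over the 12 decode nodes only.
GPUS_PER_NODE = 8           # assumption
GPU_PRICE_PER_HOUR = 2.3    # USD per H200-hour, taken from the table


def per_node_throughput(total_tokens_per_sec, nodes):
    """Average tokens/sec contributed by one node."""
    return total_tokens_per_sec / nodes


def cost_per_million_output_tokens(decode_tokens_per_sec, decode_nodes,
                                   gpus_per_node=GPUS_PER_NODE,
                                   gpu_price_per_hour=GPU_PRICE_PER_HOUR):
    """USD per 1M decoded tokens, charging only the decode GPUs."""
    hourly_cost = decode_nodes * gpus_per_node * gpu_price_per_hour
    millions_of_tokens_per_hour = decode_tokens_per_sec * 3600 / 1_000_000
    return hourly_cost / millions_of_tokens_per_hour


prefill_per_node = per_node_throughput(224_000, 4)    # 4 P nodes
decode_per_node = per_node_throughput(288_000, 12)    # 12 D nodes
cost = cost_per_million_output_tokens(288_000, 12)

print(f"prefill per node: {prefill_per_node:.0f} tokens/sec")
print(f"decode per node:  {decode_per_node:.0f} tokens/sec")
print(f"cost per 1M output tokens: ${cost:.3f}")
```

Under these assumptions the cost comes out near $0.21, which agrees with the unchanged cost row. The removed values (896k and 384k) equal the per-node rates multiplied by 16, which is consistent with the old rows having been scaled to 16 nodes each, though the diff itself does not say so.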