File tree Expand file tree Collapse file tree 1 file changed +4
-3
lines changed
pages/managed-inference/reference-content Expand file tree Collapse file tree 1 file changed +4
-3
lines changed Original file line number Diff line number Diff line change @@ -19,8 +19,8 @@ categories:
1919| -----------------| ------------------------------------|
2020| Provider | [ Deepseek] ( https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B ) |
2121| License | [ MIT] ( https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md ) |
22- | Compatible Instances | H100-2 (BF16) |
23- | Context Length | up to 56k tokens |
22+ | Compatible Instances | H100 (FP8), H100 -2 (FP8, BF16) |
23+ | Context Length | up to 131k tokens |
2424
2525## Model names
2626
@@ -32,7 +32,8 @@ deepseek/deepseek-r1-distill-llama-70b:bf16
3232
3333| Instance type | Max context length |
3434| ------------- | -------------|
35- | H100-2 | 56k (BF16) |
35+ | H100 | 15k (FP8) |
36+ | H100-2 | 131k (FP8), 56k (BF16) |
3637
3738## Model introduction
3839
You can’t perform that action at this time.
0 commit comments