Skip to content

Commit 5007743

Browse files
authored
Update llama-3.3-70b-instruct.mdx
1 parent bc23306 commit 5007743

File tree

1 file changed

+5
-4
lines changed

1 file changed

+5
-4
lines changed

pages/managed-inference/reference-content/llama-3.3-70b-instruct.mdx

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -19,8 +19,8 @@ categories:
1919
|-----------------|------------------------------------|
2020
| Provider | [Meta](https://www.llama.com/) |
2121
| License | [Llama 3.3 community](https://www.llama.com/llama3_3/license/) |
22-
| Compatible Instances | H100-2 (BF16) |
23-
| Context length | Up to 70k tokens |
22+
| Compatible Instances | H100 (FP8), H100-2 (FP8, BF16) |
23+
| Context length | Up to 131k tokens |
2424

2525
## Model names
2626

@@ -32,7 +32,8 @@ meta/llama-3.3-70b-instruct:bf16
3232

3333
| Instance type | Max context length |
3434
| ------------- |-------------|
35-
| H100-2 | 62k (BF16) |
35+
| H100 | 15k (FP8) |
36+
| H100-2 | 131k (FP8), 62k (BF16) |
3637

3738
## Model introduction
3839

@@ -76,4 +77,4 @@ Process the output data according to your application's needs. The response will
7677

7778
<Message type="note">
7879
Despite efforts for accuracy, the possibility of generated text containing inaccuracies or [hallucinations](/managed-inference/concepts/#hallucinations) exists. Always verify the content generated independently.
79-
</Message>
80+
</Message>

0 commit comments

Comments
 (0)