Skip to content

Commit 285f294

Browse files
authored
Update llama-3.1-8b-instruct.mdx
1 parent fb62ad2 commit 285f294

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

pages/managed-inference/reference-content/llama-3.1-8b-instruct.mdx

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ categories:
1919
|-----------------|------------------------------------|
2020
| Provider | [Meta](https://llama.meta.com/llama3/) |
2121
| License | [Llama 3.1 community](https://llama.meta.com/llama3_1/license/) |
22-
| Compatible Instances | L4, H100, H100-2 (FP8, BF16) |
22+
| Compatible Instances | L4, L40S, H100, H100-2 (FP8, BF16) |
2323
| Context Length | up to 128k tokens |
2424

2525
## Model names
@@ -34,6 +34,7 @@ meta/llama-3.1-8b-instruct:bf16
3434
| Instance type | Max context length |
3535
| ------------- |-------------|
3636
| L4 | 96k (FP8), 27k (BF16) |
37+
| L40S | 96k (FP8), 27k (BF16) |
3738
| H100 | 128k (FP8, BF16)
3839
| H100-2 | 128k (FP8, BF16)
3940

@@ -82,4 +83,4 @@ Process the output data according to your application's needs. The response will
8283

8384
<Message type="note">
8485
Despite efforts for accuracy, the possibility of generated text containing inaccuracies or [hallucinations](/managed-inference/concepts/#hallucinations) exists. Always verify the content generated independently.
85-
</Message>
86+
</Message>

0 commit comments

Comments
 (0)