Update llama-3.1-8b-instruct.mdx

fpagny · web-flow · commit 285f294a5701 · 2025-02-10T11:49:28.000+01:00
diff --git a/pages/managed-inference/reference-content/llama-3.1-8b-instruct.mdx b/pages/managed-inference/reference-content/llama-3.1-8b-instruct.mdx
@@ -19,7 +19,7 @@ categories:
 |-----------------|------------------------------------|
 | Provider        | [Meta](https://llama.meta.com/llama3/)  |
 | License        | [Llama 3.1 community](https://llama.meta.com/llama3_1/license/)  |
-| Compatible Instances | L4, H100, H100-2 (FP8, BF16) |
+| Compatible Instances | L4, L40S, H100, H100-2 (FP8, BF16) |
 | Context Length | up to 128k tokens |
 
 ## Model names
@@ -34,6 +34,7 @@ meta/llama-3.1-8b-instruct:bf16
 | Instance type  | Max context length |
 | ------------- |-------------|
 | L4      | 96k (FP8), 27k (BF16) | 
+| L40S    | 96k (FP8), 27k (BF16) | 
 | H100      | 128k (FP8, BF16)
 | H100-2      | 128k (FP8, BF16)
 
@@ -82,4 +83,4 @@ Process the output data according to your application's needs. The response will
 
 <Message type="note">
   Despite efforts for accuracy, the possibility of generated text containing inaccuracies or [hallucinations](/managed-inference/concepts/#hallucinations) exists. Always verify the content generated independently.
-</Message>
+</Message>