We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent c919bd8 commit 1d0b9f9Copy full SHA for 1d0b9f9
accuracy/Meta-Llama-3.1-8B-Instruct/server.yml
@@ -2,4 +2,5 @@
2
model: "meta-llama/Meta-Llama-3.1-8B-Instruct"
3
trust-remote-code: true
4
enable-chunked-prefill: true
5
+tensor-parallel-size:
6
max-model-len: 4096
0 commit comments