Commit c956c95

Add more model configs (#13)
* add model configs for summit models
* add more models for the summit
* add more models
* add more models
* clean up
* set values for num_fewshot and tensor-parallel-size
* update client.yml
* pull Derek's PRs
* fix model path
* add and update model configs
* fix typo and set tensor-parallel-size to 1 for base config
* fix a typo and add 3 more models added this morning
1 parent 60705c6 commit c956c95

105 files changed, +710 -20 lines changed

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# https://huggingface.co/Qwen/Qwen2.5-7B-Instruct
+model: "Qwen/Qwen2.5-7B-Instruct"
+chat_template: true
Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+# https://huggingface.co/Qwen/Qwen2.5-7B-Instruct
+model: "Qwen/Qwen2.5-7B-Instruct"
+trust-remote-code: true
+add-bos-token: false
+tensor-parallel-size: 1
+max-model-len: 8192
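
Note: the hyphenated keys in this server config resemble command-line options. The commit does not show how the file is consumed, so the following is only a minimal sketch, assuming the keys are flattened into `--key value` arguments for a serving process. The helpers `parse_flat_config` and `to_cli_args` are hypothetical, the parser handles only the flat `key: value` lines seen here (not general YAML), and dropping `false` booleans is an assumption about the target CLI.

```python
# Sketch only: hypothetical helpers, not part of this repository.

SERVER_YML = """\
# https://huggingface.co/Qwen/Qwen2.5-7B-Instruct
model: "Qwen/Qwen2.5-7B-Instruct"
trust-remote-code: true
add-bos-token: false
tensor-parallel-size: 1
max-model-len: 8192
"""


def parse_flat_config(text):
    """Parse flat `key: value` lines, skipping blank lines and `#` comments."""
    config = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition(":")
        key = key.strip()
        value = value.strip().strip('"')
        if value in ("true", "false"):
            config[key] = value == "true"
        elif value.isdigit():
            config[key] = int(value)
        else:
            config[key] = value
    return config


def to_cli_args(config):
    """Turn a config dict into a list of `--key value` style arguments."""
    args = []
    for key, value in config.items():
        flag = f"--{key}"
        if value is True:
            args.append(flag)  # boolean switch, no value
        elif value is False:
            continue  # assumption: a false switch is simply omitted
        else:
            args.extend([flag, str(value)])
    return args


if __name__ == "__main__":
    print(to_cli_args(parse_flat_config(SERVER_YML)))
```

With the config above, `add-bos-token: false` produces no argument, while `trust-remote-code: true` becomes a bare `--trust-remote-code` switch. Whether omitting a false value matches the intended behavior depends on the defaults of whatever consumes the file.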

Qwen/Qwen2.5-7B-Instruct/storage.yml

Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# https://huggingface.co/Qwen/Qwen2.5-7B-Instruct
+model: hf
+data: hf
Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+# llm-eval-test configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic
+model: "RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic"
+chat_template: true
+fewshot_as_multiturn: true
Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+# server configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic
+model: "RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic"
+trust-remote-code: true
+enable-chunked-prefill: true
+tensor-parallel-size: 1
+max-model-len: 4096
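
Note: the commit message mentions fixing a model path and typos, and each client and server config repeats the model ID in both the header comment URL and the `model:` value. As a minimal sketch, a check along the following lines could catch a mismatch between the two. The function `header_matches_model` is hypothetical and not part of this repository, and the check is not meant for the storage configs, where `model:` is `hf` rather than a repository ID.

```python
import re

# Matches the repository path that follows the Hugging Face host in a comment.
HF_URL = re.compile(r"https://huggingface\.co/(\S+)")

SERVER_YML = """\
# server configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic
model: "RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic"
trust-remote-code: true
enable-chunked-prefill: true
tensor-parallel-size: 1
max-model-len: 4096
"""


def header_matches_model(text):
    """Return True if the repo in the first comment URL equals the `model:` value."""
    url_repo = None
    model = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("#"):
            match = HF_URL.search(line)
            if match and url_repo is None:
                url_repo = match.group(1)
        elif line.startswith("model:"):
            # Split on the first colon only; strip whitespace and quotes.
            model = line.split(":", 1)[1].strip().strip('"')
    return url_repo is not None and url_repo == model


if __name__ == "__main__":
    print(header_matches_model(SERVER_YML))
```

The comparison is an exact string match, so a differing suffix such as `FP8-dynamic` versus `FP8-Dynamic` would be reported as a mismatch.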
Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# storage configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic
+model: hf
+data: hf
Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+# llm-eval-test configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16
+model: "RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16"
+chat_template: true
+fewshot_as_multiturn: true
+num_fewshot: 5
Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+# server configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16
+model: "RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16"
+trust-remote-code: true
+enable-chunked-prefill: true
+tensor-parallel-size: 1
+max-model-len: 4096
Lines changed: 3 additions & 0 deletions
@@ -0,0 +1,3 @@
+# storage configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16
+model: hf
+data: hf
Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+# llm-eval-test configs for https://huggingface.co/RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8
+model: "RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8"
+chat_template: true
+fewshot_as_multiturn: true
