Skip to content

Commit 901b1e3

Browse files
authored
clean up and update client.yml (#33)
* clean up and update client.yml * update chat-template setting based on discussions * update client config based on discussion * add one more model
1 parent d4170e4 commit 901b1e3

File tree

37 files changed

+4
-139
lines changed
  • Qwen/Qwen2.5-7B-Instruct/accuracy
  • RedHatAI
    • Llama-3.3-70B-Instruct-FP8-dynamic/accuracy
    • Llama-3.3-70B-Instruct-quantized.w4a16/accuracy
    • Llama-3.3-70B-Instruct-quantized.w8a8/accuracy
    • Llama-4-Scout-17B-16E-Instruct-FP8-dynamic/accuracy
    • Llama-4-Scout-17B-16E-Instruct-quantized.w4a16
    • Meta-Llama-3.1-8B-Instruct-FP8-dynamic/accuracy
    • Meta-Llama-3.1-8B-Instruct-quantized.w4a16/accuracy
    • Meta-Llama-3.1-8B-Instruct-quantized.w8a8/accuracy
    • Mistral-Small-24B-Instruct-2501-FP8-Dynamic/accuracy
    • Mistral-Small-24B-Instruct-2501-quantized.w4a16/accuracy
    • Mistral-Small-24B-Instruct-2501-quantized.w8a8/accuracy
    • Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic/accuracy
    • Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16/accuracy
    • Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8/accuracy
    • Qwen2.5-7B-Instruct-FP8-dynamic/accuracy
    • Qwen2.5-7B-Instruct-quantized.w4a16/accuracy
    • Qwen2.5-7B-Instruct-quantized.w8a8/accuracy
    • Qwen2.5-7B-quantized.w4a16/accuracy
    • granite-3.1-8b-instruct-FP8-dynamic/accuracy
    • granite-3.1-8b-instruct-quantized.w4a16/accuracy
    • granite-3.1-8b-instruct-quantized.w8a8/accuracy
    • phi-4-FP8-dynamic/accuracy
    • phi-4-quantized.w4a16/accuracy
    • phi-4-quantized.w8a8/accuracy
  • common/accuracy
  • ibm-granite/granite-3.1-8b-instruct/accuracy
  • meta-llama
  • microsoft/phi-4/accuracy
  • mistralai

37 files changed

+4
-139
lines changed

Qwen/Qwen2.5-7B-Instruct/accuracy/client.yml

Lines changed: 0 additions & 3 deletions
This file was deleted.

RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic/accuracy/client.yml

Lines changed: 0 additions & 4 deletions
This file was deleted.

RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16/accuracy/client.yml

Lines changed: 0 additions & 5 deletions
This file was deleted.

RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8/accuracy/client.yml

Lines changed: 0 additions & 4 deletions
This file was deleted.

RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic/accuracy/client.yml

Lines changed: 0 additions & 3 deletions
This file was deleted.
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
# storage configs for https://huggingface.co/RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16
2+
model: hf
3+
data: hf

RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic/accuracy/client.yml

Lines changed: 0 additions & 4 deletions
This file was deleted.

RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16/accuracy/client.yml

Lines changed: 0 additions & 4 deletions
This file was deleted.

RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8/accuracy/client.yml

Lines changed: 0 additions & 4 deletions
This file was deleted.

RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-Dynamic/accuracy/client.yml

Lines changed: 0 additions & 4 deletions
This file was deleted.

0 commit comments

Comments
 (0)