Commit dad54ca

angelayi authored and pytorchmergebot committed
Add mistral/gpt-oss to benchmarks (pytorch#163565)
Potential issues
* gpt-oss-20b is probably too big (I can't run on my devserver)
* Mistral requires HF authentication
* Mistral also takes a while to run the performance checks (need to wait for CI)

Pull Request resolved: pytorch#163565
Approved by: https://github.com/huydhn
1 parent 2c5a3d7 commit dad54ca
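
Since the commit message notes that Mistral requires HF authentication, a run without credentials may want to skip that model instead of failing on it. The following is a minimal sketch of such a filter, not code from this commit: `GATED_MODELS` and `runnable_models` are hypothetical names, and it only assumes that an access token is supplied through the `HF_TOKEN` environment variable, which `huggingface_hub` reads.

```python
import os

# Hypothetical set: the one model the commit message says needs HF authentication.
GATED_MODELS = {"mistralai/Mistral-7B-Instruct-v0.3"}


def runnable_models(models, env=None):
    """Return the models that can be attempted with the credentials available.

    `env` defaults to os.environ. Gated models are kept only when a non-empty
    HF_TOKEN is present (assumption: the token is passed via that variable).
    """
    env = os.environ if env is None else env
    has_token = bool(env.get("HF_TOKEN"))
    return [m for m in models if has_token or m not in GATED_MODELS]


if __name__ == "__main__":
    candidates = [
        "Qwen/Qwen3-0.6B",
        "mistralai/Mistral-7B-Instruct-v0.3",
        "openai/gpt-oss-20b",
    ]
    print(runnable_models(candidates))
```

Passing `env` explicitly keeps the helper easy to exercise without touching the real environment.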

15 files changed, 94 additions and 0 deletions

benchmarks/dynamo/check_accuracy.py

Lines changed: 2 additions & 0 deletions
@@ -78,6 +78,8 @@ def check_accuracy(actual_csv, expected_csv, expected_filename):
 "google/gemma-3-4b-it",
 "openai/whisper-tiny",
 "Qwen/Qwen3-0.6B",
+"mistralai/Mistral-7B-Instruct-v0.3",
+"openai/gpt-oss-20b",
 }
 )
 

benchmarks/dynamo/check_graph_breaks.py

Lines changed: 2 additions & 0 deletions
@@ -61,6 +61,8 @@ def check_graph_breaks(actual_csv, expected_csv, expected_filename):
 "google/gemma-3-4b-it",
 "openai/whisper-tiny",
 "Qwen/Qwen3-0.6B",
+"mistralai/Mistral-7B-Instruct-v0.3",
+"openai/gpt-oss-20b",
 }
 )
 

benchmarks/dynamo/ci_expected_accuracy/aot_eager_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -191,3 +191,11 @@ openai/whisper-tiny,pass,0
 
 
 Qwen/Qwen3-0.6B,pass,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,pass,0
+
+
+
+openai/gpt-oss-20b,pass,0
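
To illustrate how rows like the ones added above could be consumed, here is a minimal sketch that parses an expected-results fragment and reports models whose actual status differs. It is not the logic in `check_accuracy.py`; `load_expected` and `mismatches` are hypothetical helpers, and the column meaning (model name, status, graph-break count) is an assumption inferred from the rows shown.

```python
import csv

# Fragment copied from the rows in this diff; blank lines separate the entries.
EXPECTED = """Qwen/Qwen3-0.6B,pass,0



mistralai/Mistral-7B-Instruct-v0.3,pass,0



openai/gpt-oss-20b,pass,0
"""


def load_expected(text):
    """Parse 'model,status,count' rows into {model: (status, count)}."""
    lines = [line for line in text.splitlines() if line.strip()]  # drop blank lines
    rows = {}
    for row in csv.reader(lines):
        if len(row) != 3 or not row[2].strip().isdigit():
            continue  # tolerate a header row or malformed line, if any
        name, status, count = (cell.strip() for cell in row)
        rows[name] = (status, int(count))
    return rows


def mismatches(expected, actual):
    """Return {model: (expected_status, actual_status)} where the statuses differ."""
    diff = {}
    for name, (status, _count) in expected.items():
        got = actual.get(name, ("missing", 0))[0]
        if got != status:
            diff[name] = (status, got)
    return diff


if __name__ == "__main__":
    expected = load_expected(EXPECTED)
    actual = dict(expected)
    actual["openai/gpt-oss-20b"] = ("fail_to_run", 0)  # simulated CI result
    print(mismatches(expected, actual))
```

A model absent from the actual results is reported with the placeholder status `missing`, so a dropped row shows up as a mismatch instead of being silently ignored.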

benchmarks/dynamo/ci_expected_accuracy/aot_inductor_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -187,3 +187,11 @@ openai/whisper-tiny,fail_to_run,0
 
 
 Qwen/Qwen3-0.6B,fail_to_run,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,fail_to_run,0
+
+
+
+openai/gpt-oss-20b,fail_to_run,0

benchmarks/dynamo/ci_expected_accuracy/cpu_inductor_amp_freezing_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -191,3 +191,11 @@ openai/whisper-tiny,pass_due_to_skip,0
 
 
 Qwen/Qwen3-0.6B,pass_due_to_skip,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,pass_due_to_skip,0
+
+
+
+openai/gpt-oss-20b,pass_due_to_skip,0

benchmarks/dynamo/ci_expected_accuracy/cpu_inductor_freezing_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -191,3 +191,11 @@ openai/whisper-tiny,pass_due_to_skip,0
 
 
 Qwen/Qwen3-0.6B,pass_due_to_skip,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,pass_due_to_skip,0
+
+
+
+openai/gpt-oss-20b,pass_due_to_skip,0

benchmarks/dynamo/ci_expected_accuracy/cpu_inductor_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -191,3 +191,11 @@ openai/whisper-tiny,pass_due_to_skip,0
 
 
 Qwen/Qwen3-0.6B,pass_due_to_skip,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,pass_due_to_skip,0
+
+
+
+openai/gpt-oss-20b,pass_due_to_skip,0

benchmarks/dynamo/ci_expected_accuracy/dynamic_aot_eager_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -191,3 +191,11 @@ openai/whisper-tiny,pass,0
 
 
 Qwen/Qwen3-0.6B,pass,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,pass,0
+
+
+
+openai/gpt-oss-20b,pass,0

benchmarks/dynamo/ci_expected_accuracy/dynamic_cpu_inductor_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -191,3 +191,11 @@ openai/whisper-tiny,pass,0
 
 
 Qwen/Qwen3-0.6B,pass,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,pass,0
+
+
+
+openai/gpt-oss-20b,pass,0

benchmarks/dynamo/ci_expected_accuracy/dynamic_inductor_huggingface_inference.csv

Lines changed: 8 additions & 0 deletions
@@ -191,3 +191,11 @@ openai/whisper-tiny,pass,0
 
 
 Qwen/Qwen3-0.6B,pass,0
+
+
+
+mistralai/Mistral-7B-Instruct-v0.3,pass,0
+
+
+
+openai/gpt-oss-20b,pass,0
