Commit 7373829

Authored by Akshat-Tripathi, pgmpablo157321, v-shobhit, github-actions[bot], and nathanwasson
Added Wan2.2-T2V to submission checker (mlcommons#2457)
* Added Wan2.2-T2V to submission checker
* Updated acc_pattern
* Add accuracy_sample_count (mlcommons#2414)
* add accuracy_sample_count
* cap count to QSL->TotalSampleCount()
* [Automated Commit] Format Codebase
* empty commit to re-trigger test
* [Automated Commit] Format Codebase
* rm newline
* rm test05 lines
* add accuracy_sample_count to submission_checker
* [Automated Commit] Format Codebase
* fix check
* [Automated Commit] Format Codebase
* empty commit to trigger test
* add gpt-oss-120b loadgen settings to mlperf.conf
* Remove Rclone download instructions for datasets supported by R2 Downloader (mlcommons#2358)
* Remove Rclone instructions from README.md
* Remove Rclone download instructions from README.md
* Tweak README.md
* Switch from Rclone to R2 Downloader in README.md
* Switch from Rclone to R2 Downloader in README.md
* Switch from Rclone to R2 Downloader in README.md
* Switch Rclone for R2 Downloader in README.md
* Switch Rclone for R2 Downloader in README.md
* Use r2 downloader for gpt j model download (mlcommons#2365)
* Provide r2 download commands for mixtral model and datasets (mlcommons#2364)
* Replace MLCFlow RClone command for criteo dataset with R2 (mlcommons#2363)
* Deprecate MLCFlow rclone download command with r2 (mlcommons#2362)
* Add instruction to download DeepSeek model through MLCflow (mlcommons#2361)
* [Automated Commit] Format Codebase
* Trigger cla-check
* [Automated Commit] Format Codebase
* Update build_wheels.yml
* [Automated Commit] Format Codebase
* Add dtypes to README.md

---------

Co-authored-by: ANANDHU S <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Arjun Suresh <[email protected]>
Co-authored-by: Pablo Gonzalez <[email protected]>
Co-authored-by: Pablo Gonzalez <[email protected]>

* empty commit to trigger test
* Update seeds for inference v6.0 (mlcommons#2437)
* use total_sample_count for default of accuracy_sample_count
* revert changes to .github
* Add accuracy_sample_count to modularized submission checker
* [Automated Commit] Format Codebase
* empty commit to trigger test

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Nathan Wasson <[email protected]>
Co-authored-by: ANANDHU S <[email protected]>
Co-authored-by: Arjun Suresh <[email protected]>
Co-authored-by: Pablo Gonzalez <[email protected]>
Co-authored-by: Pablo Gonzalez <[email protected]>

* Increment version to 6.0.6
* Change dataset access link to new URL (mlcommons#2458): Updated dataset access link in README.md.
* MLCFlow automation: Add model and dataset download commands (mlcommons#2455)
  Co-authored-by: Arjun Suresh <[email protected]>
* Update v6.0 benchmark table (mlcommons#2460)
  Co-authored-by: hanyunfan <[email protected]>
* Temporary turn off retinanet GitHub action (mlcommons#2467)
* [DLRMv3] Add compliance test (mlcommons#2453)
* compliance test for dlrmv3
* add test08
* update
* update
* squash changes

---------

Co-authored-by: hanyunfan <[email protected]>

* Fix PyTorch version issue for Llama2-70b and Mixtral-8x7b (mlcommons#2466): Update torch installation from unavailable nightly build (torch==2.2.0.dev20231006+cpu) to stable release (torch==2.2.0) which is available on the PyTorch CPU wheel index.
  Co-authored-by: Arjun Suresh <[email protected]>
* Add yolo run commands to inference docs (mlcommons#2461)
* Quick fix: Add user conf to yolo reference implementation (mlcommons#2446)
* Quick fix: Add user conf to yolo reference implementation
* Add model_name to yolo implementation
* Fix default name
* Fix logging for YOLO benchmark (mlcommons#2450)
* [Automated Commit] Format Codebase
* fix formatting
* Remove duplicate log tracing argument: Removed duplicate argument for enabling log tracing.
* Add performance_sample_count_override for yolo
* Update version check and filter scenarios for 6.0
* Remove min_query_count - interfering with runs: Remove minimum query count requirement for performance mode.
* Update models from submission checker constants (mlcommons#2464): Removed 'stable-diffusion-xl' and 'dlrm-v3' from scenarios.
* Generate final report: Update filter scenarios for version 6.0 (mlcommons#2465)
* Generate final report: Update filter scenarios for version 6.0
* Update mlperf.conf

---------

Co-authored-by: ANANDHU S <[email protected]>
Co-authored-by: hanyunfan <[email protected]>
Co-authored-by: Arjun Suresh <[email protected]>

* Increment version to 6.0.7

---------

Co-authored-by: Pablo Gonzalez <[email protected]>
Co-authored-by: v-shobhit <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Nathan Wasson <[email protected]>
Co-authored-by: ANANDHU S <[email protected]>
Co-authored-by: Arjun Suresh <[email protected]>
Co-authored-by: Pablo Gonzalez <[email protected]>
Co-authored-by: pgmpablo157321 <[email protected]>
Co-authored-by: hanyunfan <[email protected]>
Co-authored-by: Linjian Ma <[email protected]>
Co-authored-by: SamareshSingh <[email protected]>
Co-authored-by: arjunsuresh <[email protected]>
Co-authored-by: Miro <[email protected]>
1 parent 6520e39 commit 7373829

File tree

2 files changed (+15 / -6 lines changed)


tools/submission/submission_checker/constants.py

Lines changed: 9 additions & 1 deletion
@@ -39,6 +39,7 @@
     "deepseek-r1": ["Offline"],
     "gpt-oss-120b": ["Offline"],
     "qwen3-vl-235b-a22b": ["Server", "Offline"],
+    "wan-2.2-t2v-a14b": ["Offline", "SingleStream"],
 },
 "optional-scenarios-datacenter": {
     "llama2-70b-99": ["Interactive", "Server"],
@@ -188,6 +189,7 @@
     "dlrm-v3": ("AUC", 78.663 * 0.99),  # TODO: Placeholder for now
     "yolo-95": ("mAP", 53.4 * 0.95),
     "yolo-99": ("mAP", 53.4 * 0.99),
+    "wan-2.2-t2v-a14b": ("vbench_score", 70.48 * 0.99),
 },
 "accuracy-upper-limit": {
     "stable-diffusion-xl": (
@@ -232,6 +234,7 @@
     # TODO: Need to add accuracy sample count checkers as well (4395)
     "gpt-oss-120b": 6396,
     "qwen3-vl-235b-a22b": 48289,
+    "wan-2.2-t2v-a14b": 247,
     "dlrm-v3": 34996,
     "yolo-95": 5000,
     "yolo-99": 5000,
@@ -262,6 +265,7 @@
     # TODO: Need to add accuracy sample count checkers as well (4395)
     "gpt-oss-120b": 6396,
     "qwen3-vl-235b-a22b": 48289,
+    "wan-2.2-t2v-a14b": 247,
     "dlrm-v3": 34996,
     "yolo-95": 1525,
     "yolo-99": 1525,
@@ -338,6 +342,7 @@
     "gpt-oss-120b": {"SingleStream": 1024, "Server": 270336, "Offline": 1},
     "qwen3-vl-235b-a22b": {"SingleStream": 1024, "Server": 270336, "Offline": 1},
     "dlrm-v3": {"Server": 270336, "Offline": 1},
+    "wan-2.2-t2v-a14b": {"SingleStream": 247, "Offline": 1},
     "yolo-95": {"SingleStream": 1024, "MultiStream": 270336, "Offline": 1},
     "yolo-99": {"SingleStream": 1024, "MultiStream": 270336, "Offline": 1},
 },
@@ -354,13 +359,15 @@
     "rgat",
     "pointpainting",
     "whisper",
+    "wan-2.2-t2v-a14b",
     "yolo-99",
     "yolo-95",
 ],
 "models_TEST04": [
     "resnet",
     "stable-diffusion-xl",
     "pointpainting",
+    "wan-2.2-t2v-a14b"
 ],
 "models_TEST06": [
     "llama2-70b-99",
@@ -1378,7 +1385,8 @@
     "FID_SCORE": r".*'FID_SCORE':\s+'?([\d.]+).*",
     "gsm8k_accuracy": r".*'gsm8k':\s([\d.]+).*",
     "mbxp_accuracy": r".*'mbxp':\s([\d.]+).*",
-    "exact_match": r".*'exact_match':\s([\d.]+).*"
+    "exact_match": r".*'exact_match':\s([\d.]+).*",
+    "vbench_score": r".*'vbench_score':\s([\d.]+).*",
 }

 SYSTEM_DESC_REQUIRED_FIELDS = [
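The new "vbench_score" entry follows the same acc_pattern convention as the other metrics in the table: a regex applied to the accuracy output to extract the score, which is then compared against the target (metric value * 0.99). A minimal sketch of that extraction, assuming a hypothetical log line in the same dict-repr style the patterns expect (the line and score below are illustrative, not from a real run):

```python
import re

# The vbench_score pattern added in this commit (constants.py).
VBENCH_PATTERN = r".*'vbench_score':\s([\d.]+).*"

# Hypothetical accuracy-log line in the style the checker parses.
log_line = "{'vbench_score': 70.12, 'samples': 247}"

match = re.match(VBENCH_PATTERN, log_line)
score = float(match.group(1)) if match else None

# Accuracy target per the acc-target table: 70.48 * 0.99.
target = 70.48 * 0.99
print(score, score is not None and score >= target)
```

Note the pattern captures only digits and dots, so it tolerates whatever trails the score on the line.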

tools/submission/submission_checker_old.py

Lines changed: 6 additions & 5 deletions
@@ -1,9 +1,7 @@
 """A checker for MLPerf Inference submissions from v5.0 onwards (for checking older submissions please use the submission checker from the respective release)
 """

-from __future__ import division
-from __future__ import print_function
-from __future__ import unicode_literals
+from __future__ import division, print_function, unicode_literals

 import argparse
 import datetime
@@ -12,7 +10,6 @@
 import os
 import re
 import sys
-
 from glob import glob

 from log_parser import MLPerfLog
@@ -67,7 +64,7 @@
     "deepseek-r1": ["Offline"],
     "gpt-oss-120b": ["Offline"],
     "qwen3-vl-235b-a22b": ["Server", "Offline"],
-    "dlrm-v3": ["Server", "Offline"],
+    "wan-2.2-t2v-a14b": ["Offline", "SingleStream"],
 },
 "optional-scenarios-datacenter": {
     "llama2-70b-99": ["Interactive", "Server"],
@@ -216,6 +213,7 @@
     "dlrm-v3": ("AUC", 78.663 * 0.99),  # TODO: Placeholder for now
     "yolo-95": ("mAP", 53.4 * 0.95),
     "yolo-99": ("mAP", 53.4 * 0.99),
+    "wan-2.2-t2v-a14b": ("vbench", 70.48 * 0.99),
 },
 "accuracy-upper-limit": {
     "stable-diffusion-xl": (
@@ -259,6 +257,7 @@
     "whisper": 1633,
     "gpt-oss-120b": 6396,
     "qwen3-vl-235b-a22b": 48289,
+    "wan-2.2-t2v-a14b": 247,
     "dlrm-v3": 34996,
     "yolo-95": 5000,
     "yolo-99": 5000,
@@ -288,6 +287,7 @@
     "whisper": 1633,
     "gpt-oss-120b": 4395,
     "qwen3-vl-235b-a22b": 48289,
+    "wan-2.2-t2v-a14b": 247,
     "dlrm-v3": 34996,
     "yolo-95": 1525,
     "yolo-99": 1525,
@@ -366,6 +366,7 @@
     "dlrm-v3": {"Server": 270336, "Offline": 1},
     "yolo-95": {"SingleStream": 1024, "MultiStream": 270336, "Offline": 1},
     "yolo-99": {"SingleStream": 1024, "MultiStream": 270336, "Offline": 1},
+    "wan-2.2-t2v-a14b": {"SingleStream": 247, "Offline": 1}
 },
 },
 "v5.1": {
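The required-scenarios tables edited in both files drive a simple membership check: a submission for a model must cover every scenario listed for it, and anything missing is flagged. A hedged sketch of that check, with keys and values mirroring the diff (the helper name is illustrative, not the checker's real API, and the real checker also handles divisions, categories, and optional scenarios):

```python
# Required-scenario entries mirroring the diff above.
REQUIRED_SCENARIOS_DATACENTER = {
    "deepseek-r1": ["Offline"],
    "gpt-oss-120b": ["Offline"],
    "qwen3-vl-235b-a22b": ["Server", "Offline"],
    "wan-2.2-t2v-a14b": ["Offline", "SingleStream"],
}

def missing_scenarios(model, submitted):
    """Return the required scenarios a submission did not cover.

    `submitted` is any collection of scenario names found in the
    submission's results directory. Illustrative helper only.
    """
    required = REQUIRED_SCENARIOS_DATACENTER.get(model, [])
    return [s for s in required if s not in submitted]

# A wan-2.2-t2v-a14b submission with only Offline is incomplete.
print(missing_scenarios("wan-2.2-t2v-a14b", {"Offline"}))  # ['SingleStream']
```

This is why the commit also swaps "dlrm-v3" out of required-scenarios in the old checker: a model's entry in this dict is what makes its scenarios mandatory.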
