Add A6000 benchmark integration tests #1616

Jackmin801 · 2026-01-19T23:46:35Z

Add integration tests that validate benchmark metrics (TPS, step time, MFU, peak memory) against baselines for A6000 GPU configurations.

Tests include:

test_benchmark_no_regression: Runs benchmarks and checks for regression
test_baseline_exists: Validates baseline files exist and are well-formed

The tests use a 5% regression threshold (consistent with CI workflow) and cover all 6 A6000 benchmark configurations:

Qwen3-0.6B RL Full (16384, 65536 seq_len)
Qwen3-0.6B RL LoRA r=16 (16384, 65536 seq_len)
Qwen3-0.6B SFT Full (8192 seq_len)
Qwen3-4B-Instruct-2507 RL LoRA r=16 (16384 seq_len)

Add integration tests that validate benchmark metrics (TPS, step time, MFU, peak memory) against baselines for A6000 GPU configurations. Tests include: - test_benchmark_no_regression: Runs benchmarks and checks for regression - test_baseline_exists: Validates baseline files exist and are well-formed The tests use a 5% regression threshold (consistent with CI workflow) and cover all 6 A6000 benchmark configurations: - Qwen3-0.6B RL Full (16384, 65536 seq_len) - Qwen3-0.6B RL LoRA r=16 (16384, 65536 seq_len) - Qwen3-0.6B SFT Full (8192 seq_len) - Qwen3-4B-Instruct-2507 RL LoRA r=16 (16384 seq_len)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add A6000 benchmark integration tests #1616

Add A6000 benchmark integration tests #1616

Uh oh!

Jackmin801 commented Jan 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add A6000 benchmark integration tests #1616

Are you sure you want to change the base?

Add A6000 benchmark integration tests #1616

Uh oh!

Conversation

Jackmin801 commented Jan 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants