Commit f4e1420

ci: add blackwell unittest scripts (#1372)
<!-- .github/pull_request_template.md -->

## 📌 Description

Initial setup of blackwell unittest scripts

* scripts/run_test_blackwell_attention_kernels.sh for moe/gemm/attention/utils/etc
* scripts/task_test_multi_node_comm_kernels.sh for multi-node communication kernels
* scripts/task_test_single_node_comm_kernels.sh for single-node communication kernels

## 🔍 Related Issues

<!-- Link any related issues here -->

## 🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

### ✅ Pre-commit Checks

- [x] I have installed `pre-commit` by running `pip install pre-commit` (or used your preferred method).
- [x] I have installed the hooks with `pre-commit install`.
- [x] I have run the hooks manually with `pre-commit run --all-files` and fixed any reported issues.

> If you are unsure about how to set up `pre-commit`, see [the pre-commit documentation](https://pre-commit.com/).

## 🧪 Tests

- [ ] Tests have been added or updated as needed.
- [ ] All tests are passing (`unittest`, etc.).

## Reviewer Notes

<!-- Optional: anything you'd like reviewers to focus on, concerns, etc. -->
1 parent 22a62ea commit f4e1420

7 files changed, 86 additions and 0 deletions
Lines changed: 16 additions & 0 deletions
@@ -0,0 +1,16 @@
+#!/bin/bash
+
+set -eo pipefail
+set -x
+
+pytest -s tests/test_blackwell_fmha.py
+pytest -s tests/test_deepseek_mla.py
+
+# trtllm-gen
+pytest -s tests/test_trtllm_gen_context.py
+pytest -s tests/test_trtllm_gen_decode.py
+
+# cudnn
+pytest -s tests/test_cudnn_decode.py
+pytest -s tests/test_cudnn_prefill.py
+pytest -s tests/test_cudnn_prefill_deepseek.py
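
Note on `set -eo pipefail`: with these options the script above should stop at the first `pytest` invocation that exits non-zero, so later test files are not run. The following is a minimal sketch of that behavior, not part of the commit; `false` stands in for a failing `pytest` call, and the demo body runs in a child `bash -c` so the outer snippet can inspect the result.

```shell
#!/bin/bash
# Sketch only: "false" is a stand-in for a failing pytest invocation.

status=0
output=$(bash -c '
  set -eo pipefail
  echo "first"
  false
  echo "second"
') || status=$?

# "second" is never printed because set -e aborts at the failing command.
echo "output=${output} exit=${status}"

# pipefail makes a pipeline fail if any stage fails, not only the last one.
pipe_status=0
bash -c 'set -o pipefail; false | cat' || pipe_status=$?
echo "pipeline exit=${pipe_status}"
```

The same fail-fast behavior applies to every script in this commit that starts with `set -eo pipefail`.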
Lines changed: 8 additions & 0 deletions
@@ -0,0 +1,8 @@
+#!/bin/bash
+
+set -eo pipefail
+set -x
+
+pytest -s tests/test_mm_fp4.py
+pytest -s tests/test_groupwise_scaled_gemm_fp8.py
+pytest -s tests/test_groupwise_scaled_gemm_mxfp4.py
Lines changed: 7 additions & 0 deletions
@@ -0,0 +1,7 @@
+#!/bin/bash
+
+set -eo pipefail
+set -x
+
+pytest -s tests/test_trtllm_gen_fused_moe.py
+pytest -s tests/test_trtllm_cutlass_fused_moe.py
Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+#!/bin/bash
+
+# utils kernels
+pytest -s tests/test_fp4_quantize.py
Lines changed: 20 additions & 0 deletions
@@ -0,0 +1,20 @@
+#!/bin/bash
+
+set -eo pipefail
+set -x
+: ${MAX_JOBS:=$(nproc)}
+: ${CUDA_VISIBLE_DEVICES:=0}
+
+pip install -e . -v
+
+# run task_blackwell_utils_kernels.sh
+bash scripts/run_test_blackwell_utils_kernels.sh
+
+# run task_blackwell_attention_kernels.sh
+bash scripts/run_test_blackwell_attention_kernels.sh
+
+# gemm kernels
+bash scripts/run_test_blackwell_gemm_kernels.sh
+
+# moe kernels
+bash scripts/run_test_blackwell_moe_kernels.sh
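
Note on the `: ${VAR:=default}` lines: in bash this idiom assigns a default to a shell variable but does not export it. If the caller has not already exported `MAX_JOBS` or `CUDA_VISIBLE_DEVICES`, child processes such as `pip install` or `pytest` would probably not see the defaulted value. The sketch below illustrates this with a hypothetical variable `DEMO_JOBS` (not a name used by the commit); whether the scripts should add an `export` is left to the authors.

```shell
#!/bin/bash
set -eo pipefail

# DEMO_JOBS is a hypothetical name used only for this illustration.
unset DEMO_JOBS

# Same idiom as in the script: assign a default if unset or empty.
: "${DEMO_JOBS:=4}"

# The variable is set in this shell but not exported, so a child does not see it.
child_view=$(bash -c 'echo "${DEMO_JOBS:-unset}"')
echo "before export: child sees ${child_view}"

# After an explicit export, the child inherits the value.
export DEMO_JOBS
child_view_exported=$(bash -c 'echo "${DEMO_JOBS:-unset}"')
echo "after export: child sees ${child_view_exported}"
```

When a variable is already exported by the caller, for example `MAX_JOBS=8 bash <script>`, the default is skipped and the caller's value is inherited as usual.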
Lines changed: 11 additions & 0 deletions
@@ -0,0 +1,11 @@
+#!/bin/bash
+
+set -eo pipefail
+set -x
+: ${MAX_JOBS:=$(nproc)}
+: ${CUDA_VISIBLE_DEVICES:=0}
+
+pip install -e . -v
+
+pytest -s tests/test_mnnvl_memory.py
+pytest -s tests/test_trtllm_mnnvl_allreduce.py
Lines changed: 20 additions & 0 deletions
@@ -0,0 +1,20 @@
+#!/bin/bash
+
+set -eo pipefail
+set -x
+: ${MAX_JOBS:=$(nproc)}
+: ${CUDA_VISIBLE_DEVICES:=0}
+
+pip install -e . -v
+
+# vllm ar
+pytest -s tests/test_vllm_custom_allreduce.py
+# trtllm ar + fusion
+pytest -s tests/test_trtllm_allreduce.py
+pytest -s tests/test_trtllm_allreduce_fusion.py
+pytest -s tests/test_trtllm_cutlass_fused_moe.py
+pytest -s tests/test_trtllm_moe_allreduce_fusion.py
+pytest -s tests/test_trtllm_moe_allreduce_fusion_finalize.py
+# nvshmem ar
+pytest -s tests/test_nvshmem.py
+pytest -s tests/test_nvshmem_allreduce.py
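
Note on long test lists: because of `set -e`, a script like the one above reports only the first failing test file per run. If a complete list of failures is preferred in CI, one possible variation is to loop over the files and collect failures before exiting. This is a sketch under stated assumptions, not the commit's approach: `run_one` is a hypothetical stand-in for `pytest -s <file>`, and the file names are made up so the snippet runs without the repository.

```shell
#!/bin/bash
# Sketch only: no "set -e" here, so the loop continues past failures.

# Hypothetical stand-in for: pytest -s "$1"
# It fails for any path containing "fail" so the demo has one failure.
run_one() {
  case "$1" in
    *fail*) return 1 ;;
    *) return 0 ;;
  esac
}

# Made-up file names for illustration.
tests=(tests/demo_a.py tests/demo_fail.py tests/demo_b.py)

failed=()
for t in "${tests[@]}"; do
  if ! run_one "$t"; then
    failed+=("$t")
  fi
done

echo "failed files: ${#failed[@]}"
for t in "${failed[@]}"; do
  echo "  $t"
done

# A real script would end with a non-zero exit when failed is non-empty,
# for example: [ "${#failed[@]}" -eq 0 ] || exit 1
```

The trade-off is longer runs on broken builds in exchange for a full failure report.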
