Commit 10cba70

feat(aot): add nvshmem module for aot compilation (#1261)
## 📌 Description

This change integrates the `gen_nvshmem_module` into the Ahead-of-Time (AOT) build process. By including this module, users can now pre-compile the NVSHMEM-based communication kernels, which is necessary for using this backend in environments where just-in-time compilation is not available or desired.

## 🔍 Related Issues

Fixes #1260

## 🚀 Pull Request Checklist

Thank you for contributing to FlashInfer! Before we review your pull request, please make sure the following items are complete.

### ✅ Pre-commit Checks

- [x] I have installed `pre-commit` by running `pip install pre-commit` (or used your preferred method).
- [x] I have installed the hooks with `pre-commit install`.
- [x] I have run the hooks manually with `pre-commit run --all-files` and fixed any reported issues.

> If you are unsure about how to set up `pre-commit`, see [the pre-commit documentation](https://pre-commit.com/).

## 🧪 Tests

- [ ] Tests have been added or updated as needed.
- [ ] All tests are passing (`unittest`, etc.).

## Reviewer Notes

<!-- Optional: anything you'd like reviewers to focus on, concerns, etc. -->

Signed-off-by: Emilien Macchi <[email protected]>
1 parent 85d75ca commit 10cba70

1 file changed, +2 -0 lines changed

flashinfer/aot.py

Lines changed: 2 additions & 0 deletions
@@ -11,6 +11,7 @@
 
 from .activation import act_func_def_str, gen_act_and_mul_module
 from .cascade import gen_cascade_module
+from .comm.nvshmem import gen_nvshmem_module
 from .fp4_quantization import gen_fp4_quantization_sm100_module
 from .fused_moe import gen_cutlass_fused_moe_sm100_module
 from .gemm import gen_gemm_module, gen_gemm_sm90_module, gen_gemm_sm100_module
@@ -380,6 +381,7 @@ def gen_all_modules(
     jit_specs += [
         gen_cascade_module(),
         gen_norm_module(),
+        gen_nvshmem_module(),
         gen_page_module(),
         gen_quantization_module(),
         gen_rope_module(),
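
The description says that including the module in the AOT build is what makes the NVSHMEM kernels pre-compilable. As a rough illustration of that pattern only, here is a minimal, self-contained sketch in which a build step compiles exactly the specs that `gen_all_modules` returns, so a generator missing from the list is never built ahead of time. The `JitSpec` dataclass, the source file names, the `build_aot` helper, and the stubbed compile function are all hypothetical stand-ins, not FlashInfer's actual API.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class JitSpec:
    """Hypothetical stand-in for a description of one compilable module."""
    name: str
    sources: Tuple[str, ...]


def gen_cascade_module() -> JitSpec:
    # Hypothetical source file name, for illustration only.
    return JitSpec("cascade", ("cascade.cu",))


def gen_nvshmem_module() -> JitSpec:
    # Hypothetical source file name, for illustration only.
    return JitSpec("nvshmem", ("nvshmem_binding.cu",))


def gen_all_modules() -> List[JitSpec]:
    # Only generators listed here are visible to the AOT build.
    jit_specs: List[JitSpec] = []
    jit_specs += [
        gen_cascade_module(),
        gen_nvshmem_module(),
    ]
    return jit_specs


def build_aot(specs: List[JitSpec], compile_fn: Callable[[JitSpec], str]) -> List[str]:
    # Compile every registered spec ahead of time; compile_fn is a stub here.
    return [compile_fn(spec) for spec in specs]


built = build_aot(gen_all_modules(), lambda spec: spec.name + ".so")
```

In this sketch, removing `gen_nvshmem_module()` from the list would leave `built` without an `nvshmem` artifact, which mirrors the situation the commit addresses.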
