-
Notifications
You must be signed in to change notification settings - Fork 676
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: improve numerical stability of Gumbel sampling
run-ci
#2438
opened Jan 29, 2026 by
ixlmar
Loading…
4 of 5 tasks
refactor: refactoring cuda code to cute-dsl (part 1)
#2428
opened Jan 28, 2026 by
yzh119
Loading…
5 tasks
refactor: reduce hopper's gdn prefill compilation time and fix docstring.
#2422
opened Jan 27, 2026 by
yzh119
Loading…
5 tasks
infra: add manual code owner override support in codeowner_analyzer.py
#2418
opened Jan 26, 2026 by
sricketts
Loading…
3 tasks done
[CI]: Enable Blackwell & Hopper in public CI testing
#2413
opened Jan 24, 2026 by
yongwww
Loading…
5 tasks
chore: update benchmark scripts; fix trtllm-gen moe comments
#2412
opened Jan 24, 2026 by
IwakuraRein
Loading…
5 tasks done
Add/update multi node/multi GPU test scripts
#2410
opened Jan 23, 2026 by
dierksen
Loading…
3 of 5 tasks
feat: cuteDSL fp4 moe for better DSR1 performance.
#2398
opened Jan 22, 2026 by
nv-yunzheq
Loading…
5 tasks
feat: add per-request generator support for sampling kernels
#2345
opened Jan 13, 2026 by
yzh119
Loading…
feat: add LSE return support to TRT LLM attention kernels
#2332
opened Jan 11, 2026 by
yzh119
Loading…
feat: expose swizzled_input_sf parameter for CUTLASS fused MOE
#2330
opened Jan 11, 2026 by
yzh119
Loading…
[wip] feat: add bias support to TGV and CUTLASS BF16 GEMM
#2329
opened Jan 11, 2026 by
yzh119
Loading…
feat: add batch_invariant option to trtllm decode functions
#2321
opened Jan 9, 2026 by
yzh119
Loading…
feat: Support Fused MoE non gated Relu2 NVFP4 & FP8 and support Nemotron
run-ci
#2304
opened Jan 7, 2026 by
amitz-nv
Loading…
3 of 5 tasks
[Perf][Feature] Add SM103-specific schedulers for NVFP4 CUTLASS kernels
v0.6.2
#2303
opened Jan 7, 2026 by
LopezCastroRoberto
Loading…
chore: Update XFails Report
automated
maintenance
testing
#2287
opened Jan 5, 2026 by
flashinfer-bot
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.