forked from NVIDIA/TensorRT-LLM
-
Notifications
You must be signed in to change notification settings - Fork 2
Pull requests: nv-auto-deploy/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][feat] Add AutoDeploy custom model for GLM-5 (glm_moe_dsa)
#246
opened Mar 13, 2026 by
suyoggupta
Loading…
[None][feat] Add Gemma3n model + Shared-kv attention support in AutoDeploy
#244
opened Mar 13, 2026 by
bmarimuthu-nv
Loading…
[None][feat] AutoDeploy: update the test list
#236
opened Mar 12, 2026 by
nvchenghaoz
Loading…
1 task
[None][revert] Revert "Add AutoDeploy custom model for Starcoder2 (#2…
#217
opened Mar 12, 2026 by
govind-ramnarayan
•
Draft
1 task
[None][feat] AutoDeploy: add support for mixtral/pixtral
#212
opened Mar 11, 2026 by
nvchenghaoz
Loading…
1 task
[None][feat] Add AD custom models for Jamba (Mamba v1 + Attention hybrid)
#200
opened Mar 11, 2026 by
govind-ramnarayan
Loading…
5 tasks
[AutoDeploy] onboarding allenai/OLMo-2-0325-32B-DPO
#186
opened Mar 10, 2026 by
taylor-yb-lee
Loading…
1 task
Gramnarayan/load superv3 mtp head rebased
#178
opened Feb 26, 2026 by
govind-ramnarayan
•
Draft
1 task
[TRTLLM-11567][feat] Manual config sharding support for Gated DeltaNet layer
#175
opened Feb 19, 2026 by
greg-kwasniewski1
Loading…
1 task done
Add fused Triton MoE routing kernel with pattern matcher transform
#173
opened Feb 17, 2026 by
suyoggupta
Loading…
1 task
[TRTLLM-11486][feat] GatedDeltaNet sharding
#171
opened Feb 12, 2026 by
greg-kwasniewski1
Loading…
1 task done
Optimize MOE export by tracing with reduced experts and expanding graph
#170
opened Feb 11, 2026 by
suyoggupta
•
Draft
1 task
Spec Dec: Fix Overlap Scheduler Garbage Output
#168
opened Feb 1, 2026 by
govind-ramnarayan
•
Draft
1 task done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.