[pull] main from NVIDIA:main#132

Merged
pull[bot] merged 8 commits into LarryXFly:main from NVIDIA:main on Jun 26, 2025

pull[bot] commented Jun 26, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

QiJune and others added 8 commits June 26, 2025 07:18
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: Xianjie <5410381+qiaoxj07@users.noreply.github.com>
Signed-off-by: Hui Gao <huig@nvidia.com>
…ning configs. (#5485)

A seq_len of 4096 causes an as-yet-unexplained CUDA illegal memory access when the test runs consecutively with some other tests.
Apply a saturating upper bound to any sequence length larger than that.
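
For illustration only, a saturating upper bound can be sketched as a simple clamp. This is not the code from the commit: the helper name `saturate_seq_len` and the cap value `MAX_TUNING_SEQ_LEN` are hypothetical, and the commit message does not state the actual bound.

```python
# Hypothetical cap: the commit message does not give the real value.
MAX_TUNING_SEQ_LEN = 2048


def saturate_seq_len(seq_len: int, upper_bound: int = MAX_TUNING_SEQ_LEN) -> int:
    """Return seq_len unchanged if it is within the bound, else the bound.

    Every sequence length above upper_bound collapses to upper_bound,
    so larger values never reach the code path that uses the result.
    """
    if seq_len < 0:
        raise ValueError("seq_len must be non-negative")
    return min(seq_len, upper_bound)
```

With this sketch, `saturate_seq_len(4096)` yields the cap, while a shorter length such as 512 passes through unchanged.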
Signed-off-by: Mingyang Jiang <13463932+jmydurant@users.noreply.github.com>
Signed-off-by: Christina Zhang <83400082+ChristinaZ@users.noreply.github.com>
…tention_dp=True-cuda_graph=True-overlap_scheduler=True-torch_compile=False]` (#5494)

Signed-off-by: Venky <23023424+venkywonka@users.noreply.github.com>
pull[bot] added the ⤵️ pull label Jun 26, 2025
pull[bot] merged commit 32d1573 into LarryXFly:main Jun 26, 2025

8 participants