Jj/cumsum nvfuserex opinfo tolerance #2586
base: main
Conversation
…race" This reverts commit 0354e9d. The promotion logic is also wrong; besides, having a type promotion alter the behavior seems like the wrong thing to do.
… original tests do run with un-promoted math
for more information, see https://pre-commit.ci
…ance' into jj/cumsum_nvfuserex_opinfo_tolerance
Numerics look pretty nasty for bf16/fp16.
@naoyam the test passed for me locally on your nvfuser branch.
Linking the related PR: NVIDIA/Fuser#5312
Could you include the link to the nvfuser PR in the comment?
Looks like Naoya added that right before your review. I added the link in the PR description.
nvfuserex's new codegen support for cumsum runs the math in reduced precision, as PyTorch does. This fails the OpInfo test, since the reference implementation uses double. Bumping the tolerance keeps CI happy.
Linking the related PR: NVIDIA/Fuser#5312
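To illustrate why the tolerance bump is needed, here is a minimal sketch (not the actual OpInfo test; it uses NumPy rather than the nvfuserex executor) showing how a cumulative sum accumulated in reduced precision drifts away from a float64 reference, so a tolerance calibrated for double-precision math fails spuriously:

```python
import numpy as np

# Hypothetical illustration: cumsum accumulates rounding error at every
# step, so running the accumulation in fp16 diverges from a float64
# reference far more than a single elementwise op would.
rng = np.random.default_rng(0)
x = rng.standard_normal(10_000)

ref = np.cumsum(x)  # float64 reference, like the OpInfo reference impl
lowp = np.cumsum(x.astype(np.float16), dtype=np.float16)  # reduced-precision math

abs_err = np.abs(lowp.astype(np.float64) - ref).max()
# abs_err is orders of magnitude larger than fp16 machine epsilon times
# the value scale, which is why the per-dtype tolerance must be bumped.
print(abs_err)
```

The error grows roughly with the running-sum magnitude times the low-precision epsilon, so longer sequences need proportionally looser tolerances for bf16/fp16.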