
add fma arg to disable compiling with FMA #839

Open
edoyango wants to merge 1 commit into NVIDIA:main from edoyango:fma-flag


@edoyango edoyango commented Mar 24, 2026

This PR adapts my changes in numba/numba#10487 for this repo.

Closes #118 and numba/numba#2834, and partially addresses numba/numba#7719.

This PR adds an fma flag to the cuda.jit decorator; its value is passed on to the NVVM options.

Setting this flag to False prevents the compiler from reordering operations to introduce FMAs (fused multiply-adds). This is very important when checking bitwise reproducibility, e.g. when comparing CPU and GPU results.
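To illustrate why FMA contraction matters for bitwise comparisons, here is a minimal CPU-only sketch in plain Python. It does not use Numba or CUDA. The helper names `unfused_multiply_add` and `fused_multiply_add` are hypothetical, and the fused version is emulated with exact rational arithmetic rather than a hardware FMA instruction. The inputs are chosen so that the single rounding of the fused form and the two roundings of the unfused form give different results.

```python
from fractions import Fraction


def unfused_multiply_add(a: float, b: float, c: float) -> float:
    """Compute a * b + c with two roundings: one after the multiply, one after the add."""
    product = a * b  # rounded to the nearest double
    return product + c  # rounded again


def fused_multiply_add(a: float, b: float, c: float) -> float:
    """Emulate an FMA: evaluate a * b + c exactly, then round once to a double."""
    exact = Fraction(a) * Fraction(b) + Fraction(c)  # exact rational arithmetic
    return float(exact)  # single, correctly rounded conversion


# a * b is exactly 1 - 2**-60, which is not representable as a double
# and rounds to 1.0 when the multiply is performed on its own.
a = 1.0 + 2.0**-30
b = 1.0 - 2.0**-30
c = -1.0

unfused = unfused_multiply_add(a, b, c)  # 0.0
fused = fused_multiply_add(a, b, c)      # -2**-60

print("unfused:", unfused)
print("fused:  ", fused)
print("bitwise equal:", unfused == fused)
```

A CPU reference that evaluates the multiply and the add separately would produce the unfused value here, while a GPU kernel compiled with FMA contraction enabled may produce the fused one. With the change proposed in this PR, decorating the kernel with `@cuda.jit(fma=False)` is intended to keep the GPU on the unfused path.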


copy-pr-bot bot commented Mar 24, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.



Development

Successfully merging this pull request may close these issues.

[FEA] NVIDIA nvvm defaults to fma=1 and wish there was a way to tell it fma=0
