
Feature: add CLI option for torch's built-in matmul precision on supported graphics cards #413

Open

AeneasTews wants to merge 2 commits into jwohlwend:main from AeneasTews:main

Conversation

@AeneasTews

When using a supported card with a preview build of PyTorch (currently tested on version 2.8.0.dev20250616+cu128 with an NVIDIA 5070 Ti), PyTorch reports that the float32 matmul precision setting is available; using high or medium instead of the highest setting drastically improves runtimes, by up to 100%. This commit adds a command-line option to toggle this based on user preference; the default is highest. Keeping the default at highest should not cause any compatibility issues, as it matches the current behavior.

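For illustration, here is a minimal sketch of how such an option could be wired up, assuming a Click-based CLI; the option name and surrounding command are placeholders, not the exact code in this PR:

```python
import click
import torch

@click.command()
@click.option(
    "--matmul-precision",
    type=click.Choice(["highest", "high", "medium"]),
    default="highest",  # matches PyTorch's current default, so no behavior change
    help="float32 matmul precision; high/medium enable TF32 on supported GPUs.",
)
def predict(matmul_precision: str) -> None:
    # Must run before any float32 matmuls are dispatched.
    torch.set_float32_matmul_precision(matmul_precision)
    ...  # rest of the prediction pipeline
```
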
@xavierholt

Here's another vote for this! I almost made the same PR, but then I saw that someone beat me to it... I've edited this manually in the past, and I've seen a significant speedup going from highest to high (and no appreciable difference between high and medium). This was for Boltz1 on A100s, if I recall correctly.

It might also be worth checking the warnings filter as part of this. There's a call to filterwarnings() just above the call to set_float32_matmul_precision() that looks like it should hide the warning below, but it's still showing up.

You are using a CUDA device ('NVIDIA A100-SXM4-80GB') that has Tensor Cores. To properly utilize them, you should set `torch.set_float32_matmul_precision('medium' | 'high')` which will trade-off precision for performance. For more details, read https://pytorch.org/docs/stable/generated/torch.set_float32_matmul_precision.html#torch.set_float32_matmul_precision
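One plausible explanation, sketched below under the assumption that the message goes through Python's warnings module: filterwarnings() matches its message pattern as a regex against the *start* of the warning text, so a pattern written for a substring in the middle never fires unless it begins with ".*". The filter also has to be installed before the warning is first emitted.

```python
import warnings

# Sketch only: the message argument is a regex matched from the beginning of
# the warning text. message="Tensor Cores" would never match; a leading ".*"
# lets the pattern match mid-string.
warnings.filterwarnings(
    "ignore",
    message=r".*Tensor Cores.*",
)
```
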

@jwohlwend
Owner

We've found in the past that using high or medium can hurt performance, so I'm not super eager to incentivize users to do this. It's not just a question of card compatibility.

@xavierholt

@jwohlwend I don't see a problem if it's done in the way this PR does it: keep the default at highest, but add a command line option so that people who know what they're doing (which hopefully translates to "people who read the documentation and run benchmarks") can get results cheaper and faster, if their systems support it. Unless there's an accuracy problem when running with lower precision that I'm unaware of?

The main thing that would incentivize people to drop the accuracy level is the big "you're not fully using your GPU" warning message that PyTorch prints out, but that's not touched by this PR. As I mentioned above, it seems like there's code to suppress that message, but I still see it, even with the latest release.

@jwohlwend
Owner

No, that's my point: we've observed accuracy issues with TF32.
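
To make the trade-off concrete, here is a small hypothetical repro (not from the Boltz test suite) of the precision gap; it assumes an Ampere-or-newer NVIDIA GPU, where high enables TF32, which keeps roughly 10 mantissa bits versus float32's 23:

```python
import torch

a = torch.randn(1024, 1024, device="cuda")
b = torch.randn(1024, 1024, device="cuda")

torch.set_float32_matmul_precision("highest")  # full float32 matmul
ref = a @ b

torch.set_float32_matmul_precision("high")  # allow TF32 (~10 mantissa bits)
approx = a @ b

# Elementwise error is typically on the order of 1e-3 to 1e-2 at this scale;
# whether that matters depends on how errors accumulate through the model.
print((ref - approx).abs().max().item())
```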

@xavierholt

Oh! I thought you meant "performance" in the time/efficiency sense. If there are accuracy problems then I agree that this is a lot more dubious.

@AeneasTews
Author

@jwohlwend Thank you very much for your responses. Would you be able to share the tests you performed to determine the accuracy deterioration at the different precision levels? Thanks for your help, and best regards!

