Aoti CUDA runtime libraries #14731

larryliu0820 · 2025-10-01T22:56:51Z

This pull request adds a CUDA backend to the ExecutorTorch project, enabling Ahead-Of-Time Inductor (AOTI) model execution on CUDA devices. The changes include new build rules, backend implementation, and a test runner for CUDA, as well as minor related improvements. The most important changes are summarized below.

CUDA Backend Implementation:

Added a new C++ implementation for the CUDA backend, including dynamic loading of AOTI CUDA model containers, tensor management between CPU and GPU, and execution logic. (backends/cuda/runtime/cuda_backend.cpp)
Introduced a CMake build file for the CUDA backend, which sets up the necessary libraries, dependencies, and installation rules. (backends/cuda/CMakeLists.txt)
Added a test runner executable for the CUDA backend, allowing end-to-end testing of model loading and execution. (backends/cuda/tests/voxtral_runner.cpp)

Build System Integration:

Updated the top-level CMakeLists.txt to conditionally build the CUDA backend and its dependencies when EXECUTORCH_BUILD_CUDA is enabled, and to register the backend for use. (CMakeLists.txt)
Modified the executor runner build logic to include the flat tensor extension library if built, improving modularity. (CMakeLists.txt)

Supporting Changes:

Added a type alias for AOTITensorHandle in the AOTI model container header to improve code clarity and type safety. (backends/aoti/aoti_model_container.h)
Minor Python code fixes and deduplication in the CUDA backend Python module, including resolving a merge conflict and removing redundant enum definitions. (backends/cuda/cuda_backend.py) [1] [2]

pytorch-bot · 2025-10-01T22:56:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14731

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-10-01T22:57:30Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

…into aoti_runtime

github-actions · 2025-12-07T00:51:35Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

larryliu0820 added 2 commits October 1, 2025 15:55

Make it work

74e7ffa

ET AOTI CUDA runtime libraries

4222fe6

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 1, 2025

larryliu0820 added 10 commits October 1, 2025 22:39

Resize tensor

c838eee

Make Voxtral work

366c763

Fix merge conflict

a7ecb6f

Update

d1b21a0

Make it work

514832a

ET AOTI CUDA runtime libraries

ab58b48

Resize tensor

5152cf9

Make Voxtral work

1cdbd61

Fix merge conflict

a6ee575

Update

c73b059

larryliu0820 force-pushed the aoti_runtime branch from d1b21a0 to c73b059 Compare October 6, 2025 19:22

Gasoonjia added 4 commits October 6, 2025 12:26

Merge branch 'aoti_runtime' of https://github.com/pytorch/executorch …

c245360

…into aoti_runtime

add cudagurad and cudastreamguard support

9605b9d

add cudagurad and cudastreamguard support

e349cce

make voxtral runner exits nice and neat

57060e9

github-actions bot added the stale PRs inactive for over 60 days label Dec 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Aoti CUDA runtime libraries #14731

Aoti CUDA runtime libraries #14731

Uh oh!

larryliu0820 commented Oct 1, 2025

Uh oh!

pytorch-bot bot commented Oct 1, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 1, 2025

Uh oh!

github-actions bot commented Dec 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Aoti CUDA runtime libraries #14731

Are you sure you want to change the base?

Aoti CUDA runtime libraries #14731

Uh oh!

Conversation

larryliu0820 commented Oct 1, 2025

Uh oh!

pytorch-bot bot commented Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14731

Uh oh!

github-actions bot commented Oct 1, 2025

This PR needs a release notes: label

Uh oh!

github-actions bot commented Dec 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pytorch-bot bot commented Oct 1, 2025 •

edited

Loading

This PR needs a `release notes:` label