Skip to content

[NEW] custom cuda kernels with agents #3277

Merged
burtenshaw merged 9 commits intomainfrom
custom-cuda-agent
Feb 13, 2026
Merged

[NEW] custom cuda kernels with agents #3277
burtenshaw merged 9 commits intomainfrom
custom-cuda-agent

Conversation

@burtenshaw
Copy link
Collaborator

This blog post requires this PR on kernels: huggingface/kernels#278

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a very bold thing to do for a thumbnail 😆

Copy link
Contributor

@ariG23498 ariG23498 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

- user: evalstate
---

<!-- TODO: PLEASE ADD YOURSELF TO THE AUTHORS LIST -->
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's remove this line?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like to leave this in until merging so that colleagues feel comfortable taking their (deserved) authorship.


# Custom Kernels for All from Codex and Claude

![oprah custom cuda kernels](assets/custom-cuda-kernels/meme.png)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want this in the blog post as well?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I feel like it sets the tone and it's kinda lost as a pure thumbnail.

Copy link
Member

@pcuenca pcuenca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Super cool.

Important (in case you missed): entry still not present in _blog.yml.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

custom-cuda-kernels.md ?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was seo maxing. lol


## Conclusion

We built an agent skill that teaches coding agents how to write production CUDA kernels. Then we pointed Claude and Codex at two real targets: a **diffusers** pipeline and a **transformers** model. The agents produced working kernels for both, with correct PyTorch bindings and benchmarks, end to end. We benchmarked the kernels and found that the optimized kernels can provide a speedup in both isolated and end-to-end performance.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔥

Copy link
Member

@pcuenca pcuenca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reminder: _blog.yml

@burtenshaw burtenshaw merged commit 1432c01 into main Feb 13, 2026
1 check passed
@burtenshaw burtenshaw deleted the custom-cuda-agent branch February 13, 2026 13:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants