[NEW] custom cuda kernels with agents #3277
Merged
burtenshaw merged 9 commits intomainfrom Feb 13, 2026
Merged
Conversation
ariG23498
reviewed
Feb 12, 2026
Contributor
There was a problem hiding this comment.
This is a very bold thing to do for a thumbnail 😆
ariG23498
approved these changes
Feb 12, 2026
| - user: evalstate | ||
| --- | ||
|
|
||
| <!-- TODO: PLEASE ADD YOURSELF TO THE AUTHORS LIST --> |
Collaborator
Author
There was a problem hiding this comment.
I like to leave this in until merging so that colleagues feel comfortable taking their (deserved) authorship.
|
|
||
| # Custom Kernels for All from Codex and Claude | ||
|
|
||
|  |
Contributor
There was a problem hiding this comment.
Do we want this in the blog post as well?
Collaborator
Author
There was a problem hiding this comment.
Yeah, I feel like it sets the tone and it's kinda lost as a pure thumbnail.
pcuenca
reviewed
Feb 12, 2026
Member
pcuenca
left a comment
There was a problem hiding this comment.
Super cool.
Important (in case you missed): entry still not present in _blog.yml.
Collaborator
Author
There was a problem hiding this comment.
I was seo maxing. lol
pcuenca
reviewed
Feb 13, 2026
|
|
||
| ## Conclusion | ||
|
|
||
| We built an agent skill that teaches coding agents how to write production CUDA kernels. Then we pointed Claude and Codex at two real targets: a **diffusers** pipeline and a **transformers** model. The agents produced working kernels for both, with correct PyTorch bindings and benchmarks, end to end. We benchmarked the kernels and found that the optimized kernels can provide a speedup in both isolated and end-to-end performance. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This blog post requires this PR on kernels: huggingface/kernels#278