Skip to content

Commit e8ffa93

Browse files
committed
Update blog
1 parent cfb143e commit e8ffa93

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

content/blog/2025-10-10-1760088945.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,7 @@ tags:
1111
Some notes on machine-learning compilers, gathered while researching tech for Easy Diffusion's next engine (i.e. sdkit v3). For context, see the [design constraints](https://cmdr2.github.io/notes/2025/10/1760085894/) of the new engine.
1212

1313
## tl;dr summary
14+
1415
Caveat: This analysis could change in the future.
1516

1617
The current state is:
@@ -22,6 +23,7 @@ The focus of cross-vendor compilers seems to be either on datacenter hardware, o
2223
The idea of a cross-vendor ML compiler is clearly awesome, and I think this is the way things should go. But we're not there yet for desktops/laptops, in terms of runtime performance.
2324

2425
## What's an ML compiler?
26+
2527
The basic idea of an ML compiler is to consider an ML model's execution graph as a program to compile, and produce an optimized set of GPU-specific instructions. The compiler can optimize the execution graph by doing things like fusing operations together, parallelizing operations when possible, and even mapping groups of operators to GPU-specific instructions. It can use its knowledge of the target GPU for optimizing the memory layout and parallelism of operations. Basically what compilers already do for CPUs today, but for GPUs.
2628

2729
We already have a decent graph format: ONNX. Every model that I intend to support has ONNX exports available (and it's easy to export one, for new models).
@@ -34,6 +36,7 @@ We already have a decent graph format: ONNX. Every model that I intend to suppor
3436
* [https://www.modular.com/blog/democratizing-ai-compute-part-6-what-about-ai-compilers](https://www.modular.com/blog/democratizing-ai-compute-part-6-what-about-ai-compilers)
3537

3638
## ML compiler projects
39+
3740
Cross-vendor ML compilers:
3841
- [XLA](https://openxla.org/xla), 2017 (the first major ML compiler)
3942
- [Apache TVM](https://tvm.apache.org/), 2019

0 commit comments

Comments
 (0)