I'm an undergraduate CS student at Johns Hopkins University with a strong interest in ML Systems, GPU programming, and LLM inference optimization.
I focus on understanding how ML models run efficiently on modern hardware, from attention kernels and KV cache management to compiler-level and system-level optimizations.
Most of my learning is driven by hands-on implementation, self-study, and technical writing.
Blog (self-study & technical write-ups): https://minseoc03.github.io
GitHub: You're already here
- ML Systems & Inference Infrastructure
- GPU Programming (CUDA, Triton)
- Attention Kernels & KV Cache Optimization
- Compiler & IR-level Optimization (LLVM / MLIR)
- Multi-GPU & Performance Engineering
- Python
- C / C++
- CUDA (learning & experimenting)
- PyTorch
- Triton
- LLVM
- Optuna
- Hydra
- scikit-learn
- NumPy, Pandas, OpenCV, Matplotlib
- VS Code
- Visual Studio
- Git / GitHub
- Linux (Ubuntu / Arch-based)
I value:
- Learning by implementation
- Reading papers → reproducing ideas → analyzing performance
- Writing to clarify understanding
- Iterating fast based on feedback
- Email: cfi3288@gmail.com
- LinkedIn: https://www.linkedin.com/in/minseoc03/
- Blog: https://minseoc03.github.io
