gemv
Here are 8 public repositories matching this topic...
PCCX is an open NPU architecture for memory-bound Transformer inference on edge FPGAs, focused on GEMM/GEMV, KV-cache, W4A8 quantization, and custom ISA scheduling.
Updated May 1, 2026 - SystemVerilog
Matilda is a library for repeatedly multiplying a constant matrix by a variable vector.
Updated Jan 26, 2026 - C++
Measure and visualize why LLM inference is slow: bottleneck analysis, model dissection, KV-cache, GEMM/GEMV, quantization, and memory-bound decoding.
Updated Apr 30, 2026 - HTML
🧮 CereMath is a library of machine learning kernels for the Cerebras Wafer Scale Engine (WSE), developed as a thesis project.
Updated Apr 15, 2026 - Zig