Skip to content

Commit 2d97782

Browse files
author
TinySemVer
committed
Release: v0.2.0 [skip ci]
### Minor - Add: Latency Hiding & Port Interleaving (086f8d7) - Add: AMX kernels (0cb024d) - Add: Inline Assembly kernels (89095a6) - Add: BLAS & Eigen TOPs benchmarks (28ca39b) - Add: AVX2 & low-precision AVX-512 TOPS (0a48108) - Add: `i8`, `f16`, and `bf16` kernels (3f54200) - Add: Arm NEON FMAs (d0e521e) - Add: `vfmadd231ps` kernels (7ca3161) - Add: Assembly micro-kernels (2e71e76) ### Patch - Docs: Zen4 matmul-benchmarks (2476310) - Docs: H100 Tensor Cores vs Intel (fa86663) - Fix: `Illegal instruction` for AMX (a7243dd) - Fix: Duplicate `.global` symbols (c732234) - Docs: Recommended Eigen macros (7be2d58) - Fix: Missing `tops_u8_neon` (d97bbfc) - Fix: Missing `tops_f64_neon` (4afa7e3) - Improve: Shorter TOPS names (be0c94b)
1 parent 714dad9 commit 2d97782

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

CMakeLists.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ cmake_minimum_required(VERSION 3.16 FATAL_ERROR)
55
# Project Setup
66
# ------------------------------------------------------------------------------
77
project(less_slow
8-
VERSION 0.1.1
8+
VERSION 0.2.0
99
LANGUAGES C CXX ASM
1010
DESCRIPTION "Learning how to write Less Slow code, from numerical micro-kernels and SIMD to coroutines, ranges, and polymorphic state machines"
1111
HOMEPAGE_URL "https://github.com/ashvardanian/less_slow.cpp")

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
0.1.1
1+
0.2.0

0 commit comments

Comments
 (0)