Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
54 commits
Select commit Hold shift + click to select a range
219c967
add scatter gather?
michaelfeil Nov 9, 2025
00a7b76
compiles
michaelfeil Nov 9, 2025
976f563
compiles
michaelfeil Nov 9, 2025
7b4b2e9
compiles and tests are passing
michaelfeil Nov 9, 2025
93af502
add another passing tests
michaelfeil Nov 9, 2025
632e54a
working and fast
michaelfeil Nov 9, 2025
ceb1aea
sleepup from mapfree operation
michaelfeil Nov 9, 2025
49dc90d
single pass indexing
michaelfeil Nov 9, 2025
618bc8d
just fmt
michaelfeil Nov 9, 2025
3105ac0
small removal
michaelfeil Nov 9, 2025
3a63882
fmt only
michaelfeil Nov 9, 2025
2ffbb9b
scatter gather help
michaelfeil Nov 9, 2025
4a49939
clippy fixes
michaelfeil Nov 9, 2025
04d7a07
project is compiling?!
michaelfeil Nov 9, 2025
40c3792
working e2e, prompetheus ingration
michaelfeil Nov 9, 2025
91cb5ad
add percentage in command line
michaelfeil Nov 21, 2025
26a1441
update defualt factor
michaelfeil Nov 21, 2025
1ebdf4e
move radix mlp to separate layer
michaelfeil Nov 21, 2025
07f9b95
radix mlp implementation
michaelfeil Nov 21, 2025
9ccd7e6
add comment
michaelfeil Nov 21, 2025
bf0c9cb
add comment
michaelfeil Nov 21, 2025
09fdf3d
flash qwen2
michaelfeil Nov 21, 2025
f35638d
add radix mlp folder
michaelfeil Nov 21, 2025
dba7ddc
compression ratio
michaelfeil Nov 21, 2025
bc47026
cargo releases
michaelfeil Nov 21, 2025
b87aeb4
add queue.rs
michaelfeil Nov 22, 2025
8bb103f
set padding
michaelfeil Nov 25, 2025
0b25115
clippy fix
michaelfeil Nov 25, 2025
a3e761b
fix >= 1.0 condition
michaelfeil Nov 26, 2025
cc88374
add bench crate
michaelfeil Dec 7, 2025
aba0825
improve benchmark
michaelfeil Dec 7, 2025
cfedb27
better bench
michaelfeil Dec 7, 2025
dc5d666
better benchmark
michaelfeil Dec 7, 2025
57c5566
b 32
michaelfeil Dec 7, 2025
088ba6b
normalized benchmark
michaelfeil Dec 7, 2025
89224ce
better bench
michaelfeil Dec 8, 2025
76b503c
8b benchmark
michaelfeil Dec 8, 2025
96a3be7
add flash-index-select-cu
michaelfeil Dec 8, 2025
3279f42
add index_select revision
michaelfeil Dec 8, 2025
a6aad91
Merge branch 'main' into mf/flash-scatter-gather
michaelfeil Dec 15, 2025
c1db7fc
pull in main
michaelfeil Dec 28, 2025
84c2a88
pull in master
michaelfeil Dec 28, 2025
c311d23
add radixmlp with and without padding
michaelfeil Dec 28, 2025
a357d0f
add radix_mlp dependency
michaelfeil Dec 28, 2025
b617460
rename benchmarks, refactor radix_mlp in separate crate
michaelfeil Dec 28, 2025
3b05b17
rename benchmarks, refactor radix_mlp in separate crate
michaelfeil Dec 28, 2025
7695ded
queue.rs fixes
michaelfeil Dec 28, 2025
b31d032
fix compile issues
michaelfeil Dec 28, 2025
9e007cf
radix mlp as workspace dependency instead
michaelfeil Dec 28, 2025
df7e0c5
move radixmlp
michaelfeil Dec 28, 2025
ebdf390
bump version in radix mlp
michaelfeil Dec 28, 2025
5ccabf1
add radix mlp threshold in info"
michaelfeil Jan 2, 2026
e0241e9
clippy fix
michaelfeil Jan 2, 2026
17c70fa
Merge branch 'main' into mf/flash-scatter-gather
michaelfeil Jan 2, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
162 changes: 161 additions & 1 deletion Cargo.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

2 changes: 2 additions & 0 deletions Cargo.toml
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,7 @@
members = [
"backends",
"backends/candle",
"backends/candle-bench",
"backends/ort",
"backends/core",
"backends/python",
Expand Down Expand Up @@ -54,6 +55,7 @@ candle-index-select-cu = { version = "0.0.1", features = ["cuda-11"], default-f
candle-rotary = { version = "0.0.1" }
candle-flash-attn-v1 = { version = "0.0.1" }
half = { version = "2.3.1", features = ["num-traits"] }
radix_mlp = { version = "0.0.2" }

[patch.crates-io]
cudarc = { git = "https://github.com/Narsil/cudarc" , rev = "8b4f18b4bcd5e4b1a9daf40abc3a2e27f83f06e9"}
Expand Down
29 changes: 29 additions & 0 deletions backends/candle-bench/Cargo.toml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
[package]
name = "candle-bench"
version = "0.1.0"
edition = "2021"
publish = false

[dependencies]
text-embeddings-backend-candle = { path = "../candle" }
text-embeddings-backend = { path = ".." }
text-embeddings-backend-core = { path = "../core" }
text-embeddings-core = { path = "../../core" }
anyhow = { workspace = true }
cudarc = { workspace = true, optional = true }
hf-hub = { workspace = true , features = ["ureq"] }
radix_mlp = { workspace = true }
serde_json = "*"
tracing = "*"

[dev-dependencies]
criterion = "0.5"

[[bench]]
name = "radix_mlp_benchmark"
harness = false

[features]
metal = ["text-embeddings-backend-candle/metal"]
cuda = ["text-embeddings-backend-candle/cuda", "text-embeddings-backend-candle/flash-attn", "dep:cudarc","cudarc?/dynamic-linking"]
candle = ["text-embeddings-backend/candle"]
3 changes: 3 additions & 0 deletions backends/candle-bench/Readme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
```
cargo bench --manifest-path backends/candle-bench/Cargo.toml --bench radix_mlp_benchmark --features "candle,cuda"
```
Loading