Commit ba95752
Address benwtrent's technical accuracy feedback on bbq_disk
Fixes three technical inaccuracies identified by @benwtrent:
1. Line 153: Changed "simpler datasets" to "smaller datasets" for bbq_flat
   - More accurate description of when to use bbq_flat
2. Line 176: Improved bbq_disk intro to clarify RAM constraint focus
   - Changed "store vectors on disk" to "minimize memory usage"
   - All indexes already store on disk, so this was misleading
3. Line 202: Complete rewrite of bbq_disk explanation with technical accuracy
   - Removed dataset-dependent performance claims (100 MB RAM, 15ms latency)
   - Added clear explanation of what makes bbq_disk different:
     * Keeps vectors in compressed form on disk
     * Only loads/decompresses portions on-demand during queries
     * Avoids filesystem cache dependency (unlike standard HNSW)
     * Dramatically reduces RAM requirements
     * Enables vector search on larger datasets with minimal memory
     * Trade-off: slower queries vs in-memory approaches
This explanation clarifies the key distinction: standard HNSW relies on
the filesystem cache to load vectors into memory for fast search, while
DiskBBQ avoids this by streaming compressed vectors from disk.
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <[email protected]>
1 parent 3174a6a commit ba95752
1 file changed: +3 -3 lines changed in solutions/search/semantic-search