Skip to content

Commit a372f99

Browse files
committed
docs: add semantic search roadmap
Technical roadmap for vector search, hybrid ranking, and RAG: Architecture: - VectorStore provider pattern (custom + LanceDB baseline) - Lean interface returning IDs, orchestrator hydrates - Three chunking strategies (line, sentence, semantic) - Embedding provider abstraction (Ollama, OpenAI, Gemini, Transformers.js) - LLM provider abstraction for RAG/reranking - Hybrid search with FTS5 + RRF fusion - Reranker preserves retrieval scores - Evaluation with per-category breakdown Phases: - Phase 1: Vector search core (chunking, embeddings, brute-force) - Phase 2: Hybrid search (FTS5, BM25, RRF, HNSW) - Phase 3: RAG & evaluation (reranking, MCP integration, metrics) Includes complete type definitions and CLI reference.
1 parent 2eb612f commit a372f99

File tree

1 file changed

+570
-0
lines changed

1 file changed

+570
-0
lines changed

0 commit comments

Comments
 (0)