Skip to content

Commit 1161ae6

Browse files
committed
chore: update semantic router docs
1 parent 30c304f commit 1161ae6

File tree

3 files changed

+74
-114
lines changed

3 files changed

+74
-114
lines changed

docs.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -248,4 +248,4 @@
248248
"youtube": "https://www.youtube.com/@jamesbriggs"
249249
}
250250
}
251-
}
251+
}

semantic-router/user-guide/changelog.mdx

Lines changed: 73 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,75 @@
1+
### v0.1.12
2+
3+
The `0.1.12` release brings Pinecone v7 support and AWS Bedrock boto3 client integration, improving compatibility and expanding cloud deployment options.
4+
5+
#### Feature: Pinecone v7 Upgrade
6+
7+
Semantic Router now supports Pinecone Python SDK v7, providing:
8+
- Improved data plane performance and reliability
9+
- Enhanced async support for high-throughput applications
10+
- Simplified API structure for better developer experience
11+
- Better error handling and retry logic
12+
13+
The upgrade is backward compatible - existing code continues to work without changes. For shared index deployments, set `PINECONE_INDEX_NAME` to reuse existing indexes and avoid quota limits. Semantic Router automatically isolates data using namespaces.
14+
15+
#### Feature: Bedrock boto3 Client Support
16+
17+
Added support for custom boto3 clients in the `BedrockEncoder`, enabling:
18+
- Advanced AWS credential management (IAM roles, cross-account access)
19+
- Custom retry policies and timeout configurations
20+
- VPC endpoint support for private deployments
21+
- Better integration with existing AWS infrastructure
22+
23+
This provides greater flexibility for enterprise AWS deployments while maintaining backward compatibility with the existing credential-based approach.
24+
25+
---
26+
27+
### v0.1.11
28+
29+
The `0.1.11` release introduces new encoder options for local and self-hosted deployments, alongside sparse encoding improvements and CI/CD enhancements.
30+
31+
#### Feature: Local Encoder
32+
33+
Added `LocalEncoder` for fully local embedding generation using sentence-transformers:
34+
- No API keys or internet connection required
35+
- Automatic device selection (CUDA, MPS, CPU)
36+
- Support for any sentence-transformers model
37+
- Privacy-first design - all data stays on your machine
38+
39+
Perfect for offline deployments, privacy-sensitive applications, or development environments.
40+
41+
#### Feature: Local Sparse Encoder
42+
43+
Added sparse encoder support using sentence transformers with `LocalSparseEncoder`.
44+
- Same offline advantages as the Local Encoder above.
45+
- Better handling of sentence boundaries
46+
- Improved BM25 and TF-IDF implementations
47+
- Enhanced compatibility with `HybridRouter`
48+
#### Feature: Ollama Encoder
49+
50+
Introduced `OllamaEncoder` for using Ollama-hosted embedding models:
51+
- Works with any Ollama embedding model (nomic-embed-text, mxbai-embed-large, etc.)
52+
- Full control over model versions and hosting
53+
- Low latency with local Ollama instances
54+
- Both sync and async support
55+
56+
Ideal for organizations wanting to self-host embedding models while maintaining API-like convenience.
57+
58+
59+
#### Feature: Dagger CI
60+
61+
Implemented Dagger for CI/CD pipeline, providing:
62+
- Faster, more reliable builds
63+
- Better caching and parallelization
64+
- Consistent local and remote execution
65+
- Improved developer experience
66+
67+
#### Chore: Torch Dependency Optimization
68+
69+
Removed torch from main dependencies to reduce installation size. PyTorch is now only installed when needed via `semantic-router[local]` extras, reducing default installation from ~2GB to <100MB.
70+
71+
---
72+
173
### v0.1.10
274

375
The `0.1.10` release was primarily focused on expanding async support for `QdrantIndex`, `PostgresIndex`, and `HybridRouter`, alongside many synchronization and testing improvements.
@@ -54,4 +126,4 @@ local = [
54126

55127
#### Chore: Broader and More Useful Tests
56128

57-
We have broken our tests apart into strict unit and integration test directories. Now, when incoming PRs are raised we will no longer trigger integration tests that require API keys to successfully run. To ensure we're still covering all components of the library we have broadened our testing suite to extensively test `LocalIndex`, `PineconeIndex` (via Pinecone Local), `PostgresIndex`, and `QdrantIndex` within those unit tests.
129+
We have broken our tests apart into strict unit and integration test directories. Now, when incoming PRs are raised we will no longer trigger integration tests that require API keys to successfully run. To ensure we're still covering all components of the library we have broadened our testing suite to extensively test `LocalIndex`, `PineconeIndex` (via Pinecone Local), `PostgresIndex`, and `QdrantIndex` within those unit tests.

semantic-router/user-guide/guides/pinecone-v7.mdx

Lines changed: 0 additions & 112 deletions
This file was deleted.

0 commit comments

Comments
 (0)