vllm-project
diff --git a/‎CONTRIBUTING.md‎
Lines changed: 2 additions & 2 deletions b/‎CONTRIBUTING.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎README.md‎
Lines changed: 4 additions & 32 deletions b/‎README.md‎
Lines changed: 4 additions & 32 deletions
diff --git a/‎deploy/kubernetes/README.md‎
Lines changed: 1 addition & 1 deletion b/‎deploy/kubernetes/README.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎src/semantic-router/cmd/main.go‎
Lines changed: 1 addition & 1 deletion b/‎src/semantic-router/cmd/main.go‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎website/.docusaurus/client-manifest.json‎
Lines changed: 18 additions & 18 deletions b/‎website/.docusaurus/client-manifest.json‎
Lines changed: 18 additions & 18 deletions
diff --git a/‎website/.docusaurus/docusaurus-plugin-content-docs/default/p/docs-175.json‎
Lines changed: 1 addition & 1 deletion b/‎website/.docusaurus/docusaurus-plugin-content-docs/default/p/docs-175.json‎
Lines changed: 1 addition & 1 deletion
@@ -1,6 +1,6 @@
-# Contributing to LLM Semantic Router
+# Contributing to vLLM Semantic Router
 
-Thank you for your interest in contributing to the LLM Semantic Router project! This guide will help you get started with development and contributing to the project.
+Thank you for your interest in contributing to the vLLM Semantic Router project! This guide will help you get started with development and contributing to the project.
 
 ## Table of Contents
 
 
@@ -1,6 +1,6 @@
 <div align="center">
 
-<img src="website/static/img/repo.png" alt="LLM Semantic Router"/>
+<img src="website/static/img/repo.png" alt="vLLM Semantic Router"/>
 
 [![Documentation](https://img.shields.io/badge/docs-read%20the%20docs-blue)](https://llm-semantic-router.readthedocs.io/en/latest/)
 [![Hugging Face](https://img.shields.io/badge/🤗%20Hugging%20Face-Community-yellow)](https://huggingface.co/LLM-Semantic-Router)
@@ -15,37 +15,9 @@
 
 ## Overview
 
-```mermaid
-graph TB
-    Client[Client Request] --> Router[vLLM Semantic Router]
-    
-    subgraph "Intent Understanding"
-        direction LR
-        PII[PII Detector] 
-        Jailbreak[Jailbreak Guard]
-        Category[Category Classifier]
-        Cache[Semantic Cache]
-    end
-    
-    Router --> PII
-    Router --> Jailbreak  
-    Router --> Category
-    Router --> Cache
-    
-    PII --> Decision{Security Check}
-    Jailbreak --> Decision
-    Decision -->|Block| Block[Block Request]
-    Decision -->|Pass| Category
-    Category --> Models[Route to Specialized Model]
-    Cache -->|Hit| FastResponse[Return Cached Response]
-    
-    Models --> Math[Math Model]
-    Models --> Creative[Creative Model] 
-    Models --> Code[Code Model]
-    Models --> General[General Model]
-```
-
-### Auto-Selection of Models
+![](./website/static/img/architecture.png)
+
+### Auto-Reasoning and Auto-Selection of Models
 
 An **Mixture-of-Models** (MoM) router that intelligently directs OpenAI API requests to the most suitable models from a defined pool based on **Semantic Understanding** of the request's intent (Complexity, Task, Tools).
 
 
@@ -17,7 +17,7 @@ The deployment consists of:
 
 ## Ports
 
-- **50051**: gRPC API (LLM Semantic Router ExtProc)
+- **50051**: gRPC API (vLLM Semantic Router ExtProc)
 - **9190**: Prometheus metrics
 
 ## Deployment
 
@@ -44,7 +44,7 @@ func main() {
 		log.Fatalf("Failed to create ExtProc server: %v", err)
 	}
 
-	log.Printf("Starting LLM Semantic Router ExtProc with config: %s", *configPath)
+	log.Printf("Starting vLLM Semantic Router ExtProc with config: %s", *configPath)
 
 	// Start Classification API server if enabled
 	if *enableAPI {
 
@@ -366,9 +366,9 @@
     "849": {
       "js": [
         {
-          "file": "assets/js/0058b4c6.33f169dd.js",
-          "hash": "842afc77d0aa620b",
-          "publicPath": "/assets/js/0058b4c6.33f169dd.js"
+          "file": "assets/js/0058b4c6.5774ef6d.js",
+          "hash": "72d4499e535070c8",
+          "publicPath": "/assets/js/0058b4c6.5774ef6d.js"
         }
       ]
     },
@@ -429,9 +429,9 @@
     "1869": {
       "css": [
         {
-          "file": "assets/css/styles.f55e26d4.css",
-          "hash": "3f6d30ecd8d89ed0",
-          "publicPath": "/assets/css/styles.f55e26d4.css"
+          "file": "assets/css/styles.267b8a8e.css",
+          "hash": "8a94587058cfc753",
+          "publicPath": "/assets/css/styles.267b8a8e.css"
         }
       ]
     },
@@ -492,9 +492,9 @@
     "2634": {
       "js": [
         {
-          "file": "assets/js/c4f5d8e4.f45b1ce6.js",
-          "hash": "80e68f3177a7ce28",
-          "publicPath": "/assets/js/c4f5d8e4.f45b1ce6.js"
+          "file": "assets/js/c4f5d8e4.b7348ab3.js",
+          "hash": "cb6219784beae8ad",
+          "publicPath": "/assets/js/c4f5d8e4.b7348ab3.js"
         }
       ]
     },
@@ -555,9 +555,9 @@
     "3976": {
       "js": [
         {
-          "file": "assets/js/0e384e19.f8f3d3f3.js",
-          "hash": "8c33224767770a06",
-          "publicPath": "/assets/js/0e384e19.f8f3d3f3.js"
+          "file": "assets/js/0e384e19.07a9307d.js",
+          "hash": "f883753ab784d216",
+          "publicPath": "/assets/js/0e384e19.07a9307d.js"
         }
       ]
     },
@@ -663,9 +663,9 @@
     "5354": {
       "js": [
         {
-          "file": "assets/js/runtime~main.71ea62a3.js",
-          "hash": "e2dce0d6e0f4c1f2",
-          "publicPath": "/assets/js/runtime~main.71ea62a3.js"
+          "file": "assets/js/runtime~main.9ae70a89.js",
+          "hash": "fd9cd34fb19d0d1d",
+          "publicPath": "/assets/js/runtime~main.9ae70a89.js"
         }
       ]
     },
@@ -753,9 +753,9 @@
     "7082": {
       "js": [
         {
-          "file": "assets/js/4bf05604.88ac84d0.js",
-          "hash": "4ec1afea60e64ac0",
-          "publicPath": "/assets/js/4bf05604.88ac84d0.js"
+          "file": "assets/js/4bf05604.e4033055.js",
+          "hash": "e142232da15bc602",
+          "publicPath": "/assets/js/4bf05604.e4033055.js"
         }
       ]
     },
 
@@ -1 +1 @@
-{"version":{"pluginId":"default","version":"current","label":"Next","banner":null,"badge":false,"noIndex":false,"className":"docs-version-current","isLast":true,"docsSidebars":{"tutorialSidebar":[{"type":"link","label":"LLM Semantic Router","href":"/docs/intro","docId":"intro","unlisted":false},{"type":"category","label":"Overview","items":[{"type":"link","label":"Semantic Router Overview","href":"/docs/overview/semantic-router-overview","docId":"overview/semantic-router-overview","unlisted":false},{"type":"link","label":"Why Mixture of Models?","href":"/docs/overview/mixture-of-models","docId":"overview/mixture-of-models","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Architecture","items":[{"type":"link","label":"System Architecture","href":"/docs/architecture/system-architecture","docId":"architecture/system-architecture","unlisted":false},{"type":"link","label":"Envoy ExtProc Integration","href":"/docs/architecture/envoy-extproc","docId":"architecture/envoy-extproc","unlisted":false},{"type":"link","label":"Router Implementation Details","href":"/docs/architecture/router-implementation","docId":"architecture/router-implementation","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Model Training","items":[{"type":"link","label":"Model Training Overview","href":"/docs/training/training-overview","docId":"training/training-overview","unlisted":false},{"type":"link","label":"Classification Models","href":"/docs/training/classification-models","docId":"training/classification-models","unlisted":false},{"type":"link","label":"Datasets and Purposes","href":"/docs/training/datasets","docId":"training/datasets","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Getting Started","items":[{"type":"link","label":"Installation Guide","href":"/docs/getting-started/installation","docId":"getting-started/installation","unlisted":false},{"type":"link","label":"Quick Start Guide","href":"/docs/getting-started/quick-start","docId":"getting-started/quick-start","unlisted":false},{"type":"link","label":"Configuration Guide","href":"/docs/getting-started/configuration","docId":"getting-started/configuration","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"API Reference","items":[{"type":"link","label":"Router API Reference","href":"/docs/api/router","docId":"api/router","unlisted":false},{"type":"link","label":"Classification API Reference","href":"/docs/api/classification","docId":"api/classification","unlisted":false}],"collapsed":true,"collapsible":true}]},"docs":{"api/classification":{"id":"api/classification","title":"Classification API Reference","description":"The Classification API provides direct access to the Semantic Router's classification models for intent detection, PII identification, and security analysis. This API is useful for testing, debugging, and standalone classification tasks.","sidebar":"tutorialSidebar"},"api/router":{"id":"api/router","title":"Router API Reference","description":"The Semantic Router provides a gRPC-based API that integrates seamlessly with Envoy's External Processing (ExtProc) protocol. This document covers the API endpoints, request/response formats, and integration patterns.","sidebar":"tutorialSidebar"},"architecture/envoy-extproc":{"id":"architecture/envoy-extproc","title":"Envoy ExtProc Integration","description":"The Semantic Router leverages Envoy's External Processing (ExtProc) filter to implement intelligent routing decisions. This integration provides a clean separation between traffic management (Envoy) and business logic (Semantic Router), enabling sophisticated routing capabilities while maintaining high performance.","sidebar":"tutorialSidebar"},"architecture/router-implementation":{"id":"architecture/router-implementation","title":"Router Implementation Details","description":"This document provides detailed insights into the core routing algorithms, classification logic, and implementation specifics of the Semantic Router.","sidebar":"tutorialSidebar"},"architecture/system-architecture":{"id":"architecture/system-architecture","title":"System Architecture","description":"The Semantic Router implements a sophisticated Mixture-of-Models (MoM) architecture using Envoy Proxy as the foundation, with an External Processor (ExtProc) service that provides intelligent routing capabilities. This design ensures high performance, scalability, and maintainability for production LLM deployments.","sidebar":"tutorialSidebar"},"getting-started/configuration":{"id":"getting-started/configuration","title":"Configuration Guide","description":"This guide covers all configuration options available in the Semantic Router, from basic setup to advanced customization for production deployments.","sidebar":"tutorialSidebar"},"getting-started/installation":{"id":"getting-started/installation","title":"Installation Guide","description":"This guide will help you set up and install the Semantic Router on your system. The installation process includes setting up dependencies, downloading models, and configuring the routing system.","sidebar":"tutorialSidebar"},"getting-started/quick-start":{"id":"getting-started/quick-start","title":"Quick Start Guide","description":"This guide will get you up and running with the Semantic Router in just a few minutes. Follow these steps to see the router in action with intelligent model selection.","sidebar":"tutorialSidebar"},"intro":{"id":"intro","title":"LLM Semantic Router","description":"License","sidebar":"tutorialSidebar"},"overview/mixture-of-models":{"id":"overview/mixture-of-models","title":"Why Mixture of Models?","description":"The Mixture of Models (MoM) approach represents a fundamental shift from traditional single-model deployment to a more intelligent, cost-effective, and performance-optimized architecture. This section explores the compelling reasons why MoM has become the preferred approach for production LLM deployments.","sidebar":"tutorialSidebar"},"overview/semantic-router-overview":{"id":"overview/semantic-router-overview","title":"Semantic Router Overview","description":"Semantic routers represent a paradigm shift in how we deploy and utilize large language models at scale. By intelligently routing queries to the most appropriate model based on semantic understanding, these systems optimize the critical balance between performance, cost, and quality.","sidebar":"tutorialSidebar"},"training/classification-models":{"id":"training/classification-models","title":"Classification Models","description":"This document provides in-depth technical details about each classification model used in the Semantic Router, including architecture specifics, training procedures, and performance characteristics.","sidebar":"tutorialSidebar"},"training/datasets":{"id":"training/datasets","title":"Datasets and Purposes","description":"This document provides comprehensive details about the datasets used to train each classification model in the Semantic Router, including data sources, preprocessing methods, and the specific purposes each dataset serves in the routing pipeline.","sidebar":"tutorialSidebar"},"training/training-overview":{"id":"training/training-overview","title":"Model Training Overview","description":"The Semantic Router relies on multiple specialized classification models to make intelligent routing decisions. This section provides a comprehensive overview of the training process, datasets used, and the purpose of each model in the routing pipeline.","sidebar":"tutorialSidebar"}}}}
+{"version":{"pluginId":"default","version":"current","label":"Next","banner":null,"badge":false,"noIndex":false,"className":"docs-version-current","isLast":true,"docsSidebars":{"tutorialSidebar":[{"type":"link","label":"vLLM Semantic Router","href":"/docs/intro","docId":"intro","unlisted":false},{"type":"category","label":"Overview","items":[{"type":"link","label":"Semantic Router Overview","href":"/docs/overview/semantic-router-overview","docId":"overview/semantic-router-overview","unlisted":false},{"type":"link","label":"Why Mixture of Models?","href":"/docs/overview/mixture-of-models","docId":"overview/mixture-of-models","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Architecture","items":[{"type":"link","label":"System Architecture","href":"/docs/architecture/system-architecture","docId":"architecture/system-architecture","unlisted":false},{"type":"link","label":"Envoy ExtProc Integration","href":"/docs/architecture/envoy-extproc","docId":"architecture/envoy-extproc","unlisted":false},{"type":"link","label":"Router Implementation Details","href":"/docs/architecture/router-implementation","docId":"architecture/router-implementation","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Model Training","items":[{"type":"link","label":"Model Training Overview","href":"/docs/training/training-overview","docId":"training/training-overview","unlisted":false},{"type":"link","label":"Classification Models","href":"/docs/training/classification-models","docId":"training/classification-models","unlisted":false},{"type":"link","label":"Datasets and Purposes","href":"/docs/training/datasets","docId":"training/datasets","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Getting Started","items":[{"type":"link","label":"Installation Guide","href":"/docs/getting-started/installation","docId":"getting-started/installation","unlisted":false},{"type":"link","label":"Quick Start Guide","href":"/docs/getting-started/quick-start","docId":"getting-started/quick-start","unlisted":false},{"type":"link","label":"Configuration Guide","href":"/docs/getting-started/configuration","docId":"getting-started/configuration","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"API Reference","items":[{"type":"link","label":"Router API Reference","href":"/docs/api/router","docId":"api/router","unlisted":false},{"type":"link","label":"Classification API Reference","href":"/docs/api/classification","docId":"api/classification","unlisted":false}],"collapsed":true,"collapsible":true}]},"docs":{"api/classification":{"id":"api/classification","title":"Classification API Reference","description":"The Classification API provides direct access to the Semantic Router's classification models for intent detection, PII identification, and security analysis. This API is useful for testing, debugging, and standalone classification tasks.","sidebar":"tutorialSidebar"},"api/router":{"id":"api/router","title":"Router API Reference","description":"The Semantic Router provides a gRPC-based API that integrates seamlessly with Envoy's External Processing (ExtProc) protocol. This document covers the API endpoints, request/response formats, and integration patterns.","sidebar":"tutorialSidebar"},"architecture/envoy-extproc":{"id":"architecture/envoy-extproc","title":"Envoy ExtProc Integration","description":"The Semantic Router leverages Envoy's External Processing (ExtProc) filter to implement intelligent routing decisions. This integration provides a clean separation between traffic management (Envoy) and business logic (Semantic Router), enabling sophisticated routing capabilities while maintaining high performance.","sidebar":"tutorialSidebar"},"architecture/router-implementation":{"id":"architecture/router-implementation","title":"Router Implementation Details","description":"This document provides detailed insights into the core routing algorithms, classification logic, and implementation specifics of the Semantic Router.","sidebar":"tutorialSidebar"},"architecture/system-architecture":{"id":"architecture/system-architecture","title":"System Architecture","description":"The Semantic Router implements a sophisticated Mixture-of-Models (MoM) architecture using Envoy Proxy as the foundation, with an External Processor (ExtProc) service that provides intelligent routing capabilities. This design ensures high performance, scalability, and maintainability for production LLM deployments.","sidebar":"tutorialSidebar"},"getting-started/configuration":{"id":"getting-started/configuration","title":"Configuration Guide","description":"This guide covers all configuration options available in the Semantic Router, from basic setup to advanced customization for production deployments.","sidebar":"tutorialSidebar"},"getting-started/installation":{"id":"getting-started/installation","title":"Installation Guide","description":"This guide will help you set up and install the Semantic Router on your system. The installation process includes setting up dependencies, downloading models, and configuring the routing system.","sidebar":"tutorialSidebar"},"getting-started/quick-start":{"id":"getting-started/quick-start","title":"Quick Start Guide","description":"This guide will get you up and running with the Semantic Router in just a few minutes. Follow these steps to see the router in action with intelligent model selection.","sidebar":"tutorialSidebar"},"intro":{"id":"intro","title":"vLLM Semantic Router","description":"License","sidebar":"tutorialSidebar"},"overview/mixture-of-models":{"id":"overview/mixture-of-models","title":"Why Mixture of Models?","description":"The Mixture of Models (MoM) approach represents a fundamental shift from traditional single-model deployment to a more intelligent, cost-effective, and performance-optimized architecture. This section explores the compelling reasons why MoM has become the preferred approach for production LLM deployments.","sidebar":"tutorialSidebar"},"overview/semantic-router-overview":{"id":"overview/semantic-router-overview","title":"Semantic Router Overview","description":"Semantic routers represent a paradigm shift in how we deploy and utilize large language models at scale. By intelligently routing queries to the most appropriate model based on semantic understanding, these systems optimize the critical balance between performance, cost, and quality.","sidebar":"tutorialSidebar"},"training/classification-models":{"id":"training/classification-models","title":"Classification Models","description":"This document provides in-depth technical details about each classification model used in the Semantic Router, including architecture specifics, training procedures, and performance characteristics.","sidebar":"tutorialSidebar"},"training/datasets":{"id":"training/datasets","title":"Datasets and Purposes","description":"This document provides comprehensive details about the datasets used to train each classification model in the Semantic Router, including data sources, preprocessing methods, and the specific purposes each dataset serves in the routing pipeline.","sidebar":"tutorialSidebar"},"training/training-overview":{"id":"training/training-overview","title":"Model Training Overview","description":"The Semantic Router relies on multiple specialized classification models to make intelligent routing decisions. This section provides a comprehensive overview of the training process, datasets used, and the purpose of each model in the routing pipeline.","sidebar":"tutorialSidebar"}}}}
Original file line number	Diff line number	Diff line change
`@@ -44,7 +44,7 @@ func main() {`
`44`	`44`	`log.Fatalf("Failed to create ExtProc server: %v", err)`
`45`	`45`	`}`
`46`	`46`
`47`		`- log.Printf("Starting LLM Semantic Router ExtProc with config: %s", *configPath)`
	`47`	`+ log.Printf("Starting vLLM Semantic Router ExtProc with config: %s", *configPath)`
`48`	`48`
`49`	`49`	`// Start Classification API server if enabled`
`50`	`50`	`if *enableAPI {`
Original file line number	Diff line number	Diff line change
`@@ -366,9 +366,9 @@`
`366`	`366`	`"849": {`
`367`	`367`	`"js": [`
`368`	`368`	`{`
`369`		`- "file": "assets/js/0058b4c6.33f169dd.js",`
`370`		`- "hash": "842afc77d0aa620b",`
`371`		`- "publicPath": "/assets/js/0058b4c6.33f169dd.js"`
	`369`	`+ "file": "assets/js/0058b4c6.5774ef6d.js",`
	`370`	`+ "hash": "72d4499e535070c8",`
	`371`	`+ "publicPath": "/assets/js/0058b4c6.5774ef6d.js"`
`372`	`372`	`}`
`373`	`373`	`]`
`374`	`374`	`},`
`@@ -429,9 +429,9 @@`
`429`	`429`	`"1869": {`
`430`	`430`	`"css": [`
`431`	`431`	`{`
`432`		`- "file": "assets/css/styles.f55e26d4.css",`
`433`		`- "hash": "3f6d30ecd8d89ed0",`
`434`		`- "publicPath": "/assets/css/styles.f55e26d4.css"`
	`432`	`+ "file": "assets/css/styles.267b8a8e.css",`
	`433`	`+ "hash": "8a94587058cfc753",`
	`434`	`+ "publicPath": "/assets/css/styles.267b8a8e.css"`
`435`	`435`	`}`
`436`	`436`	`]`
`437`	`437`	`},`
`@@ -492,9 +492,9 @@`
`492`	`492`	`"2634": {`
`493`	`493`	`"js": [`
`494`	`494`	`{`
`495`		`- "file": "assets/js/c4f5d8e4.f45b1ce6.js",`
`496`		`- "hash": "80e68f3177a7ce28",`
`497`		`- "publicPath": "/assets/js/c4f5d8e4.f45b1ce6.js"`
	`495`	`+ "file": "assets/js/c4f5d8e4.b7348ab3.js",`
	`496`	`+ "hash": "cb6219784beae8ad",`
	`497`	`+ "publicPath": "/assets/js/c4f5d8e4.b7348ab3.js"`
`498`	`498`	`}`
`499`	`499`	`]`
`500`	`500`	`},`
`@@ -555,9 +555,9 @@`
`555`	`555`	`"3976": {`
`556`	`556`	`"js": [`
`557`	`557`	`{`
`558`		`- "file": "assets/js/0e384e19.f8f3d3f3.js",`
`559`		`- "hash": "8c33224767770a06",`
`560`		`- "publicPath": "/assets/js/0e384e19.f8f3d3f3.js"`
	`558`	`+ "file": "assets/js/0e384e19.07a9307d.js",`
	`559`	`+ "hash": "f883753ab784d216",`
	`560`	`+ "publicPath": "/assets/js/0e384e19.07a9307d.js"`
`561`	`561`	`}`
`562`	`562`	`]`
`563`	`563`	`},`
`@@ -663,9 +663,9 @@`
`663`	`663`	`"5354": {`
`664`	`664`	`"js": [`
`665`	`665`	`{`
`666`		`- "file": "assets/js/runtime~main.71ea62a3.js",`
`667`		`- "hash": "e2dce0d6e0f4c1f2",`
`668`		`- "publicPath": "/assets/js/runtime~main.71ea62a3.js"`
	`666`	`+ "file": "assets/js/runtime~main.9ae70a89.js",`
	`667`	`+ "hash": "fd9cd34fb19d0d1d",`
	`668`	`+ "publicPath": "/assets/js/runtime~main.9ae70a89.js"`
`669`	`669`	`}`
`670`	`670`	`]`
`671`	`671`	`},`
`@@ -753,9 +753,9 @@`
`753`	`753`	`"7082": {`
`754`	`754`	`"js": [`
`755`	`755`	`{`
`756`		`- "file": "assets/js/4bf05604.88ac84d0.js",`
`757`		`- "hash": "4ec1afea60e64ac0",`
`758`		`- "publicPath": "/assets/js/4bf05604.88ac84d0.js"`
	`756`	`+ "file": "assets/js/4bf05604.e4033055.js",`
	`757`	`+ "hash": "e142232da15bc602",`
	`758`	`+ "publicPath": "/assets/js/4bf05604.e4033055.js"`
`759`	`759`	`}`
`760`	`760`	`]`
`761`	`761`	`},`
Original file line number	Diff line number	Diff line change
`@@ -1 +1 @@`
`1`		-{"version":{"pluginId":"default","version":"current","label":"Next","banner":null,"badge":false,"noIndex":false,"className":"docs-version-current","isLast":true,"docsSidebars":{"tutorialSidebar":[{"type":"link","label":"LLM Semantic Router","href":"/docs/intro","docId":"intro","unlisted":false},{"type":"category","label":"Overview","items":[{"type":"link","label":"Semantic Router Overview","href":"/docs/overview/semantic-router-overview","docId":"overview/semantic-router-overview","unlisted":false},{"type":"link","label":"Why Mixture of Models?","href":"/docs/overview/mixture-of-models","docId":"overview/mixture-of-models","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Architecture","items":[{"type":"link","label":"System Architecture","href":"/docs/architecture/system-architecture","docId":"architecture/system-architecture","unlisted":false},{"type":"link","label":"Envoy ExtProc Integration","href":"/docs/architecture/envoy-extproc","docId":"architecture/envoy-extproc","unlisted":false},{"type":"link","label":"Router Implementation Details","href":"/docs/architecture/router-implementation","docId":"architecture/router-implementation","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Model Training","items":[{"type":"link","label":"Model Training Overview","href":"/docs/training/training-overview","docId":"training/training-overview","unlisted":false},{"type":"link","label":"Classification Models","href":"/docs/training/classification-models","docId":"training/classification-models","unlisted":false},{"type":"link","label":"Datasets and Purposes","href":"/docs/training/datasets","docId":"training/datasets","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Getting Started","items":[{"type":"link","label":"Installation Guide","href":"/docs/getting-started/installation","docId":"getting-started/installation","unlisted":false},{"type":"link","label":"Quick Start Guide","href":"/docs/getting-started/quick-start","docId":"getting-started/quick-start","unlisted":false},{"type":"link","label":"Configuration Guide","href":"/docs/getting-started/configuration","docId":"getting-started/configuration","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"API Reference","items":[{"type":"link","label":"Router API Reference","href":"/docs/api/router","docId":"api/router","unlisted":false},{"type":"link","label":"Classification API Reference","href":"/docs/api/classification","docId":"api/classification","unlisted":false}],"collapsed":true,"collapsible":true}]},"docs":{"api/classification":{"id":"api/classification","title":"Classification API Reference","description":"The Classification API provides direct access to the Semantic Router's classification models for intent detection, PII identification, and security analysis. This API is useful for testing, debugging, and standalone classification tasks.","sidebar":"tutorialSidebar"},"api/router":{"id":"api/router","title":"Router API Reference","description":"The Semantic Router provides a gRPC-based API that integrates seamlessly with Envoy's External Processing (ExtProc) protocol. This document covers the API endpoints, request/response formats, and integration patterns.","sidebar":"tutorialSidebar"},"architecture/envoy-extproc":{"id":"architecture/envoy-extproc","title":"Envoy ExtProc Integration","description":"The Semantic Router leverages Envoy's External Processing (ExtProc) filter to implement intelligent routing decisions. This integration provides a clean separation between traffic management (Envoy) and business logic (Semantic Router), enabling sophisticated routing capabilities while maintaining high performance.","sidebar":"tutorialSidebar"},"architecture/router-implementation":{"id":"architecture/router-implementation","title":"Router Implementation Details","description":"This document provides detailed insights into the core routing algorithms, classification logic, and implementation specifics of the Semantic Router.","sidebar":"tutorialSidebar"},"architecture/system-architecture":{"id":"architecture/system-architecture","title":"System Architecture","description":"The Semantic Router implements a sophisticated Mixture-of-Models (MoM) architecture using Envoy Proxy as the foundation, with an External Processor (ExtProc) service that provides intelligent routing capabilities. This design ensures high performance, scalability, and maintainability for production LLM deployments.","sidebar":"tutorialSidebar"},"getting-started/configuration":{"id":"getting-started/configuration","title":"Configuration Guide","description":"This guide covers all configuration options available in the Semantic Router, from basic setup to advanced customization for production deployments.","sidebar":"tutorialSidebar"},"getting-started/installation":{"id":"getting-started/installation","title":"Installation Guide","description":"This guide will help you set up and install the Semantic Router on your system. The installation process includes setting up dependencies, downloading models, and configuring the routing system.","sidebar":"tutorialSidebar"},"getting-started/quick-start":{"id":"getting-started/quick-start","title":"Quick Start Guide","description":"This guide will get you up and running with the Semantic Router in just a few minutes. Follow these steps to see the router in action with intelligent model selection.","sidebar":"tutorialSidebar"},"intro":{"id":"intro","title":"LLM Semantic Router","description":"License","sidebar":"tutorialSidebar"},"overview/mixture-of-models":{"id":"overview/mixture-of-models","title":"Why Mixture of Models?","description":"The Mixture of Models (MoM) approach represents a fundamental shift from traditional single-model deployment to a more intelligent, cost-effective, and performance-optimized architecture. This section explores the compelling reasons why MoM has become the preferred approach for production LLM deployments.","sidebar":"tutorialSidebar"},"overview/semantic-router-overview":{"id":"overview/semantic-router-overview","title":"Semantic Router Overview","description":"Semantic routers represent a paradigm shift in how we deploy and utilize large language models at scale. By intelligently routing queries to the most appropriate model based on semantic understanding, these systems optimize the critical balance between performance, cost, and quality.","sidebar":"tutorialSidebar"},"training/classification-models":{"id":"training/classification-models","title":"Classification Models","description":"This document provides in-depth technical details about each classification model used in the Semantic Router, including architecture specifics, training procedures, and performance characteristics.","sidebar":"tutorialSidebar"},"training/datasets":{"id":"training/datasets","title":"Datasets and Purposes","description":"This document provides comprehensive details about the datasets used to train each classification model in the Semantic Router, including data sources, preprocessing methods, and the specific purposes each dataset serves in the routing pipeline.","sidebar":"tutorialSidebar"},"training/training-overview":{"id":"training/training-overview","title":"Model Training Overview","description":"The Semantic Router relies on multiple specialized classification models to make intelligent routing decisions. This section provides a comprehensive overview of the training process, datasets used, and the purpose of each model in the routing pipeline.","sidebar":"tutorialSidebar"}}}}
	`1`	+{"version":{"pluginId":"default","version":"current","label":"Next","banner":null,"badge":false,"noIndex":false,"className":"docs-version-current","isLast":true,"docsSidebars":{"tutorialSidebar":[{"type":"link","label":"vLLM Semantic Router","href":"/docs/intro","docId":"intro","unlisted":false},{"type":"category","label":"Overview","items":[{"type":"link","label":"Semantic Router Overview","href":"/docs/overview/semantic-router-overview","docId":"overview/semantic-router-overview","unlisted":false},{"type":"link","label":"Why Mixture of Models?","href":"/docs/overview/mixture-of-models","docId":"overview/mixture-of-models","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Architecture","items":[{"type":"link","label":"System Architecture","href":"/docs/architecture/system-architecture","docId":"architecture/system-architecture","unlisted":false},{"type":"link","label":"Envoy ExtProc Integration","href":"/docs/architecture/envoy-extproc","docId":"architecture/envoy-extproc","unlisted":false},{"type":"link","label":"Router Implementation Details","href":"/docs/architecture/router-implementation","docId":"architecture/router-implementation","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Model Training","items":[{"type":"link","label":"Model Training Overview","href":"/docs/training/training-overview","docId":"training/training-overview","unlisted":false},{"type":"link","label":"Classification Models","href":"/docs/training/classification-models","docId":"training/classification-models","unlisted":false},{"type":"link","label":"Datasets and Purposes","href":"/docs/training/datasets","docId":"training/datasets","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"Getting Started","items":[{"type":"link","label":"Installation Guide","href":"/docs/getting-started/installation","docId":"getting-started/installation","unlisted":false},{"type":"link","label":"Quick Start Guide","href":"/docs/getting-started/quick-start","docId":"getting-started/quick-start","unlisted":false},{"type":"link","label":"Configuration Guide","href":"/docs/getting-started/configuration","docId":"getting-started/configuration","unlisted":false}],"collapsed":true,"collapsible":true},{"type":"category","label":"API Reference","items":[{"type":"link","label":"Router API Reference","href":"/docs/api/router","docId":"api/router","unlisted":false},{"type":"link","label":"Classification API Reference","href":"/docs/api/classification","docId":"api/classification","unlisted":false}],"collapsed":true,"collapsible":true}]},"docs":{"api/classification":{"id":"api/classification","title":"Classification API Reference","description":"The Classification API provides direct access to the Semantic Router's classification models for intent detection, PII identification, and security analysis. This API is useful for testing, debugging, and standalone classification tasks.","sidebar":"tutorialSidebar"},"api/router":{"id":"api/router","title":"Router API Reference","description":"The Semantic Router provides a gRPC-based API that integrates seamlessly with Envoy's External Processing (ExtProc) protocol. This document covers the API endpoints, request/response formats, and integration patterns.","sidebar":"tutorialSidebar"},"architecture/envoy-extproc":{"id":"architecture/envoy-extproc","title":"Envoy ExtProc Integration","description":"The Semantic Router leverages Envoy's External Processing (ExtProc) filter to implement intelligent routing decisions. This integration provides a clean separation between traffic management (Envoy) and business logic (Semantic Router), enabling sophisticated routing capabilities while maintaining high performance.","sidebar":"tutorialSidebar"},"architecture/router-implementation":{"id":"architecture/router-implementation","title":"Router Implementation Details","description":"This document provides detailed insights into the core routing algorithms, classification logic, and implementation specifics of the Semantic Router.","sidebar":"tutorialSidebar"},"architecture/system-architecture":{"id":"architecture/system-architecture","title":"System Architecture","description":"The Semantic Router implements a sophisticated Mixture-of-Models (MoM) architecture using Envoy Proxy as the foundation, with an External Processor (ExtProc) service that provides intelligent routing capabilities. This design ensures high performance, scalability, and maintainability for production LLM deployments.","sidebar":"tutorialSidebar"},"getting-started/configuration":{"id":"getting-started/configuration","title":"Configuration Guide","description":"This guide covers all configuration options available in the Semantic Router, from basic setup to advanced customization for production deployments.","sidebar":"tutorialSidebar"},"getting-started/installation":{"id":"getting-started/installation","title":"Installation Guide","description":"This guide will help you set up and install the Semantic Router on your system. The installation process includes setting up dependencies, downloading models, and configuring the routing system.","sidebar":"tutorialSidebar"},"getting-started/quick-start":{"id":"getting-started/quick-start","title":"Quick Start Guide","description":"This guide will get you up and running with the Semantic Router in just a few minutes. Follow these steps to see the router in action with intelligent model selection.","sidebar":"tutorialSidebar"},"intro":{"id":"intro","title":"vLLM Semantic Router","description":"License","sidebar":"tutorialSidebar"},"overview/mixture-of-models":{"id":"overview/mixture-of-models","title":"Why Mixture of Models?","description":"The Mixture of Models (MoM) approach represents a fundamental shift from traditional single-model deployment to a more intelligent, cost-effective, and performance-optimized architecture. This section explores the compelling reasons why MoM has become the preferred approach for production LLM deployments.","sidebar":"tutorialSidebar"},"overview/semantic-router-overview":{"id":"overview/semantic-router-overview","title":"Semantic Router Overview","description":"Semantic routers represent a paradigm shift in how we deploy and utilize large language models at scale. By intelligently routing queries to the most appropriate model based on semantic understanding, these systems optimize the critical balance between performance, cost, and quality.","sidebar":"tutorialSidebar"},"training/classification-models":{"id":"training/classification-models","title":"Classification Models","description":"This document provides in-depth technical details about each classification model used in the Semantic Router, including architecture specifics, training procedures, and performance characteristics.","sidebar":"tutorialSidebar"},"training/datasets":{"id":"training/datasets","title":"Datasets and Purposes","description":"This document provides comprehensive details about the datasets used to train each classification model in the Semantic Router, including data sources, preprocessing methods, and the specific purposes each dataset serves in the routing pipeline.","sidebar":"tutorialSidebar"},"training/training-overview":{"id":"training/training-overview","title":"Model Training Overview","description":"The Semantic Router relies on multiple specialized classification models to make intelligent routing decisions. This section provides a comprehensive overview of the training process, datasets used, and the purpose of each model in the routing pipeline.","sidebar":"tutorialSidebar"}}}}