You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Instead of relying solely on domain classification, the Signal-Decision Architecture extracts three complementary types of signals from each user query. Each signal type leverages different AI/ML techniques and serves distinct purposes in the routing decision process.
150
98
151
-
```mermaid
152
-
graph TB
153
-
A[User Query] --> B[Signal Extraction Layer]
154
-
B --> C[Keyword Extractor<br/>Regex Matching]
155
-
B --> D[Embedding Model<br/>Sentence Transformers]
156
-
B --> E[Domain Classifier<br/>MMLU + LoRA]
157
-
C --> F[Keyword Signals<br/>urgency, security, etc.]
158
-
D --> G[Embedding Signals<br/>intent similarity scores]
159
-
E --> H[Domain Signals<br/>computer_science, etc.]
@@ -222,22 +157,7 @@ Domain signals use MMLU-trained classification models to identify the academic o
222
157
223
158
This enables organizations to extend domain classification to their specific verticals while maintaining the base model's general knowledge.
224
159
225
-
```mermaid
226
-
graph TB
227
-
A[User Query:<br/>'Review this medical imaging protocol'] --> B[Domain Classifier]
228
-
B --> C{Base MMLU Model}
229
-
C --> D[Detect: Healthcare Domain]
230
-
D --> E{Load LoRA Adapter}
231
-
E --> F[medical_imaging LoRA]
232
-
E --> G[clinical_trials LoRA]
233
-
E --> H[pharmaceutical_research LoRA]
234
-
F --> I[Fine-grained Classification:<br/>medical_imaging]
235
-
I --> J[Route to Specialized Model:<br/>medical-imaging-expert]
236
-
237
-
style F fill:#e1f5ff
238
-
style I fill:#c3e6cb
239
-
style J fill:#d4edda
240
-
```
160
+

241
161
242
162
**Use Cases**:
243
163
@@ -297,34 +217,7 @@ Each decision consists of:
297
217
298
218
#### Decision Evaluation Flow
299
219
300
-
```mermaid
301
-
graph TB
302
-
A[Extracted Signals] --> B{Evaluate All Decisions}
303
-
B --> C[Decision 1: Priority 100<br/>Rule: urgency AND security AND cs]
304
-
B --> D[Decision 2: Priority 80<br/>Rule: code-review AND cs]
305
-
B --> E[Decision 3: Priority 60<br/>Rule: architecture-design OR cs]
306
-
C --> F{Match?}
307
-
D --> G{Match?}
308
-
E --> H{Match?}
309
-
F -->|Yes| I[Matched: Priority 100]
310
-
F -->|No| J[Not Matched]
311
-
G -->|Yes| K[Matched: Priority 80]
312
-
G -->|No| L[Not Matched]
313
-
H -->|Yes| M[Matched: Priority 60]
314
-
H -->|No| N[Not Matched]
315
-
I --> O{Multiple Matches?}
316
-
K --> O
317
-
M --> O
318
-
J --> P{Any Match?}
319
-
L --> P
320
-
N --> P
321
-
O -->|Yes| Q[Select Highest Priority]
322
-
O -->|No| R[Use Single Match]
323
-
P -->|No| S[Fallback to Default Model]
324
-
Q --> T[Execute Plugin Chain]
325
-
R --> T
326
-
S --> U[Route to Default Model]
327
-
```
220
+

328
221
329
222
When multiple decisions match, the system selects the one with the highest priority. If no decisions match, the system falls back to the default model.
330
223
@@ -344,20 +237,7 @@ Plugins execute in the configured order, with each plugin able to modify the req
0 commit comments