4 | 4 |
5 | 5 | ## ✅ Full Conceptual Pipeline: Speech → Storage → Retrieval
6 | 6 |
7 | | -🎙️ Voice Input
8 | | -    │
9 | | -    ▼
10 | | -Whisper ASR (local/cloud)
11 | | -    → Converts speech to text
12 | | -    │
13 | | -    ▼
14 | | -LLM Output Processor API (POST /ingest)
15 | | -    • Token-based Authentication
16 | | -    • Rich metadata: context, priority, intent
17 | | -    │
18 | | -    ▼
19 | | -+------------------------------------------------+
20 | | -| Qdrant Vector Store + Markdown Files (.md)     |
21 | | -| + PostgreSQL (optional metadata DB)            |
22 | | -| + Version History Tracking per record          |
23 | | -+------------------------------------------------+
24 | | -    │
25 | | -    ▼
26 | | -Prometheus / Grafana → Metrics & Monitoring
27 | | -Sentry → Error Tracking
28 | | -Structured JSON Logs → Correlation IDs
29 | | -    │
30 | | -    ▼
31 | | -Retrieval Endpoints:
32 | | -    • /search
33 | | -    • /ranked-search
34 | | -    • /records
35 | | -    • /ws/generate (streamed LLM responses)
36 | | -
37 | | ----
| 7 | +graph TD
| 8 | +    A["🎙️ Voice Input"] --> B["Whisper ASR<br>(local/cloud)"]
| 9 | +    B -->|Converts speech to text| C["LLM Output Processor API<br>(POST /ingest)"]
| 10 | +    C -->|"Token-based Authentication<br>Rich metadata: context, priority, intent"| D["Qdrant Vector Store + Markdown Files (.md)<br>+ PostgreSQL (optional metadata DB)<br>+ Version History Tracking per record"]
| 11 | +    D --> E["Prometheus / Grafana<br>→ Metrics & Monitoring"]
| 12 | +    D --> F["Sentry<br>→ Error Tracking"]
| 13 | +    D --> G["Structured JSON Logs<br>→ Correlation IDs"]
| 14 | +    D --> H["Retrieval Endpoints"]
| 15 | +    H --> I["/search"]
| 16 | +    H --> J["/ranked-search"]
| 17 | +    H --> K["/records"]
| 18 | +    H --> L["/ws/generate<br>(streamed LLM responses)"]
38 | 19 |
39 | 20 | ## ✅ Real-Time Voice Assistant UX
40 | 21 |
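
The `POST /ingest` step in the pipeline diagram can be sketched as a client-side request. This is only an illustration under stated assumptions: the JSON field names (`text`, `metadata`), the `Bearer` token scheme, the `X-Correlation-ID` header name, and the base URL are hypothetical and are not taken from the repository. The request is built but never sent.

```python
import json
import uuid
from urllib import request


def build_ingest_request(base_url: str, token: str, text: str, *,
                         context: str, priority: str, intent: str) -> request.Request:
    """Build (but do not send) a POST /ingest request for transcribed text."""
    payload = {
        "text": text,  # hypothetical field name for the ASR transcript
        # The three metadata keys mirror the diagram: context, priority, intent.
        "metadata": {"context": context, "priority": priority, "intent": intent},
    }
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {token}",     # assumed token scheme
        "X-Correlation-ID": str(uuid.uuid4()),  # assumed header name for log correlation
    }
    return request.Request(
        url=f"{base_url.rstrip('/')}/ingest",
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )


req = build_ingest_request(
    "http://localhost:8000", "example-token", "Remind me to review the logs",
    context="voice-note", priority="normal", intent="reminder",
)
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would send it. Generating the correlation ID on the client is one possible design choice; the service may instead assign it on arrival.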