How this project works? #2
ajeetraina
announced in
Announcements
Replies: 1 comment
-
The Go Backend is the "Metrics Calculator"
Example of What the Backend Measures:
Why This Architecture Works Well:
Prometheus and Grafana get metrics from the Go backend, which acts as a bridge between the host-based llama.cpp and the containerized monitoring stack. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
The Metrics Flow Architecture
The Go Backend Acts as a "Metrics Bridge"
The key insight is that Prometheus and Grafana don't directly fetch from llama.cpp. Instead:
1. llama.cpp process runs natively on your machine
2. Go Backend (in container) connects to llama.cpp
3. Go Backend observes API interactions and calculates metrics
4. Go Backend exposes calculated metrics at /metrics endpoint
5. Prometheus (container) scrapes from Go Backend (container)
6. Grafana (container) queries Prometheus (container)
Visual Architecture
Beta Was this translation helpful? Give feedback.
All reactions