This example demonstrates monitoring integration with the Inference Gateway using:
- Prometheus for metrics collection
- Grafana for visualization
- Helm chart for gateway deployment with monitoring enabled

How it works:
- Metrics collection: Prometheus scrapes the gateway's metrics
- Visualization: Grafana dashboards display the metrics
- Gateway: the Inference Gateway is deployed via Helm chart with monitoring enabled
- Local LLM: an Ollama provider is included for testing
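
The scrape path from Prometheus to the gateway can be sketched as a static scrape config. The job name, service name, namespace, and metrics port below are assumptions for illustration — in this example the equivalent configuration is generated from the ServiceMonitor rather than written by hand:

```yaml
# Hypothetical Prometheus scrape config for the gateway's metrics endpoint.
# Service name, namespace, and port are assumptions; the ServiceMonitor
# produces the equivalent scrape configuration automatically.
scrape_configs:
  - job_name: inference-gateway
    metrics_path: /metrics
    scrape_interval: 15s
    static_configs:
      - targets:
          - inference-gateway.inference-gateway.svc.cluster.local:8080
```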

Prerequisites:
- Task
- kubectl
- helm
- ctlptl (for cluster management)
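
ctlptl manages the local cluster. A minimal cluster definition might look like the following — the backing product (kind) and registry name are assumptions, since the Taskfile may create the cluster differently:

```yaml
# Hypothetical ctlptl cluster definition; the product (kind) and the
# registry name are assumptions, not taken from this example's Taskfile.
apiVersion: ctlptl.dev/v1alpha1
kind: Cluster
product: kind
registry: ctlptl-registry
```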

Usage:
- Deploy the infrastructure:

  ```sh
  task deploy-infrastructure
  ```

- Deploy the Inference Gateway with monitoring:

  ```sh
  task deploy-inference-gateway
  ```

- Access the Grafana dashboards:

  ```sh
  kubectl -n monitoring port-forward svc/grafana-service 3000:3000
  ```

  Or use the deployed ingress: add a `grafana.inference-gateway.local` entry to your `/etc/hosts` and open http://grafana.inference-gateway.local/d/inference-gateway/inference-gateway-metrics
Login credentials:
- Username: `admin`
- Password: `admin`
- Deploy Ollama and simulate requests being sent to the gateway:

  ```sh
  task deploy-ollama
  task simulate-requests
  ```
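
For the dashboards to display data, Grafana needs a Prometheus datasource; in setups like this one it is typically provisioned from a file in the `grafana/` directory. A sketch, with the Prometheus service URL assumed:

```yaml
# Hypothetical Grafana datasource provisioning file; the Prometheus
# service name and namespace are assumptions.
apiVersion: 1
datasources:
  - name: Prometheus
    type: prometheus
    access: proxy
    url: http://prometheus-service.monitoring.svc.cluster.local:9090
    isDefault: true
```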

Configuration:
- Edit the YAMLs in the `prometheus/` and `grafana/` directories
- Adjust scrape intervals and dashboards as needed
- Monitoring settings are configured via Helm values in `Taskfile.yaml`
- A ServiceMonitor resource enables Prometheus scraping of the gateway
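
The ServiceMonitor mentioned above can be sketched as follows. The selector labels, port name, and namespaces are assumptions — the real resource is rendered by the Helm chart when monitoring is enabled:

```yaml
# Hypothetical ServiceMonitor for the gateway; selector labels, port
# name, and namespaces are assumptions (the Helm chart renders the real one).
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: inference-gateway
  namespace: monitoring
spec:
  selector:
    matchLabels:
      app: inference-gateway
  namespaceSelector:
    matchNames:
      - inference-gateway
  endpoints:
    - port: metrics
      path: /metrics
      interval: 15s
```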

Cleanup:

```sh
task clean
```