Skip to content

Latest commit

 

History

History
 
 

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 

README.md

Monitoring Example

This example demonstrates monitoring integration with the Inference Gateway using:

  • Prometheus for metrics collection
  • Grafana for visualization
  • Helm chart for gateway deployment with monitoring enabled

Table of Contents

Architecture

  • Metrics Collection: Prometheus scrapes gateway metrics
  • Visualization: Grafana dashboards display metrics
  • Gateway: Inference Gateway deployed via helm chart with monitoring enabled
  • Local LLM: Ollama provider included for testing

Prerequisites

  • Task
  • kubectl
  • helm
  • ctlptl (for cluster management)

Quick Start

  1. Deploy infrastructure:
task deploy-infrastructure
  1. Deploy Inference Gateway with monitoring:
task deploy-inference-gateway
  1. Access Grafana dashboards:
kubectl -n monitoring port-forward svc/grafana-service 3000:3000

Or use the deployed ingress, add grafana.inference-gateway.local DNS to your /etc/hosts and open: http://grafana.inference-gateway.local/d/inference-gateway/inference-gateway-metrics

Login credentials:

Username: admin Password: admin

  1. Deploy Ollama and simulate requests responses being sent to the gateway:
task deploy-ollama
task simulate-requests

Configuration

Monitoring Setup

  • Edit YAMLs in prometheus/ and grafana/ directories
  • Configure scrape intervals and dashboards as needed

Gateway Monitoring

  • Monitoring settings configured via helm values in Taskfile.yaml
  • ServiceMonitor CRD enables Prometheus scraping

Cleanup

task clean