🇺🇸 English |
🇩🇪 Deutsch |
🇷🇺 Русский |
🇫🇷 Français |
🇪🇸 Español |
🇯🇵 日本語 |
🇨🇳 中文 |
🇵🇹 Português |
🇰🇷 한국어 |
🇮🇳 हिंदी
MyGPU: Lightweight GPU Management Utility: a compact
nvidia-smiwrapper with an elegant web dashboard.
- Lightweight: Minimal resource footprint.
- Flexible: Runs as a CLI tool, or a full-featured Web Dashboard.
- Admin-Centric: Includes features like VRAM Enforcement (auto-kill processes exceeding limits) and Watchlists.
- Developer-Friendly: Built-in benchmarking and stress-testing tools (GEMM, Particle Physics) to validate system stability.
-
Real-time Monitoring:
- Detailed GPU metrics (Utilization, VRAM, Power, Temp).
- System metrics (CPU, RAM, etc.).
-
Admin & Enforcement:
- VRAM Caps: Set hard limits on VRAM usage per GPU.
- Auto-Termination: Automatically terminate processes that violate VRAM policies (Admin only).
- Watchlists: Monitor specific PIDs or process names.
-
Benchmarking & Simulation:
- Stress Testing: Configurable GEMM workloads to test thermal throttling and stability.
- Visual Simulation: Interactive 3D particle physics simulation to visualize GPU load.
Contributions are welcome! Main future points to cover would be:
- Multi-GPU Support: Enhanced handling for multi-card setups and NVLink topologies.
- Containerization: Official Docker support for easy deployment in containerized environments.
- Remote Access: SSH tunneling integration and secure remote management.
- Cross-Platform:
- Linux Support (Ubuntu/Debian focus).
- macOS Support (Apple Silicon monitoring).
- Hardware Agnostic:
- AMD ROCm support.
- Intel Arc support.
Multi-Language Documentation: Supporting most popular GitHub languages.
See CONTRIBUTING.md for how to get involved.
- OS: Windows 10/11
- Python: 3.10+
- Hardware: NVIDIA GPU with installed drivers.
- CUDA: Toolkit 12.x (Strictly required for Benchmarking/Simulation features).
- Note: If CUDA 12.x is not detected, GPU-specific benchmarking features will be disabled.
The tool supports modular installation to fit your needs:
Best for headless servers or background monitoring.
- Command-line interface.
- Basic system/GPU metrics.
Best for most users.
- Includes Web Dashboard.
- REST API endpoints.
- Real-time charts.
- But no Simulation or benchmarking.
Best for development and stress testing.
- Includes Simulation.
- PyTorch/CuPy dependencies for benchmarking.
- Download the latest release or clone the repo.
- Run Setup:
.\setup.ps1- Launch:
# Start the web dashboard (Standard/Full)
python health_monitor.py web
# Start the CLI
python health_monitor.py cliSee LICENSE for details.









