English | 简体中文
DeepStressModel is a powerful AI model performance testing and monitoring tool specifically designed for evaluating and analyzing the performance of large language models. Through an intuitive graphical interface and comprehensive data analysis capabilities, it helps developers and researchers better understand and optimize their AI models.
- Concurrent Testing: Support for customizable concurrent stress testing
- Multi-Dataset Support: Test multiple datasets simultaneously with weight configuration
- Real-time Monitoring: Visual display of key metrics including response time and generation speed
- Automated Testing: Support for batch testing and scheduled tasks (in development)
- Output Modes: Support for streaming output and direct output testing modes
- Multi-GPU Monitoring: Support for parallel monitoring and load balancing analysis of multiple GPU cards
- Real-time Tracking: Monitor local and remote GPU usage in real-time
- Remote Connection: Support custom SSH port (default 22) for flexible server configuration
- Key Metrics: Track memory usage, GPU utilization, temperature, power consumption and more
- Historical Records: Save monitoring data for trend analysis and load prediction
- Standardized Testing Process: Normalized testing based on preset test sets and fixed test environments
- Automatic Framework Detection: Automatically identify the framework type of the running model (such as Ollama, llama.cpp, vLLM, etc.)
- Multi-dimensional Evaluation: Comprehensive assessment of model performance across throughput, latency, response time, and more
- Leaderboard Support: Support both online and offline modes for submitting test results to the leaderboard
- Secure Encryption: Result encryption functionality to protect sensitive test data
- Automatic Scoring: Automatic calculation of comprehensive scores based on multi-dimensional performance metrics
- Rich Charts: Multi-dimensional data visualization
- Performance Metrics: Including average response time, TPS, generation speed, etc.
- Data Export: Support for test data export and report generation
- Intuitive Operation: Clear tab-based design
- Real-time Feedback: Live display of test progress and results
- Flexible Configuration: Support for various customizable test parameters
-
GUI Module
- Built on PyQt5
- Responsive interface design
- Multi-tab management
- Real-time data flow visualization
-
Testing Engine
- Asynchronous concurrent processing
- API call management
- Data collection and statistics
- Support for streaming and direct output modes
- Intelligent load balancing
-
Monitoring System
- Multi-GPU resource monitoring
- System performance tracking
- Remote monitoring support
- Load balancing analysis
- Performance warning mechanism
-
Benchmarking System
- Standardized testing protocols
- Automatic framework recognition
- Result encryption and verification
- Local result storage and upload
- Multi-mode (online/offline) support
-
Data Management
- SQLite data storage
- Configuration management
- Test record persistence
- Encrypted data processing
DeepStressModel provides a complete model performance leaderboard system to help users understand the performance of different models in various hardware environments. Access at: https://tops.ginease.cn:4433
- Global Ranking: View model performance rankings on a global scale
- Multi-dimensional Sorting: Sort by throughput, latency, memory efficiency and other dimensions
- Hardware Filtering: Filter leaderboard data based on hardware configurations
- Result Verification: Anti-cheating system ensures all submitted results are authentic and reliable
- Personal Records: Track testing history on personal devices
- Online/Offline Mode: Support both real-time online submission and offline batch submission
- Run Standard Tests: Use DeepStressModel's built-in standard testing process
- Submit Results: Choose to encrypt and upload results to the leaderboard server
- View Rankings: Check the latest rankings and detailed data analysis through the leaderboard website
- Encrypted result transmission
- Hardware fingerprint verification
- Anti-cheating system monitoring
- User anonymity options
-
Feature Enhancement
- Add more data visualization options
- Support more types of AI models
- Enhance remote monitoring capabilities
-
Performance Optimization
- Improve large-scale testing performance
- Optimize memory usage
- Improve data processing efficiency
-
Leaderboard Expansion
- Build comprehensive scoring system
- Add model efficiency analysis
- Support more hardware platforms
- Add community interaction features
- Optimize leaderboard UI and user experience
-
Cloud Integration
- Support cloud deployment
- Distributed testing support
- Multi-user collaboration features
-
Intelligent Analysis
- AI-assisted analysis
- Automatic optimization suggestions
- Intelligent report generation
-
Ecosystem Expansion
- Open API interface
- Third-party plugin support
- Cross-platform application support
We welcome community contributions! If you would like to participate in project development, please:
- Fork this repository
- Create your feature branch
- Submit your changes
- Create a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details
- Project Homepage: GitHub
- Issue Reporting: Issues
- Email Contact: your.email@example.com
DeepStressModel - Making AI model testing simpler and more efficient!