World-class machine learning system for geomagnetic storm prediction using 29 years of space weather data.
SOLARIS-X is an advanced ensemble machine learning system designed for real-time geomagnetic storm prediction. Built with 29 years of space weather data (1996-2025) and 79 physics-informed features, it achieves 98.02% AUC with superior storm detection capabilities.
- NASA OMNI Database: Solar wind parameters, IMF data.
- NOAA SWPC: Geomagnetic indices (Kp, AE, Dst).
- Temporal Coverage: 1996-2025 (29 years).
- Data Quality: 98.3% completeness after cleaning.
- Kp-index - Geomagnetic activity indicator.
- IMF Magnitude - Interplanetary magnetic field strength.
- Plasma Beta - Solar wind plasma parameter.
- AE Index - Auroral electrojet activity.
- Solar Cycle Phase - Long-term solar variability.
Clone repository:
git clone https://github.com/yourusername/SOLARIS-X.git
cd SOLARIS-X
Create virtual environment:
python -m venv venv
Activate virtual environment:
#Windows:
venv\Scripts\activate
#Linux/Mac:
source venv/bin/activate
Install dependencies:
pip install -r requirements.txt
Run complete training pipeline:
python scripts/training/complete_pipeline.py
Individual model training:
python scripts/training/test_lightgbm.py
python scripts/training/test_neural_network.py
import joblib
import pandas as pd
import numpy as np
Example space weather features
features = {
'Kp_index': 3.5,
'IMF_Magnitude_lag_12h': 7.2,
'Plasma_Beta': 0.8,
'AE_index': 150.0,
'Solar_Cycle_Phase': 0.6
# ... additional features required
}
print("SOLARIS-X Space Weather Prediction System")
print("Geomagnetic Storm Prediction: Ready for deployment")
π Data Pipeline
data/processed/features/- Engineered feature datasets (excluded).data/raw/omni/- Original OMNI database files (excluded).
π€ Training System
scripts/training/models/- Individual model trainers (4 advanced models).scripts/training/utils/- Training utilities and base classes.scripts/training/complete_pipeline.py- Main orchestrator.
πΎ Model Management
models/checkpoints/- Training checkpoints and metadata.models/trained/- Production models (excluded).
π Results & Analysis
results/plots/- Performance visualizations (10 charts).
π Configuration
requirements.txt- Python dependencies..gitignore- Repository optimization.README.md- Documentation.
- 25+ Python modules with professional architecture.
- 4 advanced ML models including meta-ensemble system.
- Complete MLOps pipeline with automated evaluation.
- Production-ready deployment configuration.
- Modular Design: Separated model trainers and utilities.
- Production Ready: Complete MLOps pipeline structure.
- Optimized Storage: Large files excluded via .gitignore.
- Comprehensive Evaluation: Visualization and metrics tracking.
- Professional Organization: Clear separation of concerns.
Note: Files marked as "(excluded)" are not tracked in git due to size constraints but are generated during training.
- Temporal Validation: Proper chronological train/validation/test splits.
- Physics-Informed: Features based on magnetospheric coupling theory.
- Imbalanced Learning: Specialized techniques for rare storm events.
- Ensemble Methods: Meta-learning for optimal prediction combination.
- No Data Leakage: Strict temporal separation of datasets.
- Multiple Metrics: AUC, F1, Precision, Recall for comprehensive assessment.
- Cross-Validation: Robust performance estimation.
- Uncertainty Quantification: Prediction confidence intervals.
- Space Weather Centers: NOAA, ESA integration.
- Satellite Operations: ISS, commercial satellite protection.
- Power Grid Safety: Geomagnetic storm early warning.
- Aviation: High-altitude flight safety alerts.
- Space Physics: Magnetospheric dynamics research.
- Climate Science: Space weather impact studies.
- Machine Learning: Rare event prediction techniques.
- Data Science: Time series ensemble methods.
- Python: 3.8+
- RAM: 8GB minimum, 16GB recommended.
- CPU: Multi-core processor (12+ cores optimal).
- Storage: 5GB for full dataset and models.
lightgbm==4.6.0
tensorflow-cpu==2.20.0
scikit-learn==1.3.0
pandas==2.0.3
numpy==1.24.3
matplotlib==3.7.1
seaborn==0.12.2
Sumanth - Space Weather & Machine Learning Research
- π GitHub: @Sumanth1410-git
- π§ Contact: sumanthp141005@gmail.com
This project is licensed under the MIT License.
- NASA OMNI Database - Space weather data provision.
- NOAA Space Weather Prediction Center - Operational data access.
- Space Weather Community - Research inspiration and validation.
- Open Source Contributors - Tool and library development.
Built with β€οΈ for the space weather research community
β Star this repository if it helps your research!