A production-ready Time Series application that forecasts the average price of conventional avocados in the US using SARIMA.
- Fully automated time-series pipeline
- Modular project architecture
- Statistical validation using ADF Test
- Automatic visualization generation
- Forecast evaluation with multiple metrics
- Production-style repository structure
This project analyzes historical avocado prices (2015–2018) to predict future market trends using SARIMA time-series modeling.
The project is structured as a modular Python application, not a notebook.
- Analyze market trend and seasonality
- Test stationarity using the ADF test
- Train a SARIMA forecasting model
- Generate automated visual reports
```text
avocado-sales/
│
├── data/
│ └── avocado.csv # Raw dataset (Kaggle)
│
├── src/ # Application Source Code
│ ├── main.py # 🚀 Entry point
│ ├── loader.py # Data cleaning & preprocessing
│ ├── model.py # Statistical modeling (SARIMA)
│ ├── plots.py # Visualization engine
│ └── pics/ # Generated charts (saved automatically)
│
├
├── requirements.txt
└── README.md
pip install -r requirements.txtcd src
python main.pyThe script will:
- Load data
- Train SARIMA model
- Print metrics
- Generate charts automatically
| Metric | Score | Interpretation |
|---|---|---|
| RMSE | 0.22 | ±22 cents error |
| MAE | 0.18 | Avg absolute error |
| MAPE | 13.58% | ~86.5% accuracy |
- Pandas
- Statsmodels
- Matplotlib / Seaborn
- Scikit-Learn
- Prophet & LSTM comparison
- Streamlit deployment
- Auto hyperparameter tuning
Konul Jafarova