MINAMOREED/LSTM-PySpark-Stream-Prediction

LSTM PySpark Stream Prediction

This project uses LSTM (Long Short-Term Memory) networks for time-series sales prediction and PySpark Streaming for real-time data processing.

Data

The dataset used is Store Item, available on Kaggle.

Technologies

  • Deep Learning: TensorFlow/Keras
  • Big Data & Streaming: Apache Spark (PySpark)
  • Data Processing & Analysis: Pandas, NumPy, Scikit-learn
  • Visualization: Matplotlib, Plotly

Steps

  1. Data Preprocessing:
    • Load and clean the dataset using PySpark.
    • Split data into training and testing sets.
  2. Model Development:
    • Build an LSTM model using Keras.
    • Train the model on historical sales data.
  3. Evaluation & Visualization:
    • Evaluate model performance using MSE (Mean Squared Error).
    • Visualize predictions with Matplotlib & Plotly.
  4. Real-Time Streaming & Prediction:
    • Implement PySpark Streaming to process incoming sales data.
    • Dynamically update predictions based on new inputs.
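
Steps 1 to 3 can be sketched without Spark or Keras to show the data shapes involved. This is a minimal illustration, not the notebook's code. The helper names (`make_windows`, `chronological_split`, `mse`), the lookback of 3, the 80/20 split, and the toy series are all assumptions. The windows are returned as a 3-D array of shape (samples, timesteps, features), which is the input layout a Keras LSTM layer expects.

```python
import numpy as np

def make_windows(series, lookback):
    """Turn a 1-D series into (samples, lookback, 1) inputs and next-step targets."""
    series = np.asarray(series, dtype=float)
    # Each window holds `lookback` consecutive values; the target is the value after it.
    X = np.stack([series[i:i + lookback] for i in range(len(series) - lookback)])
    y = series[lookback:]
    return X[..., np.newaxis], y

def chronological_split(X, y, train_fraction=0.8):
    """Split without shuffling so the test set lies strictly after the training set."""
    cut = int(len(X) * train_fraction)
    return X[:cut], y[:cut], X[cut:], y[cut:]

def mse(y_true, y_pred):
    """Mean squared error between targets and predictions."""
    return float(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2))

# Toy stand-in for the daily sales of one store/item pair (assumed data).
sales = np.arange(10, dtype=float)
X, y = make_windows(sales, lookback=3)
X_train, y_train, X_test, y_test = chronological_split(X, y)

# Naive baseline in place of a trained model: repeat the last value of each window.
naive_pred = X_test[:, -1, 0]
print(X.shape, X_train.shape, X_test.shape, mse(y_test, naive_pred))
```

A trained LSTM would replace `naive_pred`; comparing its MSE against this kind of baseline is one way to judge whether the model has learned anything useful.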

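The "dynamically update predictions" idea in step 4 can be illustrated with a rolling window that is refreshed as each new sales value arrives. This sketch is independent of Spark and is not taken from the notebook. `RollingForecaster` and `mean_of_window` are hypothetical names, and the averaging function is only a stand-in for a trained model's predict call.

```python
from collections import deque

class RollingForecaster:
    """Keep the most recent `lookback` observations and forecast from them."""

    def __init__(self, predict_fn, lookback):
        self.predict_fn = predict_fn
        # deque with maxlen drops the oldest value automatically on append.
        self.window = deque(maxlen=lookback)

    def update(self, value):
        """Add one new observation; return a forecast once the window is full."""
        self.window.append(float(value))
        if len(self.window) < self.window.maxlen:
            return None
        return self.predict_fn(list(self.window))

def mean_of_window(window):
    # Hypothetical stand-in for a trained model (assumption, not the LSTM itself).
    return sum(window) / len(window)

forecaster = RollingForecaster(mean_of_window, lookback=3)
forecasts = [forecaster.update(v) for v in [10, 12, 14, 16]]
print(forecasts)  # [None, None, 12.0, 14.0]
```

In a Spark job, the `update` call would sit wherever each micro-batch of records is handled, for example inside a Structured Streaming `foreachBatch` callback. That wiring is an assumption about how the streaming step could be organized and is not shown here.
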
Languages

  • Jupyter Notebook 100.0%