- Sign-up for a Twitter developer account on this link
- Create a Bearer Token (documentation)
- Fill in the field
BEARER_TOKENin the.envfile - Create a Reddit developed application on this link (documentation)
- Fill in the fields
CLIENT_ID,SECRET_TOKEN,USERNAMEandPASSWORDin the.envfile - Install and run Kafka (documentation)
- Create a MongoDB database in the cloud (free) or install the server (documentation)
- Fill in the fields
CONNECTION_STRINGin the.envfile
docker-compose -f docker-compose.yml up- Clone the repository
git clone https://github.com/baptiste-pasquier/trend-tracker- Install the project
- With
poetry(installation) :
poetry install- With
pip:
pip install -e .- Install pre-commit
pre-commit installWarning Each script must be run in a separate console
- Twitter streaming:
python all_services/ingest_tweets/app.py- Reddit streaming:
python all_services/ingest_reddit/app.py- Data preprocessing:
python all_services/tsf_data/app.py- Data clustering:
python all_services/cluster_data/app.py- Data storage on MongoDB:
python all_services/store_data/app.pyReal-time visualization:
streamlit run streamlit_app.py