data_pipeline

A Dockerized pipeline that collects tweets and stores them in MongoDB. An ETL job then analyzes the sentiment of each tweet and stores the result in PostgreSQL. Finally, the tweets with the best sentiment are posted to Slack automatically.

This repo is the result of my 6th weekly project at SPICED Academy Berlin.

The Goal of the Project:

Build a Dockerized Data Pipeline that analyzes the sentiment of tweets.

The pipeline should collect tweets and store them in a database. Next, the sentiment of each tweet is analyzed and the result is stored in a second database. Finally, the tweet with the best or worst sentiment in a given time interval is posted to Slack automatically.
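
The analysis and selection steps described above can be sketched in a few lines of Python. This is only an illustration, not the code from this repo: the word lists are a toy stand-in for a real sentiment model, the raw tweets are hard-coded instead of being read from MongoDB, and all names (`score_text`, `transform`, `best_and_worst`) are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass

# Toy word lists standing in for a real sentiment model (assumption).
POSITIVE = {"good", "great", "love", "happy"}
NEGATIVE = {"bad", "awful", "hate", "sad"}


@dataclass
class ScoredTweet:
    text: str
    score: float  # from -1.0 (negative) to 1.0 (positive)


def score_text(text: str) -> float:
    """Return (positive - negative) matches divided by all matches."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    matched = pos + neg
    return 0.0 if matched == 0 else (pos - neg) / matched


def transform(raw_tweets: list[dict]) -> list[ScoredTweet]:
    """ETL 'transform' step: attach a sentiment score to each raw tweet."""
    return [ScoredTweet(t["text"], score_text(t["text"])) for t in raw_tweets]


def best_and_worst(scored: list[ScoredTweet]) -> tuple[ScoredTweet, ScoredTweet]:
    """Pick the tweets to report for one time interval."""
    best = max(scored, key=lambda s: s.score)
    worst = min(scored, key=lambda s: s.score)
    return best, worst


# Hard-coded documents in place of a MongoDB query (assumption).
raw = [
    {"text": "I love this great city!"},
    {"text": "Awful weather, I hate it."},
    {"text": "Just arrived in Berlin."},
]
best, worst = best_and_worst(transform(raw))
```

In the real pipeline, `raw` would come from the first database and the scored tweets would be written to the second one before the best or worst tweet is selected.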

Challenges of the project:

- Get Docker running
- Build a skeleton pipeline
- Collect tweets
- Store tweets in MongoDB
- Create an ETL task
- Run sentiment analysis
- Build a Slack bot
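
For the last challenge, a Slack bot can be as small as one HTTP POST. The sketch below assumes the "slackbot url" is a Slack incoming webhook URL, which accepts a JSON body with a `text` field. The function name `build_slack_request` and the URL are placeholders, and the request is only built here, not sent.

```python
import json
import urllib.request


def build_slack_request(webhook_url: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a Slack incoming webhook."""
    payload = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Placeholder URL; the real one is a secret and belongs in the .env file.
request = build_slack_request(
    "https://hooks.slack.com/services/PLACEHOLDER",
    "Most positive tweet: I love this great city!",
)

# To actually post the message, uncomment the next line:
# urllib.request.urlopen(request)
```

Keeping the request construction separate from sending makes the bot easy to test without touching Slack.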

What is what in this repo?

- Besides the README, the license and images/, the root directory contains the docker-compose file, which runs the whole pipeline.
- To start, use the "docker-compose build" and/or "docker-compose up -d" commands.
- The Docker containers for MongoDB and Postgres are built from pre-made images.
- The Docker containers for tweepy, etl and slackbot are built from custom Dockerfiles (placed in the corresponding folders).
- The Twitter API, ETL and Slackbot programs are written in Python (placed in the corresponding folders).

Passwords and credentials/.env:

To run this program you need .env files with:
- Postgres password
- Twitter developer API credentials
- Slackbot URL (not copied to this repo)
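
One possible way for the Python programs to read these values is from environment variables, which docker-compose can fill from a .env file. The sketch below is an assumption about how that could look: the variable names in `REQUIRED_KEYS` and the function `load_settings` are hypothetical and must be adapted to the keys your own .env files define.

```python
from __future__ import annotations

import os
from typing import Mapping

# Hypothetical variable names; use whatever keys your .env files define.
REQUIRED_KEYS = (
    "POSTGRES_PASSWORD",
    "TWITTER_API_KEY",
    "TWITTER_API_SECRET",
    "SLACK_WEBHOOK_URL",
)


def load_settings(env: Mapping[str, str] = os.environ) -> dict[str, str]:
    """Return the required settings, failing early if any are missing."""
    missing = [key for key in REQUIRED_KEYS if not env.get(key)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {key: env[key] for key in REQUIRED_KEYS}


# Demonstration with dummy values instead of the real environment.
settings = load_settings(
    {
        "POSTGRES_PASSWORD": "dummy",
        "TWITTER_API_KEY": "dummy",
        "TWITTER_API_SECRET": "dummy",
        "SLACK_WEBHOOK_URL": "https://hooks.slack.com/services/PLACEHOLDER",
    }
)
```

Failing at startup with a clear message is usually easier to debug than a container that crashes later on its first database or API call.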

