Skip to content

Build a Service that Pulls Tweets at a Regular Interval #78

@pahdo

Description

@pahdo

Key topics

  • Software Engineering
  • Infrastructure
  • Amazon Web Services

Objective
We are currently pulling tweets on an ad-hoc basis. Since we need as much training data as possible to build good models, and because Twitter only gives us 7 days of data at any given time, we want to build a service that regularly pulls and saves data from Twitter.

First steps
In order of complexity, we will want to spin up a server on glitch.me, Heroku, or AWS EC2. The simplest implementation of this tweet puller would be a job that hits the Twitter API and dumps the response to a file. We can schedule this job using cron.

Useful tools
Crontab Man Pages - man7.org
Twitter API Docs

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions