-
Notifications
You must be signed in to change notification settings - Fork 11
Create Pipeline to Classify Tweets by Topic #77
Copy link
Copy link
Open
Description
Key topics
- Machine learning
- Human-in-the-loop
- Predictive modeling
Objective
We want to develop a pipeline that can tag tweets by topic. In this project, a topic is something that Muni riders are talking about. For example, bus bunching and bike safety are topics. If we can tag tweets by topic in an automated fashion, we will have a data-backed understanding of what riders care about at a given moment.
First steps
We have an initial set of manually labeled tweets located in this Google Sheet. Also, it may be valuable to look at the ClassificationExperiment CodeLab, located here.
Useful tools
Classification - scikit-learn
Text preprocessing spacy
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels