Skip to content

Create Pipeline to Classify Tweets by Topic #77

@pahdo

Description

@pahdo

Key topics

  • Machine learning
  • Human-in-the-loop
  • Predictive modeling

Objective
We want to develop a pipeline that can tag tweets by topic. In this project, a topic is something that Muni riders are talking about. For example, bus bunching and bike safety are topics. If we can tag tweets by topic in an automated fashion, we will have a data-backed understanding of what riders care about at a given moment.

First steps
We have an initial set of manually labeled tweets located in this Google Sheet. Also, it may be valuable to look at the ClassificationExperiment CodeLab, located here.

Useful tools
Classification - scikit-learn
Text preprocessing spacy

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions