What are the simplest methods for the label noise problem?

If I have enough low quality data from unsupervised methods or rule-based methods.

In detail, I deal with a multi-label classification task. First I crawl web page such as wiki and use regex-based rule to mark the label. The model input is the wiki title and the model output is the rule-matched labels from wiki content. My task is to predict the labels for the wiki title.

Do you think **removing the wrong data predicted by trained model** is a simple but effective method?

@GuokaiLiu  Thank you very much!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

What are the simplest methods for the label noise problem? #3

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

What are the simplest methods for the label noise problem? #3

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions