Skip to content

Assignment 1.3 - Rare words #82

@daviddao

Description

@daviddao

In Assignment 1.3 it is written: "This will load the data in a bag-of-words representation where rare words (occurring less than 5 times in the training data) are removed". However, when I sum the word occurrences using the provided training dataset with

scr = srs.SentimentCorpus("books")

I get words, which doesn't appear at all (occurring less than 5 times)

>> scr.train_X.sum(0)
[..., 0.0, ...]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions