It looks like your tokenization column contains spaCy Token objects, while your lemmatization column contains plain strings. They are displayed differently because of the way repr works in Python: strings are rendered with single quotes, whereas the repr of a spaCy token is just its text, with no quotes.

You can make them the same by converting the spaCy tokens to strings.
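As a rough illustration of the difference, here is a minimal sketch that uses only the standard library. `FakeToken` is a hypothetical stand-in, not part of spaCy; it only imitates the fact that a token's repr is its bare text. With real spaCy tokens, the conversion would be `token.text` (or `str(token)`) for each token.

```python
class FakeToken:
    """Hypothetical stand-in for a spaCy Token: its repr is the bare text."""

    def __init__(self, text):
        self.text = text

    def __repr__(self):
        # No quotes here, which is why a list of tokens prints without them.
        return self.text


words = ["Hello", "world"]
tokens = [FakeToken(w) for w in words]  # like a column holding Token objects
strings = [t.text for t in tokens]      # converted to plain Python strings

tokens_repr = repr(tokens)
strings_repr = repr(strings)
print(tokens_repr)   # [Hello, world]
print(strings_repr)  # ['Hello', 'world']
```

Once both columns hold plain strings, they are rendered the same way.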

Note that this isn't really a spaCy-specific issue.

Answer selected by k4waiiiii