Skip to content
Discussion options

You must be logged in to vote

@wangxinyu0922 I think your question may be related to issue #1460. Perhaps you already saw that thread, but I wanted to make sure you didn't miss it. Like @polm said, creating a custom tokenizer is an option and I wanted to point out that you can extract morphological information from the Morphologizer, so you can get some information from there. See for instance the Label Scheme documentation for the Spanish models: https://spacy.io/models/es

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
lang / es Spanish language data and models feat / tokenizer Feature: Tokenizer more-info-needed This issue needs more information feat / morphology Feature: Morphology and MorphAnalysis
3 participants
Converted from issue

This discussion was converted from issue #8177 on May 24, 2021 03:55.