Skip to content
Discussion options

You must be logged in to vote

Hey @lio-data,

This is perhaps a naive suggestion, but when you're assigning things to be ._.is_cn in your custom code, can't you also label all numbers as such, so that you don't need the LIKE_NUM part anymore? For English, you can find its definition here.

One other alternative that is maybe a little more involved, but would result in nicer matcher patterns is this: you can influence the definition of the LIKE_NUM attribute directly by going in and writing new language defaults and replacing the lex_attr_getters with your own implementation. For more information on that, see here: https://spacy.io/usage/linguistic-features#language-data

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@lio-data
Comment options

Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage General spaCy usage feat / matcher Feature: Token, phrase and dependency matcher
2 participants