add_special_case to Tokenizer based on regex #10824
-
Hi y'all,
Replies: 2 comments
-
You can define a `token_match` pattern that is matched against each full token, as described here: https://spacy.io/usage/linguistic-features#how-tokenizer-works
-
Sweet! In case anyone else wonders, this is what I did:
Works beautifully!
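For anyone landing here later, a minimal sketch of the `token_match` approach (not the exact code from the reply above). It assumes spaCy v3 and a blank English pipeline; the regex for hyphenated compounds is a hypothetical pattern picked only for illustration, so swap in your own.

```python
import re

import spacy

text = "A state-of-the-art tokenizer"

# Baseline: a blank English pipeline (tokenizer only, no model download).
# The default infix rules split on hyphens between letters.
default_tokens = [t.text for t in spacy.blank("en")(text)]
print(default_tokens)

# Hypothetical pattern (assumption for illustration): letters joined by hyphens.
hyphenated = re.compile(r"[A-Za-z]+(?:-[A-Za-z]+)+")

nlp = spacy.blank("en")
# token_match expects a callable that returns a truthy value when the
# whole candidate string should be kept as a single token.
nlp.tokenizer.token_match = hyphenated.fullmatch

custom_tokens = [t.text for t in nlp(text)]
print(custom_tokens)  # ['A', 'state-of-the-art', 'tokenizer']
```

Using `fullmatch` keeps the rule from firing on strings that merely start with a match. Note that assigning `token_match` replaces whatever the language defaults set there, so combine patterns if those defaults matter for your language.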