Hi @Emporea, you might want to define a custom Tokenizer with special cases. For hyphens, you can compile a regex pattern and pass its finditer method to the Tokenizer's infix_finditer param, as demonstrated in this example.
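
The example linked above did not survive in this copy of the thread, so here is a minimal sketch of the idea rather than that example itself. It assumes spaCy v3 and a blank English pipeline. The regex patterns, the `:)` special case, and the helper name `build_tokenizer` are illustrative choices, not something prescribed in this thread. The infix pattern here splits on `-` and `~`; if you want hyphenated words kept together instead, leave the hyphen out of the infix pattern.

```python
import re

import spacy
from spacy.tokenizer import Tokenizer

# Special cases: strings that should always come out as the given tokens.
special_cases = {":)": [{"ORTH": ":)"}]}

# Illustrative patterns (assumptions, adjust to your data).
prefix_re = re.compile(r'''^[\[\("']''')   # opening brackets/quotes at the start
suffix_re = re.compile(r'''[\]\)"']$''')   # closing brackets/quotes at the end
infix_re = re.compile(r'''[-~]''')         # split inside a token on - or ~


def build_tokenizer(nlp):
    """Hypothetical helper: build a Tokenizer from the patterns above."""
    return Tokenizer(
        nlp.vocab,
        rules=special_cases,
        prefix_search=prefix_re.search,
        suffix_search=suffix_re.search,
        # infix_finditer expects a callable, so pass the bound finditer method.
        infix_finditer=infix_re.finditer,
    )


nlp = spacy.blank("en")  # blank pipeline, no trained model needed
nlp.tokenizer = build_tokenizer(nlp)

tokens = [t.text for t in nlp("hello-world. :)")]
print(tokens)  # ['hello', '-', 'world.', ':)']
```

Note that replacing the tokenizer this way drops spaCy's default prefix, suffix, and infix rules, which is why `world.` keeps its trailing period in the output. If you only want to change hyphen handling, it may be easier to start from the language defaults and modify just the infix patterns.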

Answer selected by svlandeg
Labels
feat / tokenizer Feature: Tokenizer