That means there is no fast tokenizer implementation, so you would have to port the Python code to Java yourself.
You might also want to take a look at our SentencePiece extension and see whether you can use it instead.

Answer selected by xudongguan202