Skip to content
Discussion options

You must be logged in to vote
  1. The GPU setting just is a variable that is filled in in the commands below, so you can check out the steps/commands to see for sure what the options should be. In all our demo projects you'd set it the GPU ID project variable to 0 to enable GPU 0. The GPU settings aren't in the config file at all, since you might want to use the same config for training with/without GPU.
  2. You can use pythainlp with the built-in ThaiTokenizer. spacy doesn't currently include a trainable tokenizer, but some languages have options to use external trainable tokenizers like pkuseg (Chinese) or sudachipy (Japanese). For any other options, you'd need to implement a custom tokenizer and train it outside of spacy.
  3. I…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by kanayer
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage General spaCy usage lang / th Thai language data and models gpu Using spaCy on GPU feat / tokenizer Feature: Tokenizer feat / tok2vec Feature: Token-to-vector layer and pretraining feat / transformer Feature: Transformer
2 participants