Hello,

  1. One training file should work for both components.
  2. There's no strict rule. I'd say it's fine to use the raw text from both the train and dev datasets.
  3. Yes, a JSON/JSONL file with raw text should work.
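
For points 2 and 3, here is a minimal sketch of building such a raw-text file. It assumes the thread is about spaCy pretraining, where each JSONL line is an object with a `"text"` key. The helper name `write_raw_text_jsonl`, the sample sentences, and the file name `pretrain_raw.jsonl` are made up for illustration and are not from the original answer.

```python
import json
from pathlib import Path


def write_raw_text_jsonl(texts, out_path):
    """Write one JSON object per line, each holding a single "text" key.

    Returns the number of lines written. Empty strings are skipped.
    """
    out_path = Path(out_path)
    count = 0
    with out_path.open("w", encoding="utf-8") as f:
        for text in texts:
            text = text.strip()
            if not text:
                continue  # nothing useful to pretrain on
            f.write(json.dumps({"text": text}, ensure_ascii=False) + "\n")
            count += 1
    return count


# Hypothetical stand-ins for the raw text of the train and dev sets.
train_texts = ["The first training sentence.", "Another training example."]
dev_texts = ["A sentence from the dev set."]

# Point 2: raw text from both splits goes into the same file.
n_written = write_raw_text_jsonl(train_texts + dev_texts, "pretrain_raw.jsonl")
print(f"wrote {n_written} lines")
```

If your config follows the usual spaCy v3 layout, the resulting file would typically be passed in through the `paths.raw_text` setting, but check that against your own config.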

Answer selected by shrinidhin
Labels
feat / tok2vec Feature: Token-to-vector layer and pretraining