spacy debug data -V question #10352
-
Hello, could you please let me know if it is normal when the number of deprel tags displayed after running Also, if possible, could you please explain why |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
If each training doc contains only one sentence, then the parser does not learn to split sentences. Since a lot of training corpora provide annotation in sentences rather than longer documents, we recommend grouping them into paragraph-sized chunks for training. If you have the details for your training corpus it's probably even better if you can create real paragraphs rather than fake paragraphs in a custom conversion, but the details are usually corpus-specific so |
Beta Was this translation helpful? Give feedback.
debug data
counts the labels for the projectivized, aligned trees, so if there are a lot of misaligned tokens or non-projective trees (-V
also gives counts for this), the counts can look different. If you have a clear case where you think there's a bug in the counts, you can attach it here and we can double-check.If each training doc contains only one sentence, then the parser does not learn to split sentences. Since a lot of training corpora provide annotation in sentences rather than longer documents, we recommend grouping them into paragraph-sized chunks for training. If you have the details for your training corpus it's probably even better if you can create real paragraphs rather th…