rel_component issue with training my data #9567
-
I hope you are doing well. I watched video SPACY v3: Custom trainable relation extraction component and followed all the steps to train the rel_component spacy with my custom data. Training is working for unshuffled data but when data is shuffled, I faced weird message during training (spacy project run train_cpu) and error is as below and i am also attaching the screenshot of that "could not determine any instance in the doc" Is this any possible error. Can you help me to understand this? train.txt and dev.txt are attached for reference. Any help is appreciated. Thanks |
Beta Was this translation helpful? Give feedback.
Replies: 7 comments 9 replies
-
Please don't include screenshots of text, just copy/paste it as text. The error means what it says, namely that there are no annotations for it to predict. This can happen if there are no annotations, or no NER annotations, on your training docs for some reason. Can you share your config?
How are you changing between shuffling and not shuffling data? Unless you're using a custom data loader to stream data data is shuffled by default. |
Beta Was this translation helpful? Give feedback.
-
Thanks lot polm. Here is the attached files for train and dev files which includes annotations. Could you see the below files I created the .spacy files from the above txt files. Question about this point: Can you share your config? |
Beta Was this translation helpful? Give feedback.
-
Thanks polm. I used the binary_converter.py file for converting .txt file into .spacy file. I have modified the script to make the train, dev and test.spacy using the loop. Just a minor change.
|
Beta Was this translation helpful? Give feedback.
-
Answers to the Couple more questions from me
|
Beta Was this translation helpful? Give feedback.
-
Some issues to be addresses
|
Beta Was this translation helpful? Give feedback.
-
Hi Polm, thanks for the answer. I would like to ask you below question
|
Beta Was this translation helpful? Give feedback.
-
Thanks polm very much and my issue is resolved. |
Beta Was this translation helpful? Give feedback.
Please don't include screenshots of text, just copy/paste it as text.
The error means what it says, namely that there are no annotations for it to predict. This can happen if there are no annotations, or no NER annotations, on your training docs for some reason. Can you share your config?
How are you changing between shuffling and not shuffling data? Unless you're using a custom data loader to stream data data is shuffled by default.