Could not locate any .jsonl files in path 'pretest_data.spacy'. How can i make a file for pretraining in spacy 3? #10871
-
Hello, First of all, thank you very much for all your work and the new features supported by v3. I'm having trouble with the pre-training data. This is what explains espaciy for pretraining . {"text": "Can I ask where you work now and what you do, and if you enjoy it?"} There is something I am not understanding well
I have tried to pass the pretest_data to .jsonl and/or .spacy, but it always gives me the same error. Error |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
You don't show the part where you save the file, but if you are saving You should do something like this:
If you want to make a spaCy file see the serialization docs. |
Beta Was this translation helpful? Give feedback.
You don't show the part where you save the file, but if you are saving
pretest_data
as json to a file that won't work because it's not jsonl, it's just a JSON blob. You can tell because it starts with a[
and probably doesn't have newlines.You should do something like this:
If you want to make a spaCy file see the serialization docs.