How to train a textcat with 1000 labels? #8438
info2000
started this conversation in
Help: Best practices
There is no difference in data format between 1000 labels and 3 labels. You might have accuracy issues because the decision is probably harder, but you should at least be able to run training and see how it does. Note that the JSON format is just a temporary format and not particularly important; the key thing is creating the docs that are saved to a DocBin. See the conversion scripts in the example projects to see how that works.
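To make the "create docs, save them to a DocBin" step concrete, here is a minimal sketch rather than the conversion script from the example projects. The label names, example texts, output path, and the helper names `make_cats` and `write_docbin` are all made up for illustration, and it assumes English text and a label set kept in a plain Python list. Generating the `cats` dict from that list means nobody has to write out 1000 entries by hand.

```python
from pathlib import Path


def make_cats(all_labels, positive_labels):
    """Build a doc.cats-style dict: every label present, 1.0 if it applies, else 0.0."""
    positive = set(positive_labels)
    unknown = positive - set(all_labels)
    if unknown:
        raise ValueError(f"labels not in label set: {sorted(unknown)}")
    return {label: 1.0 if label in positive else 0.0 for label in all_labels}


def write_docbin(examples, all_labels, output_path):
    """Convert (text, positive_labels) pairs into a .spacy file and return the doc count."""
    # Imported here so make_cats stays usable without spaCy installed.
    import spacy
    from spacy.tokens import DocBin

    nlp = spacy.blank("en")  # tokenizer only; assumes English text
    doc_bin = DocBin()
    for text, positive_labels in examples:
        doc = nlp.make_doc(text)
        doc.cats = make_cats(all_labels, positive_labels)
        doc_bin.add(doc)
    doc_bin.to_disk(Path(output_path))
    return len(doc_bin)


# Hypothetical label set and data, for illustration only.
LABELS = [f"LABEL_{i}" for i in range(1000)]
EXAMPLES = [
    ("first example text", ["LABEL_3"]),
    ("second example text", ["LABEL_7", "LABEL_42"]),
]
```

With spaCy installed, calling `write_docbin(EXAMPLES, LABELS, "train.spacy")` should produce a file that can be passed to training as the train or dev corpus. The same code works unchanged for 3 labels; only the `LABELS` list differs.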
I want to categorize text with spaCy. Based on the textcat demo project, each item in the JSON dataset should have all the labels, like this:
But what should this JSON look like (or what is the right way to do it) when there are 1000 labels?
Thanks