Making the model more robust #9096
JimDunlop
started this conversation in
Help: Best practices
Replies: 1 comment 5 replies
-
Hi! The pretrained models that we provide will only get you so far, as you've found. If your domain is significantly different, you should definitely consider retraining the models or even training them from scratch. The relevant documentation is here: https://spacy.io/usage/training |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I am using spacy for sentiment (or urgency) analysis of customer complains. I work with wordlist and get somewhat good results.
Now i want to explore more sophisticated possibiltys, like topic modeling, similaritys between texts, aspects of sentiments, etc.
The POS-Tagger and DEP-Parser seem to be rather inaccuarte with my dataset, which might be because of the direct speech of the complains, the long and complex sentences of some of those complaints, or because of our specialised domain with lots of unknown words for spacy.
So what is the best practise to get a more robust modell in my case? Is it possible without POS and DEP annotaded training-set?
Can i get better results with a custom vocab?
Theese are my first tries in enhancing the modell/pipeline, so any help is greatly appreciated.
Beta Was this translation helpful? Give feedback.
All reactions