Norwegian stopword list needs trimming #6364
khellan
started this conversation in
Language Support
Replies: 1 comment
-
I am happy to provide a PR with an improved stopword list. Before doing so, I would like some guidelines on how exhaustive the stopword list should be. I have looked at the Danish one, since Danish and Norwegian are very similar. The Danish list includes words that I would rather not have in a Norwegian stopword list. I have a more restrictive stopword list that I could provide. Should I submit a PR with my trimmed-down stopwords, or should I add some borderline terms like those in the Danish list? (A third option is of course that I also trim the Danish list :) )
-
How to reproduce the behaviour
Look at the Norwegian stopword list. It includes country and place names such as Tyskland (Germany), Frankrike (France) and Oslo, which are not stopwords.
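A minimal sketch of how such entries could be detected and trimmed. The `SAMPLE_STOP_WORDS` set below is a hypothetical excerpt for illustration only, not the actual contents of the list, and the helper names `find_place_names` and `trim` are made up for this example. To check the real list, pass it in place of the sample.

```python
# Hypothetical excerpt for illustration; NOT the real Norwegian stopword list.
SAMPLE_STOP_WORDS = {"og", "i", "det", "som", "på", "tyskland", "frankrike", "oslo"}

# Place names mentioned in this report, lowercased for comparison.
PLACE_NAMES = {"tyskland", "frankrike", "oslo"}


def find_place_names(stop_words, place_names):
    """Return the place names that appear in the stopword list, sorted."""
    normalized = {word.lower() for word in stop_words}
    return sorted(normalized & place_names)


def trim(stop_words, unwanted):
    """Return a copy of the stopword list without the unwanted entries."""
    return {word for word in stop_words if word.lower() not in unwanted}


print(find_place_names(SAMPLE_STOP_WORDS, PLACE_NAMES))
# → ['frankrike', 'oslo', 'tyskland']
```

The same check could be extended with a longer list of country and city names to decide which borderline entries to drop.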
Your Environment
The environment is not important.