How do I split two combined words that have no separators? #10972
Answered
by
pmbaumgartner
moonman239
asked this question in
Help: Coding & Implementations
I've got a dataset of forum posts. Sometimes users will forget to use the space bar, resulting in spaCy being unable to recognize the words. For example, sometimes they'll type "disneyworld" instead of "disney world." I have a function that works well for n=2 words, but not necessarily for anything greater than that; in fact, I cannot be sure it works at all for n>2. Here's the function:
Answered by
pmbaumgartner
Jun 16, 2022
Replies: 1 comment
You could try using symspell, which I cover in a bit more detail in this discussion.
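To illustrate the general idea of segmenting a run of letters into more than two words, here is a minimal sketch of dictionary-based word segmentation using dynamic programming. This is not symspell's implementation and does not use its API; it is a swapped-in, standard-library-only technique. The vocabulary `WORD_COUNTS` and its counts are made up for illustration, the `segment` helper is hypothetical, and the input is assumed to be lowercase already.

```python
import math

# Hypothetical toy vocabulary with made-up counts.
# In practice these would come from a frequency list built from a real corpus.
WORD_COUNTS = {
    "disney": 50,
    "world": 200,
    "land": 80,
    "is": 1000,
    "fun": 150,
}
TOTAL = sum(WORD_COUNTS.values())
MAX_WORD_LEN = max(len(word) for word in WORD_COUNTS)


def segment(text):
    """Split `text` into vocabulary words, maximizing the summed log-probability.

    Returns a list of words, or None if no full segmentation exists.
    """
    n = len(text)
    # best[i] holds (score, start index of the last word) for text[:i].
    best = [(-math.inf, 0)] * (n + 1)
    best[0] = (0.0, 0)

    for end in range(1, n + 1):
        for start in range(max(0, end - MAX_WORD_LEN), end):
            word = text[start:end]
            if word in WORD_COUNTS and best[start][0] > -math.inf:
                score = best[start][0] + math.log(WORD_COUNTS[word] / TOTAL)
                if score > best[end][0]:
                    best[end] = (score, start)

    if best[n][0] == -math.inf:
        return None  # some part of the text is not covered by the vocabulary

    # Walk the back-pointers from the end to recover the words.
    words = []
    i = n
    while i > 0:
        start = best[i][1]
        words.append(text[start:i])
        i = start
    return words[::-1]


print(segment("disneyworld"))
print(segment("disneyworldisfun"))
```

Because the search considers every split point rather than a single one, the same function covers n=2 and n>2 words. It does not correct misspellings, which is something a library such as symspell may handle better.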
Answer selected by
polm