merge_noun_chunks does not discard whitespace #11207
Answered by polm
eyaler asked this question in Help: Coding & Implementations
code to reproduce:
Answered by polm on Jul 26, 2022
`noun_chunks` is not implemented using a Matcher; you can see the implementation here, but it uses dependency relations, so it gets words that are not nouns. For example, if you had `he saw the red book`, `the red book` would be a noun chunk. So this is not a bug or anything, though dependency relations for spaces are always kind of weird and arbitrary.
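
To make the point above concrete, here is a small pure-Python sketch of the general idea: a chunk is taken to run from the leftmost token in a noun's dependency subtree up to the noun itself. This is a toy model, not spaCy's actual code. The `Token` class, the `left_edge` and `noun_chunks` helpers, the label sets, and both parses (including where the whitespace token attaches) are assumptions made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    pos: str   # coarse part-of-speech tag
    head: int  # index of the syntactic head (the root points to itself)
    dep: str   # dependency label


# Assumed label sets for this toy model only.
NOUN_LIKE = {"NOUN", "PROPN", "PRON"}
CHUNK_DEPS = {"nsubj", "dobj", "pobj", "attr"}


def left_edge(tokens, i):
    """Return the index of the leftmost token in the subtree rooted at i."""
    for j in range(i):
        k = j
        # Walk up the heads until we reach either the root or token i.
        while tokens[k].head != k and k != i:
            k = tokens[k].head
        if k == i:
            return j  # j is the first (leftmost) descendant of i
    return i


def noun_chunks(tokens):
    """Collect chunks by following dependency subtrees, not by matching nouns."""
    chunks = []
    for i, tok in enumerate(tokens):
        if tok.pos in NOUN_LIKE and tok.dep in CHUNK_DEPS:
            start = left_edge(tokens, i)
            chunks.append([t.text for t in tokens[start:i + 1]])
    return chunks


# Hypothetical parse of "he saw the red book".
TOKENS = [
    Token("he", "PRON", 1, "nsubj"),
    Token("saw", "VERB", 1, "ROOT"),
    Token("the", "DET", 4, "det"),
    Token("red", "ADJ", 4, "amod"),
    Token("book", "NOUN", 1, "dobj"),
]

# Same sentence with an extra space token that the (hypothetical) parse
# happens to attach to "book".
WS_TOKENS = [
    Token("he", "PRON", 1, "nsubj"),
    Token("saw", "VERB", 1, "ROOT"),
    Token(" ", "SPACE", 5, "dep"),
    Token("the", "DET", 5, "det"),
    Token("red", "ADJ", 5, "amod"),
    Token("book", "NOUN", 1, "dobj"),
]

print(noun_chunks(TOKENS))     # [['he'], ['the', 'red', 'book']]
print(noun_chunks(WS_TOKENS))  # [['he'], [' ', 'the', 'red', 'book']]
```

In the second parse the space ends up inside the chunk only because of where it was attached in the tree, which is the "weird and arbitrary" part. In spaCy itself you can check this for a real parse by printing `token.dep_`, `token.pos_` and `token.head` for each token in a chunk.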