Skip to content
Discussion options

You must be logged in to vote

Hi @lolfert! Customized noun chunk merging is out of the box not possible with merge_noun_chunks().

A spaCy-only alternative to using Textacy is a custom pipeline component post-processing noun chunks with a modified implementation of merge_noun_chunks(). If you look up the source for Textacy's noun_chunks(), you'll see it's rather straightforward (same with merge_noun_chunks()). It should be easy enough to adapt those for your purposes and write a solution achieving something similar using only spaCy.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@lolfert
Comment options

Answer selected by lolfert
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage General spaCy usage
2 participants