Skip to content
Discussion options

You must be logged in to vote

Noun chunks require both token.pos and a parse, and POS usually comes from either tagger+attribute_ruler or morphologizer in the provided trained pipelines. (Sorry, the error message looks like it's gotten a bit out-of-date.)

For German, it's: tok2vec/transformer, morphologizer, parser. Both of those listen to the same transformer and you don't want to duplicate the transformer component that many times with replace_listeners (it would be both huge and slow), so instead it would be better to use a custom name for the spancat's transformer component and use that as upstream for the spancat listener instead. Put all the new components after the existing frozen components. So:

pipeline = ["t…

Replies: 1 comment 8 replies

Comment options

You must be logged in to vote
8 replies
@phlobo
Comment options

@phlobo
Comment options

@adrianeboyd
Comment options

@phlobo
Comment options

@adrianeboyd
Comment options

Answer selected by phlobo
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / transformer Feature: Transformer feat / spancat Feature: Span Categorizer
2 participants