Skip to content
Discussion options

You must be logged in to vote

Disabled components aren't trained if they're disabled in the config used for training. Training it separately and then disabling it is part of the internal collate script (and part of why we're not using spacy assemble directly, which would 90% of what we need). We want to ship senter with the pipeline but leave the default as the parser because the quality is higher.

I had a feeling I'd answered some of this before: #7624 (comment)

None of them clobber the annotations from previous components and the parser respects existing sentence boundaries.

The parser is by far the slowest, but here it probably makes sense to run evaluations with your own pipelines / data. The senter can be even fa…

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@BramVanroy
Comment options

@adrianeboyd
Comment options

@BramVanroy
Comment options

Answer selected by BramVanroy
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models feat / parser Feature: Dependency Parser feat / sentencizer Feature: Sentencizer (rule-based sentence segmenter) feat / senter Feature: Sentence Recognizer
2 participants