Skip to content
Discussion options

You must be logged in to vote

What version of spacy are you using (spacy info)? What components are in your pipeline (nlp.pipe_names)? Is there a component that sets sentence boundaries (sentencizer, senter, parser, custom)?

The most recent versions of spacy v2 and all versions of spacy v3 should raise an error instead of returning None, but I think that releases that are over a year old would return None if the doc doesn't have any sentence boundaries.

It's possible the rule-based sentencizer would work well enough for this example text because there are periods at the ends of the lines, but you may need a custom component for this kind of text, which doesn't contain the kinds of full (newspaper-ish) sentences that the

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by polm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage General spaCy usage feat / pipeline Feature: Processing pipeline and components feat / sentencizer Feature: Sentencizer (rule-based sentence segmenter)
2 participants
Converted from issue

This discussion was converted from issue #8926 on August 11, 2021 06:15.