Sorry this isn't working well for you. Our pretrained components use data that doesn't contain newlines, so they don't always handle them well. We use whitespace augmentation when training to help with this, but for sentence boundaries in particular, the models have probably learned that only a period followed by a space terminates a sentence.

In this case I would preprocess your text to convert newlines to spaces, or, if your needs are more complicated, train a sentence recognizer.
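The preprocessing step could look something like the sketch below. It uses only the standard library, and the function name `newlines_to_spaces` and the sample text are made up for illustration. It replaces each run of line breaks, along with any spaces or tabs around it, with a single space, so that a period at the end of a line ends up followed by a space.

```python
import re


def newlines_to_spaces(text: str) -> str:
    """Replace each run of line breaks (and surrounding spaces/tabs) with one space."""
    return re.sub(r"[ \t]*[\r\n]+[ \t]*", " ", text).strip()


# Hypothetical input: sentences separated or wrapped by hard line breaks.
raw = "This is the first sentence.\nThis is the second one,\nwrapped across lines.\n"
clean = newlines_to_spaces(raw)
print(clean)

# Assuming spaCy and the en_core_web_sm pipeline are installed, the cleaned
# text can then be segmented as usual:
#   import spacy
#   nlp = spacy.load("en_core_web_sm")
#   sentences = [sent.text for sent in nlp(clean).sents]
```

Note that this changes character offsets relative to the original text, and it also merges paragraphs. If blank lines carry meaning in your data, it may work better to split on them first and process each paragraph separately.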

Answer selected by dsenkyr
Labels: lang / en (English language data and models), feat / senter (Feature: Sentence Recognizer)
This discussion was converted from issue #11388 on August 30, 2022 04:08.