Skip to content
Discussion options

You must be logged in to vote

For data where you have set sentence boundaries with token.is_sent_start, you want to train a senter component instead of a parser component.

(The parser uses dependency trees to identify sentence boundaries, so in data without dependency annotation, each word looks like its own separate tree and own separate sentence.)

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by adrianeboyd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models feat / parser Feature: Dependency Parser feat / senter Feature: Sentence Recognizer
2 participants
Converted from issue

This discussion was converted from issue #12571 on April 25, 2023 11:04.