Spacy 3 Sentence Segmentation #7841
Replies: 1 comment 2 replies
-
I think your best option at the moment is trying to train the SentenceRecognizer. Using a custom function to set Training the dependency parser would require a lot of hard-to-build data, so while it would work I imagine getting the data would be a problem. We don't have any examples of fine-tuning the dependency parser partly because our primary training data can't be shared publicly due to licensing issues, partly because the data is hard to build, and partly because the dependency parser rarely needs fine-tuning. There's nothing specific for sentence splitting in displaCy or Prodigy, though in Prodigy you can just mark the first token in a sentence to use as training data. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I want to fine tune the Sentence Segmentation functionality for my use case. Namely I want to handle cases where "XYZ Inc. " is treated as a sentence end but it really is just and abbreviation of Incorporated. What is the recommended way of doing this? Are there any tools in either Displacy or Prodigy that can visualize or annotate this.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions