What do the labels (["I", "S"]) mean in SentenceRecognizer senter? #7858
-
For the SentenceRecognizer, what does "I" stand for? "S" I assume is the start of sentence. I am planning to use Prodigy Nightly to fine tune the SentenceRecognizer. What is reasoning behind having 3 different ways of specifying sent_start?
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
I've written docs for the new
The We've thought a bit about how we could customize further for |
Beta Was this translation helpful? Give feedback.
I've written docs for the new
sent
recipes, but I'm not sure they've made it anywhere public for users yet.S
means sentence start andI
means sentence-internal, like theI
inIOB
.The
senter
model is a word-level tagger just like thetagger
, it just happens to learnToken.is_sent_start
asI
/S
instead ofToken.tag
likeNN
orJJ
. When the annotations are set, it just setsToken.is_sent_start
asFalse
/True
forI
/S
instead of setting a tag. For spacy it's really an internal detail that you don't need to know about (you never see it when you train or run thesenter
), but prodigy is currently using the same annotation interface as for the tagger recipes (pos
andpos_manual
), so it ends up disp…