Skip to content
Discussion options

You must be logged in to vote

Hi @Vfgandara ,

...After a lot of trial and error I've been able to generate enough data to be able to train a model that I'm really happy with (although some of the data still raises errors when calling doc.set_ents())

I'm curious as to what kind of errors you're getting when calling doc.set_ents(). Just be careful as it sounds like there are overlapping spans in the dataset/annotations. If you can paste a traceback, I'd appreciate that!

...The thing is, is there a problem with working like that?

For set_ents() vs. char_span, there shouldn't be any effect in the accuracy if that's what you meant by damaging. Although if you're just using the tokenizer, I'd suggest starting off from a…

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@svlandeg
Comment options

@Vfgandara
Comment options

@svlandeg
Comment options

@Vfgandara
Comment options

@polm
Comment options

Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / doc Feature: Doc, Span and Token objects
4 participants