Skip to content
Discussion options

You must be logged in to vote

The parser and ner models can run into this issue if there's not much training data, or in this case I think it's also due to updating on individual examples rather than larger batches.

Punctuation shouldn't matter (I see that this error message is a bit out-of-date and still refers to some spacy v2 features), but whitespace does matter. It is hard-coded in the ner component that entity spans can't start or end with whitespace.

I strongly strongly recommend using spacy train instead of a minimal hand-written training loop. A hand-written loop is useful pedagogically to understand how training works, but you can easily run into problems once you move away from toy examples to real data.

So…

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@ckald
Comment options

@adrianeboyd
Comment options

Answer selected by ckald
@ckald
Comment options

@ckald
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / cli Feature: Command-line interface
2 participants