Skip to content
Discussion options

You must be logged in to vote

So that example uses Example.from_dict, but first note there's no requirement you do that at all, the only requirement is that you supply an Example. If you already have code that creates Examples from your training data you can re-use that (and I think we have code to deal with that format in our docs already).

If you want to use from_dict, it links to docs describing the required input format. NER uses the entities attribute.

Also note you can stream your corpus just by setting max_epochs = -1, and a custom corpus reader isn't required to do that. There are other things to look out for - you probably want to shuffle your data, and you may need to supply labels up front - but streaming i…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by polm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / ner Feature: Named Entity Recognizer scaling Scaling, serving and parallelizing spaCy
2 participants