Skip to content
Discussion options

You must be logged in to vote

Hey, thanks for your question!

All information generated by the individual components is saved in the doc object. The SpanRuler adds all extracted spans to the doc.spans attribute. You can read more about this in our docs about processing pipelines. See here how to create a DocBin object with a list of docs, you can then save the DocBin as a .spacy file to use it for training/testing.

Looking at your code, there are some minor issues. I would recommend watching videos like these first to get a better understanding of how matching works in spaCy + having some nice examples to refer back to:

https://www.youtube.com/watch?v=BXzFAjtenHM&ab_channel=Explosion
https://www.youtube.com/watch?v=1Un…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by goonhoon
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / doc Feature: Doc, Span and Token objects
2 participants