SciSpaCy vs. SpaCy Entity Linker #7883
Replies: 1 comment 5 replies
-
You're right that the ScispaCy's entity linker mainly looks at string overlap, while spaCy's entity linker relies mostly on the context of a sentence to determine similarity. The two will be quite different in performance, and I can imagine that it depends on the use-case which is better. If the entities you're linking have high variability & ambiguity in their synonyms, spaCy's Machine Learning approach might make more sense. But if the variety isn't as high, ScispaCy's approach could potentially yield higher precision. Some kind of ensemble would probably be even more ideal, but to my knowledge this hasn't been done yet. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I’m using SciSpaCy to link genes to UMLS entities. On the other hand, SpaCy released EntityLinker and I’ve been thinking to improve the baseline SciSpaCy by training a better EL based on MedMentions dataset. It seems that KnowledgeBase defined in SciSpaCy is a standalone class, so all SpaCy functionalities are not available, does it correct?
In this situation, what’s the best way to develop a supervised EL leveraging both SpaCy and SciSpacy? --If I create a KB from UMLS and train an Entity Linker based on SpaCy, what I will miss in comparison to SciSpacy EL?
Beta Was this translation helpful? Give feedback.
All reactions