v3.5 Matcher performance degradation #12261
-
switching from v3.4.4 to v3.5.0 with the same set of Matcher rules gives a large performance decrease.
|
Beta Was this translation helpful? Give feedback.
Replies: 7 comments 2 replies
-
Do you mean I did a fair amount of profiling related to the fuzzy matcher changes to try to make sure that we weren't introducing a performance regression, but it's possible I missed something or there's a particular case my tests didn't cover. My first guess is that it is related to caching and the changes introduced in I wouldn't be surprised by a slight performance hit (2-3%) but nothing like this. Can you provide more details about what you're doing here? |
Beta Was this translation helpful? Give feedback.
-
Thanks @adrianeboyd. I'm trying to narrow a large rule set down to a test case if possible. It is the |
Beta Was this translation helpful? Give feedback.
-
I found the cause: set operators in extension attributes. Not yet sure of the fix. rules with
rules with only
|
Beta Was this translation helpful? Give feedback.
-
I think the |
Beta Was this translation helpful? Give feedback.
-
proposed fix https://github.com/explosion/spaCy/compare/master...kwhumphreys:spaCy:matcher_fix?expand=1 |
Beta Was this translation helpful? Give feedback.
-
now just a small impact from adding extension predicates. I'll put together a PR with some tests. with
with
|
Beta Was this translation helpful? Give feedback.
-
cleaner fix now at #12272 |
Beta Was this translation helpful? Give feedback.
cleaner fix now at #12272
any recommendations for including performance tests in unit tests?