You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Clearly, the optimal search strategy is to (i) rely on name filters to ensure
339
-
that searches are conducted within the appropriate NCBI division, and (ii) only
340
-
rely on fuzzy matching when the strict or lowercase match fails to return a
341
-
name, as fuzzy matching can result in order of magnitude more run time and
342
-
memory footprint. These numbers were obtained on a single Intel i7-8665U CPU (@
343
-
(1.90GHz). Using `"chimpanzees"` as the search string (one of the NCBI
344
-
recognized vernaculars for _Pan_) gave qualitatively similar results, suggesting
327
+
consequences. This is illustrated in @benchmark[Tab.]. When possible, the optimal search strategy is to (i) rely on name filters to ensure that searches are conducted within the appropriate NCBI division, and (ii) only rely on fuzzy matching when the strict or lowercase match fails to return a name, as fuzzy matching can result in order of magnitude more run time and memory footprint.
caption: [Time and performance of different search strategies for the string `"chimpanzees"`. These numbers were obtained on a single Intel i7-8665U CPU (1.90GHz). Using `"Pan"` as the search string (for which `"chimpanzees"`is a recognized vernacular) gave qualitatively similar results, suggesting
345
349
that there is no performance cost associated with working with synonyms or
0 commit comments