Skip to content

Conversation

@sdn4z
Copy link
Collaborator

@sdn4z sdn4z commented Sep 18, 2025

We were getting too much noise when parsing yarn.lock and package-lock.json files. The output is still not as clean as when we're parsing Python lock files though.

I've increased the cutoff value to 10, as it produces less noise, and also changed the default selector_method to first-letter as it's more performant and it as well returns less noise.

closes #97
BREAKING CHANGE

@sdn4z sdn4z marked this pull request as ready for review September 18, 2025 08:10
@sdn4z sdn4z requested a review from scastlara as a code owner September 18, 2025 08:10
@scastlara
Copy link
Collaborator

I am worried about a game of whac-a-mole here tweaking threshodls. How can we find out what are the best thresholds?

@scastlara
Copy link
Collaborator

nit: this isn't really a refactor, since it affects the functionality (hopefully better predictions)

@scastlara
Copy link
Collaborator

I am worried about a game of whac-a-mole here tweaking threshodls. How can we find out what are the best thresholds?

#97 (comment)

@sdn4z sdn4z closed this Sep 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Tweak thresholds for typosquat detection

2 participants