-
Notifications
You must be signed in to change notification settings - Fork 25.6k
fsst wildcard #132482
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
fsst wildcard #132482
Conversation
|
@martijnvg This PR currently re-samples from the whole dataset during a merge. So it's doing more iterations over doc values than is ideal. It also has a single symbol table per segment. I ran elastic/logs against baseline, but unfortunately ran both baseline and contender with patterned_text=true. Though this shouldn't effect the results much. It results in a 3.16% reduction in total index size. This was mostly in the fields
Overall metrics Url field data sizes |
|
And here are the timing stats from the above benchmark comparison. Click to see the details |
No description provided.