Conversation
…zations - DynamicFetcher became 20% faster - StealthyFetcher became 99% faster - Scrapling size decreased - Code became ~400 lines shorter - Most importantly, scrapling is more stable and reliable now. - Less confusing for new users. - More...
The async version are the same tests
Added missing fields and reordered classes as well for easier comparisons
When I was caching it with the rest of the data
and docstrings
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This is a big update with many improvements across many places, but also many breaking changes for good reasons. Please read the below before updating
For many reasons, we decided that from now on, we will stop using Camoufox entirely, and we might switch back to it in the future if its development continues. If you prefer to continue using Camoufox as before this release, there are instructions for that in this section.
Previously, we were using patchright in the stealth mode inside
DynamicFetcherand its session classes. Now we removed the stealth mode from them and started using patchright insideStealthyFetcherand its session classes, with A LOT of improvements, as you will see, improving the stealth overall on top of patchright.This makes
StealthyFetcherand its session classes 101% faster than before, use less memory and space, and have ~400 lines of code shorter, but, most importantly, are more stable than when we used Camoufox before.This will also shorten the installation time of the
scrapling installcommand, reduce the size of the Docker image, improve test smoothness in GitHub's CI, and make scrapling less confusing for new users.Breaking changes
stealthargument was removed from theDynamicFetcherclass and its session class, while thehide_canvasargument was moved to theStealthyFetcherand its session classes.disable_webglargument has been moved fromDynamicFetcherto theStealthyFetcherclass and renamed asallow_webgl. All session classes as well.StealthyFetcherclass is now basically the new stealthy version ofDynamicFetcher, so the following arguments are removed:block_images,humanize,addons,os_randomize,disable_ads, andgeoip. I tried to replicate them in Chromium, but each had its own problem. This might change with upcoming releases before v0.4.Now to the good news, we have improved and fixed a lot of stuff :)
Improvements
StealthyFetcherclass and its session classes are now 101% faster than before, but now also theDynamicFetcherclass and its session class are 20% faster.Fixes
google_searchargument was creating a Google referrer even if the URL is a localhost/IP.🙏 Special thanks to our Discord community for all the continuous testing and feedback
Big shoutout to our biggest Sponsors