Skip to content

Add new Python template - Scrapy & Playwright #252

@vdusek

Description

@vdusek

Can you check why our Beautiful Soup template fails on tripadvisor.com? https://console.apify.com/actors/jWYbXHu32SvZf1Cgb/runs/0IYh4rWH9Ig2vIUSM#output

  • Solution: We can provide a new Scrapy Actor template using a headless browser like Playwright.
  • PyPI packages: scrapy and scrapy-playwright.
  • The integration of Playwright into the Scrapy project is pretty simple, scrapy-playwright provides a Scrapy component ScrapyPlaywrightDownloadHandler, which needs to be added to the project.
  • Check the Web scraping with Scrapy blog post for more information and inspiration.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request.t-toolingIssues with this label are in the ownership of the tooling team.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions