Enhance Error Handling in Web Scraping #1

@sriraamav

Description:

The current web scraping logic can fail when the network is unreliable or when the structure of the target website changes. This issue focuses on making the scraping function more robust by adding retry logic, request timeouts, and better exception handling.

Tips for the issue:

  • Implement retries with exponential backoff for failed network requests.
  • Pass an explicit timeout to every call made with the requests library, since requests does not apply one by default.
  • Log failed scraping attempts for later review.
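
The three tips above could be combined roughly as follows. This is only a minimal sketch, not the project's actual scraping code: the names `retry_with_backoff`, `fetch_page`, and the `"scraper"` logger, as well as the attempt count, delays, and timeout values, are all assumptions chosen for illustration.

```python
import logging
import random
import time

logger = logging.getLogger("scraper")  # hypothetical logger name


def retry_with_backoff(func, retryable=(Exception,), max_attempts=4,
                       base_delay=1.0, sleep=time.sleep):
    """Call func() until it succeeds or max_attempts is reached.

    The wait before retry n is base_delay * 2**(n - 1) seconds plus up to
    10% random jitter, so the delay doubles after every failure.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except retryable as exc:
            # Log every failed attempt so it can be reviewed later.
            logger.warning("Attempt %d/%d failed: %s", attempt, max_attempts, exc)
            if attempt == max_attempts:
                logger.error("Giving up after %d attempts", max_attempts)
                raise
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, 0.1 * delay))


def fetch_page(url, timeout=(3.05, 10)):
    """Fetch url with a (connect, read) timeout and retries; return the body text."""
    # Third-party dependency, imported lazily so the helper above
    # can be used and tested without it.
    import requests

    def attempt():
        response = requests.get(url, timeout=timeout)
        response.raise_for_status()  # raise requests.HTTPError on 4xx/5xx
        return response.text

    return retry_with_backoff(attempt, retryable=(requests.RequestException,))
```

Note that this sketch retries every `requests.RequestException`, including HTTP 4xx responses that are unlikely to succeed on a second try; a real implementation may want to retry only connection errors, timeouts, and 5xx responses. The `sleep` parameter exists so tests can replace `time.sleep` and run without waiting.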

To do:

  • Ask us to assign the issue.
  • Once the issue is assigned, you can start working on it.
  • Create a PR.

Resource:

  • 'requests' documentation
  • Python 'logging' library

Notes:
The task is assigned on a first-come, first-served basis. The assigned contributor must report progress every 3 days to show that development is active.
