[BUG] InvalidSchema Error when Fetching URLs in getSoup and NoneType Error When Parsing HTML in getReviewText #312

@shristirwt

Description

When the code runs, getSoup raises an InvalidSchema error while fetching the URLs for IMDb movie reviews. This suggests that some of the values passed to it are not valid URLs: they may be malformed, or they may be fragments of HTML content rather than links.
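One way to confirm this diagnosis is to check each candidate before it is requested. The sketch below uses only the standard library and is not taken from the project: the helper names `is_fetchable_url` and `split_urls` are hypothetical, and it assumes that only absolute `http`/`https` URLs should ever reach getSoup.

```python
from urllib.parse import urlparse


def is_fetchable_url(candidate):
    """Return True only for absolute http(s) URLs that include a host."""
    if not isinstance(candidate, str):
        return False
    try:
        parts = urlparse(candidate.strip())
    except ValueError:
        # urlparse can reject some malformed inputs outright.
        return False
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def split_urls(candidates):
    """Separate candidates into (valid, rejected) lists for inspection."""
    valid, rejected = [], []
    for candidate in candidates:
        (valid if is_fetchable_url(candidate) else rejected).append(candidate)
    return valid, rejected
```

Logging the `rejected` list should show whether the bad values are relative paths, HTML snippets, or something else, which in turn points to where the URL extraction goes wrong.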

The getReviewText function can also raise an AttributeError when the movie_soup object is None or does not contain the expected HTML structure (e.g., no review text div). This happens when getSoup returns None because of a failed request or an invalid URL.
[Screenshot: 2024-11-03 192018]
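A possible guard for the AttributeError case is sketched below. It is not the project's actual getReviewText: the snake_case name `get_review_text` and the `css_class` parameter are hypothetical, and the class string `"text show-more__control"` is an assumption about IMDb's review markup that may not match the current page. The function relies only on the `find` and `get_text` methods that BeautifulSoup objects provide.

```python
def get_review_text(movie_soup, css_class="text show-more__control"):
    """Return the review text, or None when it cannot be extracted.

    movie_soup is expected to be a BeautifulSoup object, or None if the
    fetch failed. css_class is an assumed class name for the review div.
    """
    if movie_soup is None:
        # The request failed or the URL was invalid, so there is nothing to parse.
        return None

    review_div = movie_soup.find("div", class_=css_class)
    if review_div is None:
        # The page loaded but does not contain the expected review container.
        return None

    return review_div.get_text(strip=True)
```

Callers then need to handle a None result explicitly, for example by skipping that movie, instead of failing partway through a scraping run.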

Screenshots

No response

Any additional information?

No response

What browser are you seeing the problem on?

No response

Checklist

  • I have checked the existing issues
  • I have read the Contributing Guidelines
  • I'm a GSSoC'24-Extd contributor
  • I'm a Hacktoberfest'24 contributor
  • I am willing to work on this issue (optional)

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests
