Support "html5" type to use html5lib parser

[Every](https://github.com/scrapy/scrapy/issues/2205) [now](https://github.com/scrapy/scrapy/issues/1858) and [then](https://github.com/scrapy/scrapy/issues/2730) we get a bug report about some HTML source not being parsed as a browser would.

There was the idea in Scrapy of [adding an "html5" type](https://github.com/scrapy/scrapy/pull/1043) to switch to an HTML5 compliant parser.
One of these is [html5lib](https://github.com/html5lib/html5lib-python) that can be used with lxml.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support "html5" type to use html5lib parser #83

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Support "html5" type to use html5lib parser #83

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions