[Every](https://github.com/scrapy/scrapy/issues/2205) [now](https://github.com/scrapy/scrapy/issues/1858) and [then](https://github.com/scrapy/scrapy/issues/2730) we get a bug report about some HTML source not being parsed the way a browser would parse it. There was an idea in Scrapy of [adding an "html5" type](https://github.com/scrapy/scrapy/pull/1043) to switch to an HTML5-compliant parser. One such parser is [html5lib](https://github.com/html5lib/html5lib-python), which can be used with lxml.
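
For illustration, here is a minimal sketch of what "html5lib with lxml" can look like, assuming both packages are installed. The helper name `parse_html5` and the sample markup are hypothetical and not part of Scrapy or of the linked pull request; the sketch only shows html5lib building an lxml tree that can then be queried with XPath.

```python
import html5lib

# Hypothetical sample: the <td> has no enclosing <tr>/<tbody>, and <p> is never closed.
BROKEN_HTML = "<table><td>cell</td></table><p>unclosed"


def parse_html5(markup):
    """Parse markup with html5lib and return an lxml element tree.

    treebuilder="lxml" needs lxml to be installed.
    namespaceHTMLElements=False keeps tag names unqualified,
    so plain XPath expressions such as //p work without a namespace map.
    """
    return html5lib.parse(markup, treebuilder="lxml", namespaceHTMLElements=False)


tree = parse_html5(BROKEN_HTML)

# The HTML5 parsing algorithm inserts the implied <tbody> and <tr>, as a browser does.
cells = tree.xpath("//table/tbody/tr/td/text()")
paragraphs = tree.xpath("//p/text()")
print(cells, paragraphs)
```

The point of the sketch is that the resulting tree follows the HTML5 error-recovery rules (implied `tbody` and `tr`, auto-closed `p`), which is the behaviour the bug reports above expect.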