Skip to content

Add methods to fixup html charset to utf-8#110

Closed
sftse wants to merge 13 commits intostalwartlabs:mainfrom
sftse:replace-html-charset
Closed

Add methods to fixup html charset to utf-8#110
sftse wants to merge 13 commits intostalwartlabs:mainfrom
sftse:replace-html-charset

Conversation

@sftse
Copy link
Copy Markdown
Contributor

@sftse sftse commented May 14, 2025

Fixes #109

@sftse sftse force-pushed the replace-html-charset branch from af64e55 to 468ff8f Compare May 14, 2025 12:05
@sftse sftse force-pushed the replace-html-charset branch 2 times, most recently from 21d5104 to b40e3d7 Compare May 15, 2025 13:32
@sftse sftse force-pushed the replace-html-charset branch from b40e3d7 to e9138b2 Compare May 15, 2025 13:44
@sftse
Copy link
Copy Markdown
Contributor Author

sftse commented Oct 1, 2025

It would be nice if there was some indication that the charset could be wrong, but I don't think this PR is valuable anymore, it is also incorrect because <META> is not covered and at that point it's best to use a real html parser.

@sftse sftse closed this Oct 1, 2025
@sftse sftse deleted the replace-html-charset branch October 1, 2025 08:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Charset attribute can be out-of-sync with actual encoding after decoding to utf8

1 participant