Change content handling to use byte buffer instead of parsing chunks directly #299

ybalbert · 2026-01-14T22:22:21Z

This change is to prevent an error that a multi-bytes UTF-8 character gets split across multiple chunks and fails to decode. The error could happen when pasting a long paragraph without whitespace in language like Chinese:

called `Result::unwrap()` on an `Err` value: Utf8Error

Tested by building a local docker image. The error doesn't happen any more on the same content.

Refactor content handling to use BytesMut to avoid splitting multi-bytes UTF-8 across two chunks.

Refactor content handling to use a buffer to prevent UTF-8 parsing error.

szabodanika · 2026-01-15T09:42:04Z

Thanks @ybalbert ! Can you provide an example string to test this with?

ybalbert · 2026-01-15T12:40:29Z

Sure, here's the pasta (expires in 6 days) I used for testing.

ybalbert added 5 commits January 14, 2026 16:04

Change content handling to use BytesMut

9ec2ab9

Refactor content handling to use BytesMut to avoid splitting multi-bytes UTF-8 across two chunks.

updating dependency

d2bd54d

Improve content processing in edit endpoint

5c08f91

Refactor content handling to use a buffer to prevent UTF-8 parsing error.

Refactor content handling in edit.rs

d243db1

Add ErrorBadRequest import to edit.rs

0c86019

szabodanika self-requested a review January 15, 2026 09:35

szabodanika merged commit d44d15f into szabodanika:master Jan 24, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change content handling to use byte buffer instead of parsing chunks directly #299

Change content handling to use byte buffer instead of parsing chunks directly #299

Uh oh!

ybalbert commented Jan 14, 2026 •

edited

Loading

Uh oh!

szabodanika commented Jan 15, 2026

Uh oh!

ybalbert commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Change content handling to use byte buffer instead of parsing chunks directly #299

Change content handling to use byte buffer instead of parsing chunks directly #299

Uh oh!

Conversation

ybalbert commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

szabodanika commented Jan 15, 2026

Uh oh!

ybalbert commented Jan 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ybalbert commented Jan 14, 2026 •

edited

Loading