fix: Handle non utf-8 characters in OME-XML #141

weiji14 · 2025-11-22T23:41:49Z

By using String::from_utf8_lossy.

Changes:

Used another sample OME-TIFF file that has non utf8 characters in the OME-XML
Increased prefetch bytes to more than 32kb as workaround to avoid hanging when parsing the StripOffsets tag (xref Tokio hangs when opening TIFF #89)

Fixes #101

By using String::from_utf8_lossy. Changed to another sample OME-TIFF file, and increased prefetch bytes to more than 32kb to avoid hanging when parsing the StripOffsets tag.

kylebarron · 2025-11-23T02:58:53Z

Ok so to clarify the spec says that these characters should always be UTF8? And which spec are we talking about? TIFF or OME-TIFF?

This tag type is named Value:Ascii, so it seems like a mismatch that we're hitting utf8 issues. Should we rename that to Value::Text? or Value::String?

Alternatively we could store a Vec<u8> but if this is string data, then Vec<u8> seems improper

fix: Handle non utf-8 characters in OME-XML

52bda55

By using String::from_utf8_lossy. Changed to another sample OME-TIFF file, and increased prefetch bytes to more than 32kb to avoid hanging when parsing the StripOffsets tag.

weiji14 self-assigned this Nov 22, 2025

github-actions bot added the fix label Nov 22, 2025

weiji14 mentioned this pull request Nov 22, 2025

OME-TIFF: Result::unwrap() on an Err value: General("invalid utf-8 sequence of 1 bytes from index 2144") #101

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: Handle non utf-8 characters in OME-XML #141

fix: Handle non utf-8 characters in OME-XML #141

Uh oh!

weiji14 commented Nov 22, 2025 •

edited

Loading

Uh oh!

kylebarron commented Nov 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: Handle non utf-8 characters in OME-XML #141

Are you sure you want to change the base?

fix: Handle non utf-8 characters in OME-XML #141

Uh oh!

Conversation

weiji14 commented Nov 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kylebarron commented Nov 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

weiji14 commented Nov 22, 2025 •

edited

Loading