Fix docs_from_attrs truncating mid-UTF-8 codepoint by leighmcculloch · Pull Request #1769 · stellar/rs-soroban-sdk

leighmcculloch · 2026-03-17T12:44:26Z

What

Use floor_char_boundary to truncate doc comments at a valid UTF-8 character boundary in docs_from_attrs. Add a test that confirms a doc string where a multi-byte character straddles the truncation boundary produces valid UTF-8.

Why

The current implementation truncates doc bytes with Vec::truncate at an arbitrary byte offset, which could split multi-byte UTF-8 codepoints and store invalid UTF-8 in the contract's spec XDR.

Close #1768

Copilot

Pull request overview

This PR fixes a UTF-8 correctness bug in soroban-sdk-macros where doc-comment truncation could split multi-byte characters, potentially encoding invalid UTF-8 into the contract spec XDR.

Changes:

Truncate doc strings using floor_char_boundary so truncation always occurs at a valid UTF-8 boundary.
Add a unit test covering the case where a multi-byte character straddles the truncation boundary.

leighmcculloch added 2 commits March 17, 2026 22:18

add test for multibyte UTF-8 truncation in docs_from_attrs

9afaa33

truncate docs on char boundary using floor_char_boundary

75425fa

leighmcculloch requested a review from a team March 17, 2026 12:44

leighmcculloch marked this pull request as ready for review March 17, 2026 12:44

Copilot AI review requested due to automatic review settings March 17, 2026 12:44

Copilot started reviewing on behalf of leighmcculloch March 17, 2026 12:45 View session

Copilot AI reviewed Mar 17, 2026

View reviewed changes

leighmcculloch added 2 commits March 20, 2026 13:13

Merge branch 'main' into docs-from-attrs

ac098c2

bump rust-version to 1.91.0

288215b

leighmcculloch commented Mar 23, 2026

View reviewed changes

Comment thread Cargo.toml

leighmcculloch and others added 4 commits March 23, 2026 17:02

Merge branch 'main' into docs-from-attrs

702c122

Merge branch 'main' into docs-from-attrs

7fe4df6

update test files

88ace37

update expanded test snapshots for associated type chained

25eea4b

leighmcculloch enabled auto-merge March 23, 2026 22:35

mootz12 approved these changes Mar 23, 2026

View reviewed changes

leighmcculloch added this pull request to the merge queue Mar 23, 2026

Merged via the queue into main with commit 0e14f5c Mar 24, 2026
192 of 194 checks passed

leighmcculloch deleted the docs-from-attrs branch March 24, 2026 00:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix docs_from_attrs truncating mid-UTF-8 codepoint#1769

Fix docs_from_attrs truncating mid-UTF-8 codepoint#1769
leighmcculloch merged 8 commits intomainfrom
docs-from-attrs

leighmcculloch commented Mar 17, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

leighmcculloch commented Mar 17, 2026

What

Why

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants