perf(runtime): rewrite case-change in C++ to skip UTF-8 round-trip by cirospaciari · Pull Request #26773 · oven-sh/bun

cirospaciari · 2026-02-06T08:57:41Z

Summary

Follow-up to #26772, which introduced the 11 case-changing utility methods in Zig.

Rewrites the implementation from Zig to C++, eliminating two unnecessary allocations and transcoding steps per call
The Zig implementation converted every JS string to UTF-8 via bunstr.toUTF8(allocator), processed codepoints, then converted back via bun.String.cloneUTF8(result_bytes). The new C++ implementation works directly with JSC's native string encoding (Latin1 or UTF-16) using StringView, StringBuilder, and ICU — same pattern as stripANSI.cpp
New CaseChange.cpp + CaseChange.h with the algorithm templated on Latin1Character/UChar; 11 functions registered directly in bunObjectTable as C++ host functions; deleted string_case.zig and removed the icu_toUpper/icu_toLower C-extern bridge wrappers

Test plan

All 2098 existing case-change tests pass (bun bd test test/js/bun/util/case-change.test.ts)
No regressions in other tests

Changelog

…5087) Add Bun.camelCase, pascalCase, snakeCase, kebabCase, constantCase, dotCase, capitalCase, trainCase, pathCase, sentenceCase, and noCase matching the change-case npm package. Uses ICU for full Unicode support and bun.strings.UnsignedCodepointIterator for codepoint iteration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…-trip Rewrites the 11 case-changing utility methods (camelCase, pascalCase, snakeCase, etc.) from Zig to C++, eliminating two unnecessary allocations and transcoding steps. The Zig implementation converted every JS string to UTF-8 via bunstr.toUTF8(allocator), processed codepoints, then converted back via bun.String.cloneUTF8(result_bytes) — two unnecessary allocations + transcoding for every call. Move to C++ and work directly with the JSC string's native encoding (Latin1 or UTF-16) using StringView, StringBuilder, and ICU — same pattern as stripANSI.cpp. - New: CaseChange.cpp + CaseChange.h with the case-change algorithm templated on Latin1Character/UChar - Wiring: 11 functions registered directly in bunObjectTable as C++ host functions - Cleanup: Deleted string_case.zig and all Zig/C++ bridge wiring (icu_toUpper/icu_toLower wrappers) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

robobun · 2026-02-06T08:57:51Z

^{Updated 1:46 AM PT - Feb 6th, 2026}

❌ @cirospaciari, your commit a669587 has 3 failures in Build #36644 (All Failures):

test/js/node/http/node-http-backpressure.test.ts - SIGKILL on 🐧 3.23 x64-baseline
test/js/node/http/node-http-backpressure.test.ts - SIGKILL on 🐧 25.04 x64-baseline
test/integration/next-pages/test/dev-server.test.ts - 1 failing on 🍎 14 aarch64
test/integration/next-pages/test/dev-server.test.ts - 1 failing on 🍎 13 aarch64
test/js/bun/util/case-change.test.ts - 1 failing on 🍎 13 aarch64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🍎 14 x64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🍎 14 aarch64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🍎 13 x64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🪟 2019 x64-baseline (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🪟 2019 x64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 13 x64-asan (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 13 aarch64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 25.04 aarch64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 3.23 aarch64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 3.23 x64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 3.23 x64-baseline (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 13 x64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 25.04 x64 (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 25.04 x64-baseline (new)
test/js/bun/util/case-change.test.ts - 1 failing on 🐧 13 x64-baseline (new)

🧪 To try this PR locally:

bunx bun-pr 26773

That installs a local version of the PR into your bun-26773 executable, so you can run:

bun-26773 --bun

coderabbitai · 2026-02-06T09:05:31Z

Walkthrough

Adds eleven string case-transformation functions (camelCase, pascalCase, snakeCase, kebabCase, constantCase, dotCase, capitalCase, trainCase, pathCase, sentenceCase, noCase) to Bun. Includes TypeScript declarations, native C++ implementations with character classification and separator logic, registration in the Bun object, and comprehensive test suite validating cross-compatibility.

Changes

Cohort / File(s)	Summary
TypeScript Type Declarations `packages/bun-types/bun.d.ts`	Added type signatures for 11 string case-transformation functions to the Bun module, each accepting a string input and returning a string output.
C++ Implementation `src/bun.js/bindings/CaseChange.h`, `src/bun.js/bindings/CaseChange.cpp`	Implemented case conversion logic with character classification, word boundary detection, and per-style separators. Created 11 host functions wrapping a templated `convertCase` function supporting both Latin1 and UTF-16 string inputs.
Bun Object Integration `src/bun.js/bindings/BunObject.cpp`	Registered 11 new case-transformation functions in the BunObject's export lookup table and included the CaseChange header.
Test Suite `test/js/bun/util/case-change.test.ts`	Added comprehensive test coverage including compatibility matrix against the change-case library, round-trip conversions, idempotency checks, edge cases (empty strings, unicode, acronyms), and error handling validation.

🚥 Pre-merge checks | ✅ 2

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately describes the main change: rewriting case-change functions from Zig to C++ to eliminate UTF-8 round-trips, which aligns with the substantial refactoring across multiple files (CaseChange.cpp/h, BunObject.cpp, and test additions).
Description check	✅ Passed	PR description includes both required sections with clear explanation of changes and test plan, though changelog placeholder is empty.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@src/bun.js/bindings/CaseChange.cpp`:
- Around line 236-266: The per-codepoint mapping using u_toupper/u_tolower
inside the loop (see getTransform, WordTransform, u_toupper, u_tolower,
builder.append, CharType/Latin1Character) must be replaced with ICU full-string
case mappings to handle multi-codepoint expansions and context/locale rules;
extract the word slice from input, call u_strToUpper/u_strToLower or use
UCaseMap to transform the whole word (or transform first character + remainder
for Capitalize) into a temporary UTF-16/UTF-32 buffer, then append that
transformed string to builder (handling length changes and encoding conversion)
instead of appending per-codepoint results. Ensure Capitalize semantics use
full-mapping for the first grapheme/character and full-mapping lowercase for the
rest, and thread locale/flags through the UCaseMap calls if applicable.

src/bun.js/bindings/CaseChange.cpp

test/js/bun/util/case-change.test.ts

cirospaciari and others added 2 commits February 6, 2026 00:31

cirospaciari requested a review from alii as a code owner February 6, 2026 08:57

github-actions bot added the claude label Feb 6, 2026

coderabbitai bot reviewed Feb 6, 2026

View reviewed changes

src/bun.js/bindings/CaseChange.cpp Show resolved Hide resolved

claude bot reviewed Feb 6, 2026

View reviewed changes

test/js/bun/util/case-change.test.ts Show resolved Hide resolved

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(runtime): rewrite case-change in C++ to skip UTF-8 round-trip#26773

perf(runtime): rewrite case-change in C++ to skip UTF-8 round-trip#26773
cirospaciari wants to merge 2 commits intomainfrom
ciro/case-change-cpp

cirospaciari commented Feb 6, 2026 •

edited

Loading

Uh oh!

robobun commented Feb 6, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Feb 6, 2026 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cirospaciari commented Feb 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Changelog

Uh oh!

robobun commented Feb 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai bot commented Feb 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cirospaciari commented Feb 6, 2026 •

edited

Loading

robobun commented Feb 6, 2026 •

edited

Loading

coderabbitai bot commented Feb 6, 2026 •

edited

Loading