feat: enable async token counting with piscina workers #465
Port the tokenizer logic into a Piscina worker pool so token counting no longer blocks the main thread, and expose async token-counting APIs across IPC. Render live token estimates in the chat input behind a Suspense fallback, and tighten defensive assertions while adapting tests to the new async flow.
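A minimal sketch of what the pooled, async API could look like, assuming a hypothetical `tokenizer.worker.ts` whose default export does the actual encoding (file names, option values, and the `countTokens` signature are illustrative, not the PR's exact code):

```ts
// tokenizer.ts (main-thread side) — illustrative sketch only
import Piscina from "piscina";
import { resolve } from "node:path";

// One worker is enough for bursty token counting and keeps memory bounded.
const pool = new Piscina({
  filename: resolve(__dirname, "tokenizer.worker.js"),
  maxThreads: 1,
});

// Callers await the estimate instead of blocking the main thread on encoding.
export async function countTokens(model: string, text: string): Promise<number> {
  return pool.run({ model, text });
}
```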
Detect the Bun runtime and bypass Piscina so token counting works in tests. Assert encoding responses and reuse the worker logic when pooling is off. Disable Git signing in tests and align fixtures with the current adapters.

Change-Id: I12fc5053de8f2fb906a99e14b97689a6f74d0d7f
Signed-off-by: Thomas Kosiewski <[email protected]>
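The Bun bypass described above might look roughly like this; the detection check and the in-process fallback path are assumptions:

```ts
// tokenizer.ts — illustrative Bun detection and pool bypass
import Piscina from "piscina";
import { resolve } from "node:path";

const isBun = typeof process.versions.bun === "string";

// Only build the Piscina pool when worker pooling is actually usable.
const pool = isBun
  ? null
  : new Piscina({ filename: resolve(__dirname, "tokenizer.worker.js") });

export async function countTokens(model: string, text: string): Promise<number> {
  if (pool === null) {
    // Under Bun (tests), reuse the worker's encoding logic in-process.
    const { default: encode } = await import("./tokenizer.worker.js");
    return encode({ model, text });
  }
  return pool.run({ model, text });
}
```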
Introduce a bounded tokenizer fallback for mock streams to avoid hangs. Add assertions and debug logging to surface invalid tokenizer results. Switch mock scenarios to openai:gpt-5 to align with the tokenizer choice. Expose PLAYWRIGHT_ARGS in the test-e2e make target to ease overrides.
Standardize test runner to jest across CI and local development:
- Update CI workflow to use jest for coverage reporting
- Update Makefile test targets to use jest
- Update tokenizer worker to use node:assert (jest-compatible)
- Remove async/await from tokenizer.loadTokenizerModules (Promise.allSettled already returns Promise)
- Change test model to gpt-5 for future-proofing

This ensures consistent test execution between local and CI environments and prepares for better coverage reporting.
Removes the piscina dependency in favor of a custom worker pool implementation using Node.js built-in worker_threads. This reduces external dependencies while maintaining the same async tokenization functionality.

Changes:
- Added workerPool.ts to manage worker thread lifecycle and message passing
- Updated tokenizer.worker.ts to handle parentPort messages directly
- Modified tokenizer.ts to use the new run() function instead of Piscina
- Migrated unit tests from Jest to bun test for consistency
- Updated CI and the Makefile to use bun test for coverage

The custom pool creates a single persistent worker at module load time and handles request/response matching via message IDs.
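A sketch of that message-ID scheme, using the file names from the bullets above but otherwise illustrative code (the PR's actual error handling and types will differ):

```ts
// tokenizer.worker.ts — worker side, handling parentPort messages directly
import { parentPort } from "node:worker_threads";

// Placeholder for the real encoding logic (e.g. an ai-tokenizer call).
async function encode(task: { model: string; text: string }): Promise<number> {
  return Math.ceil(task.text.length / 4);
}

parentPort?.on("message", async ({ id, task }) => {
  try {
    parentPort?.postMessage({ id, result: await encode(task) });
  } catch (err) {
    parentPort?.postMessage({ id, error: (err as Error).message });
  }
});
```

```ts
// workerPool.ts — main-thread side, matching responses to requests by id
import { Worker } from "node:worker_threads";
import { resolve } from "node:path";

type Pending = { resolve: (v: unknown) => void; reject: (e: Error) => void };

let nextId = 0;
const pending = new Map<number, Pending>();

function createWorker(): Worker {
  const w = new Worker(resolve(__dirname, "tokenizer.worker.js"));
  w.on("message", (msg: { id: number; result?: unknown; error?: string }) => {
    const entry = pending.get(msg.id);
    if (!entry) return;
    pending.delete(msg.id);
    if (msg.error !== undefined) entry.reject(new Error(msg.error));
    else entry.resolve(msg.result);
  });
  w.on("error", (err) => {
    // Fail all in-flight requests if the worker itself crashes.
    for (const entry of pending.values()) entry.reject(err);
    pending.clear();
  });
  w.unref(); // don't keep the process alive just for the pool
  return w;
}

// Single persistent worker, created at module load (made lazy in a later commit).
const worker = createWorker();

export function run(task: unknown): Promise<unknown> {
  const id = nextId++;
  return new Promise((resolve, reject) => {
    pending.set(id, { resolve, reject });
    worker.postMessage({ id, task });
  });
}
```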
Increased expect timeout from 5s to 15s to accommodate worker thread encoding imports (~10s). This prevents test timeouts during startup.
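If the timeout in question is Playwright's `expect` timeout (the e2e suite appears to use Playwright per the PLAYWRIGHT_ARGS note above, but this is an assumption), the setting would live in the Playwright config along these lines:

```ts
// playwright.config.ts — illustrative; assumes the "expect timeout" is Playwright's
import { defineConfig } from "@playwright/test";

export default defineConfig({
  expect: {
    // Worker-thread encoding imports can take ~10s, so give assertions headroom.
    timeout: 15_000,
  },
});
```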
Increase delays between stream deltas to better simulate realistic streaming behavior. Spacing changes from 2x+300ms, 2x+400ms to 3x+50ms, 3x+150ms, and the final delay extends to 3x+500ms for more natural pacing in mock scenarios.
Map claude-haiku-4-5 to claude-3.5-haiku tokenizer as temporary workaround until ai-tokenizer adds native support for the newer model.
Changes the tokenizer to gracefully handle unknown model names by falling back to a similar model's tokenizer with a warning, instead of throwing an error. This prevents crashes when new models are used before tokenizer support is added. Also fixes runtime tests on macOS by resolving symlinks in temp paths (/tmp -> /private/tmp) so they match git worktree paths.
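The alias and fallback behaviour from the last two commits could be sketched like this; the alias table contents and the fallback choice are assumptions, and only the claude-haiku-4-5 → claude-3.5-haiku mapping is stated above:

```ts
// Illustrative model-to-tokenizer resolution with a warning-based fallback
const TOKENIZER_ALIASES: Record<string, string> = {
  // ai-tokenizer has no native entry for claude-haiku-4-5 yet
  "claude-haiku-4-5": "claude-3.5-haiku",
};

function resolveTokenizerModel(model: string, supported: ReadonlySet<string>): string {
  const candidate = TOKENIZER_ALIASES[model] ?? model;
  if (supported.has(candidate)) return candidate;
  // Unknown model: warn and fall back to a similar tokenizer instead of throwing,
  // so new models don't crash token counting before support lands.
  const fallback = "claude-3.5-haiku"; // illustrative default
  console.warn(`No tokenizer for ${model}; falling back to ${fallback}`);
  return fallback;
}
```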
Defer worker thread creation until first use. Previously the worker was initialized at module load time, causing unnecessary overhead. Now createWorker() is called lazily from run(), reducing startup cost.
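Relative to the workerPool.ts sketch above, the lazy-initialization change amounts to something like this (again illustrative):

```ts
// workerPool.ts — worker is now created on first run() instead of at module load
let worker: Worker | null = null;

export function run(task: unknown): Promise<unknown> {
  // Spawning the worker (and its slow encoding imports) is deferred until
  // a token count is actually requested.
  const w = (worker ??= createWorker());
  const id = nextId++;
  return new Promise((resolve, reject) => {
    pending.set(id, { resolve, reject });
    w.postMessage({ id, task });
  });
}
```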
Reduces Jest maxWorkers from 100% to 50% in CI integration tests to prevent overwhelming the CI environment with excessive parallelism. Updates comment to reflect 16 workers instead of 32 on typical runners.
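The setting itself is a standard Jest option; where it lives in this repo is an assumption, but a config-level version would look like:

```ts
// jest.config.ts — illustrative placement of the CI worker cap
export default {
  // 50% of available cores, roughly 16 workers on typical CI runners,
  // instead of 100% (32), which overwhelmed the CI environment.
  maxWorkers: process.env.CI ? "50%" : "100%",
};
```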