[Misc] Use persistent thread pool #438

DarkSharpness · 2025-09-19T08:38:58Z

Previously, each function call of MultiThreadCompileGrammar created its own thread pool. This could be misleading, as the total number of active worker threads might significantly exceed the configured max_threads — potentially reaching up to $n \times \text{max-threads}$ for $n$ concurrent compilation tasks.

This PR changes the implementation to use a shared global thread pool across all compilation tasks in one compiler, ensuring that the number of worker threads stays within the specified limit. For different grammar compilers, they still have their own thread pool.

Note: This change may introduce performance regressions in scenarios where the old behavior implicitly allowed over-subscription of threads, as thread usage is now strictly bounded.

DarkSharpness · 2025-11-07T08:11:53Z

Updated cc @Ubospica @Seven-Streams . The rate limit policy should be refined later to achieve a balance between fairness(FIFO) and shortest-first (greedy-execution). FIFO may cause head-of-line blocking, while shortest-first may lead to starvation (worse average latency), deteriorating grammars that needs longer compilation (worse tail latency).

The old implementation creates a new thread pool per-compilation, which is closer to the latter I guess.

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

DarkSharpness force-pushed the fix_thread_pool branch from 374d844 to c6d0ce6 Compare September 19, 2025 09:03

Copilot AI review requested due to automatic review settings November 7, 2025 08:04

DarkSharpness force-pushed the fix_thread_pool branch from c6d0ce6 to a83618d Compare November 7, 2025 08:04

feat: rewrite the thread pool; set hard limit

be406a5

DarkSharpness force-pushed the fix_thread_pool branch from a83618d to be406a5 Compare November 7, 2025 08:05

Copilot AI reviewed Nov 7, 2025

View reviewed changes

DarkSharpness added 2 commits November 7, 2025 23:05

fix: fix race condition

f2b80d8

fix: fix seg fault

f7377d8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Misc] Use persistent thread pool #438

[Misc] Use persistent thread pool #438

Uh oh!

DarkSharpness commented Sep 19, 2025

Uh oh!

DarkSharpness commented Nov 7, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[Misc] Use persistent thread pool #438

Are you sure you want to change the base?

[Misc] Use persistent thread pool #438

Uh oh!

Conversation

DarkSharpness commented Sep 19, 2025

Uh oh!

DarkSharpness commented Nov 7, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant