Motivation.
Improved performance of long input/long output scenarios (input 2500, output 900) in high-concurrency scenarios (concurrency 4, 10). E2E performance improved by 20%.
Proposed Change.
Please provide the detailed design document of the RFC using the template.
Feedback Period.
No response
CC List.
No response
Any Other Things.
No response
Before submitting a new issue...