Skip to content

Commit d6bf6de

Browse files
ming1753Jiang-Jia-Junrainyfly
authored
[Bug Fix] Fix mm performance degradation (#3942)
* [Bug Fix] Fix mm performance degradation * formate --------- Co-authored-by: Jiang-Jia-Jun <[email protected]> Co-authored-by: chenjian <[email protected]>
1 parent 38e734e commit d6bf6de

File tree

1 file changed

+7
-1
lines changed

1 file changed

+7
-1
lines changed

fastdeploy/engine/common_engine.py

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -527,8 +527,14 @@ def _fetch_request():
527527
self.cfg.max_prefill_batch,
528528
)
529529

530+
if self.cfg.model_config.enable_mm:
531+
self.resource_manager.check_and_free_block_tables()
532+
available_blocks = self.resource_manager.available_block_num()
533+
else:
534+
available_blocks = self.cfg.cache_config.max_block_num_per_seq
535+
530536
tasks = self.scheduler.get_requests(
531-
available_blocks=self.cfg.cache_config.max_block_num_per_seq,
537+
available_blocks=available_blocks,
532538
block_size=self.cfg.cache_config.block_size,
533539
reserved_output_blocks=self.cfg.cache_config.enc_dec_block_num,
534540
max_num_batched_tokens=self.cfg.max_model_len,

0 commit comments

Comments
 (0)