Clear the Mamba cache table if the running queue is empty #2046

Wei-Lin-Intel · 2025-10-17T14:39:12Z

This PR fixed the corner case when all the sequences in the running queue are clear, the indices of Mamba cache table would be out of bound.
Hence it fixed the accuracy issue when bs > 256. Now with bs=512, the scores of gsm8k task are stable:

vllm (pretrained=/data/Qwen3-Next-80B-A3B-Instruct,trust_remote_code=True,enable_expert_parallel=True,tensor_parallel_size=4,distributed_executor_backend=mp,max_length=16384,max_gen_toks=2048,max_num_seqs=512,max_num_prefill_seqs=16), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: 512

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.9386	±	0.0066
		strict-match	5	exact_match	↑	0.8878	±	0.0087

czhu15

LGTM

Clear the Mamba cache table if the running queue is empty

ac485fa

Wei-Lin-Intel requested review from PatrykWo, afierka-intel, jikunshang, kzawora-intel, madamczyk-intel, mgawarkiewicz-intel, michalkuligowski, mswiniarsk, vivekgoe and xuechendi as code owners October 17, 2025 14:39

czhu15 approved these changes Oct 20, 2025

View reviewed changes

czhu15 merged commit 5813d47 into HabanaAI:aice/v1.22.0 Oct 20, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clear the Mamba cache table if the running queue is empty #2046

Clear the Mamba cache table if the running queue is empty #2046

Uh oh!

Wei-Lin-Intel commented Oct 17, 2025 •

edited by github-actions bot

Loading

Uh oh!

czhu15 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Clear the Mamba cache table if the running queue is empty #2046

Clear the Mamba cache table if the running queue is empty #2046

Uh oh!

Conversation

Wei-Lin-Intel commented Oct 17, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

czhu15 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Wei-Lin-Intel commented Oct 17, 2025 •

edited by github-actions bot

Loading