fix asr ut failures #41332
Conversation
Signed-off-by: Yao, Matrix <[email protected]>
  vocab_tensor = torch.arange(scores.shape[-1], device=scores.device)
- suppress_token_mask = isin_mps_friendly(vocab_tensor, self.suppress_tokens)
+ suppress_token_mask = isin_mps_friendly(vocab_tensor, self.suppress_tokens.to(scores.device))
  scores = torch.where(suppress_token_mask, -float("inf"), scores)
In multi-device cases (e.g. running across 2 devices): in the current implementation, in the assisted-decoding case, the assistant model reuses the main model's `SuppressTokensLogitsProcessor`, which places `suppress_tokens` on the same device as `input_tensor` (device 0). The assistant model ingests the main model's `encoder_outputs` and runs its decoder (in the Whisper case), but `encoder_outputs` may be on device 1 while the main model's `suppress_tokens` is on device 0, which leads to a `RuntimeError`:

RuntimeError: Expected all tensors to be on the same device, but got test_elements is on xpu:0, different from other tensors on xpu:1 (when checking argument in method wrapper_XPU_isin_Tensor_Tensor)

So, given the current implementation (the assistant model shares the main model's `SuppressTokensLogitsProcessor`), I move `suppress_tokens` to `scores.device` when doing the `isin` check.
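A minimal sketch of the masking logic described above, run on a single device for illustration (toy vocab size and suppress list are hypothetical; `torch.isin` stands in for `isin_mps_friendly`). The point is that `.to(scores.device)` aligns `suppress_tokens` with `scores` before the `isin` call, which is a no-op when they already share a device:

```python
import torch

# Toy logits for an 8-token vocabulary (shapes are illustrative).
scores = torch.zeros(1, 8)
suppress_tokens = torch.tensor([2, 5])  # tokens to mask out

# Build the full vocab index on the same device as the scores.
vocab_tensor = torch.arange(scores.shape[-1], device=scores.device)

# Moving suppress_tokens to scores.device avoids the cross-device
# RuntimeError in multi-device runs; on one device it changes nothing.
mask = torch.isin(vocab_tensor, suppress_tokens.to(scores.device))

# Suppressed tokens get -inf so they can never be sampled.
scores = torch.where(mask, -float("inf"), scores)
```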
Thanks for fixing! cc @eustlb for final checks
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
LGTM! thanks for your help 🤗
transcription_ass = pipe(sample.clone().detach(), generate_kwargs={"assistant_model": assistant_model})["text"]
transcription_non_ass = pipe(sample)["text"]
thanks for catching the incorrect inversion here!
4 tests failed with `AttributeError: 'AudioDecoder' object has no attribute 'copy'`:

pytest -rA tests/pipelines/test_pipelines_automatic_speech_recognition.py::AutomaticSpeechRecognitionPipelineTests::test_speculative_decoding_whisper_non_distil
pytest -rA tests/test_pipeline_mixin.py::AutomaticSpeechRecognitionPipelineTests::test_speculative_decoding_whisper_non_distil
pytest -rA tests/pipelines/test_pipelines_automatic_speech_recognition.py::AutomaticSpeechRecognitionPipelineTests::test_whisper_prompted
pytest -rA tests/test_pipeline_mixin.py::AutomaticSpeechRecognitionPipelineTests::test_whisper_prompted

After the fix, they all pass.
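For context on why the test fix uses `.clone().detach()`: unlike NumPy arrays, `torch.Tensor` has no `.copy()` method (only in-place `copy_`), so an independent, autograd-detached duplicate is made with `.clone().detach()`. A small sketch (the `sample` name mirrors the test above; values are illustrative):

```python
import torch

# A stand-in for the audio sample tensor used in the tests.
sample = torch.tensor([0.1, 0.2, 0.3])

# .clone() copies the data; .detach() cuts any autograd history.
duplicate = sample.clone().detach()

# Mutating the duplicate leaves the original sample untouched.
duplicate[0] = 9.0
```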
@SunMarc @ydshieh, please help review, thanks very much.