Conversation

@harshaljanjani
Contributor

What does this PR do?

The following issues were identified and fixed in this PR:

  1. SwitchTransformersConfig incorrectly created sparse (MoE) layers when num_sparse_encoder_layers=0 and num_layers=1, due to the previous step-computation logic. The modeling code's sparse_step == 1 condition would then trigger, creating a sparse layer even though none was requested. This is fixed by setting encoder_sparse_step = 0 and decoder_sparse_step = 0 when no sparse layers are requested, a case the modeling code already handles correctly via if sparse_step > 0 else False.
  2. Updated an outdated comment in audio_utils.py claiming that spectrogram() does not support batching, even though spectrogram_batch() already exists. The comment is now a note referencing the batch function.
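The fix in item 1 can be sketched as follows. This is a simplified, standalone sketch: the helper names and the exact modulo formula for marking a layer sparse are illustrative assumptions, not the library's verbatim source; only the `if sparse_step > 0 else False` guard is taken from the PR description.

```python
def encoder_sparse_step(num_layers: int, num_sparse_encoder_layers: int) -> int:
    """Compute the sparse-layer interval for the encoder (illustrative).

    Before the fix, the no-sparse-layers case could fall through to a
    step of 1 for a single-layer model, which the modeling code reads
    as "every layer is sparse". After the fix, 0 means "no sparse
    layers at all".
    """
    if num_sparse_encoder_layers > 0:
        return num_layers // num_sparse_encoder_layers
    return 0  # no sparse layers requested


def is_sparse_layer(layer_idx: int, sparse_step: int) -> bool:
    # The modulo condition is an assumed placement rule; the
    # "if sparse_step > 0 else False" guard is the behavior the
    # modeling code relies on, per the PR description.
    return (layer_idx % sparse_step == sparse_step - 1) if sparse_step > 0 else False


# Single-layer model with no sparse layers requested: step is 0,
# so no layer is marked sparse.
step = encoder_sparse_step(num_layers=1, num_sparse_encoder_layers=0)
assert step == 0
assert not is_sparse_layer(0, step)

# Sparse layers requested: step > 0, so sparse layers appear at the interval.
step = encoder_sparse_step(num_layers=12, num_sparse_encoder_layers=3)
assert step == 4
assert is_sparse_layer(3, step)
```

With the guard in place, a config that requests zero sparse layers produces zero sparse layers regardless of num_layers, which is the behavior the issue asked for.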

Fixes #43335.

Before submitting

cc: @Rocketknight1

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: switch_transformers

@github-actions
Contributor

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=43336&sha=372d5d

@harshaljanjani harshaljanjani marked this pull request as ready for review January 17, 2026 13:48
@harshaljanjani
Contributor Author

The failing tests are unrelated to this change; I'd appreciate a review when you get a chance, thanks!

Development

Successfully merging this pull request may close these issues.

[BUG] SwitchTransformersConfig creates sparse layer when num_sparse_encoder_layers=0 with single layer model