Conversation

@NicoGrande
Collaborator

Description

This PR finishes the work started by @gagika in #2753. Credits to @gagika for uncovering this regression!

This PR fixes a regression introduced in a recent commit where decoder layers began returning a tuple (hidden_states, kv_cache) even when scan_layers=False. This caused failures in SequentialBlockDecoderLayers (used in pipeline parallelism) because the subsequent layer would receive a tuple as input instead of a tensor, leading to a ValueError in normalization layers.
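To make the failure mode concrete, here is a minimal, hypothetical sketch in plain Python (decoder_layer below is a stand-in, not MaxText's actual layer code): each layer returns a (hidden_states, kv_cache) tuple, so a plain sequential chain hands a tuple to the next layer's norm-like math and fails unless the input is unpacked first.

    def decoder_layer(inputs):          # stand-in for one decoder layer
      normalized = inputs * 0.5         # norm-like op that needs an array/number
      hidden_states = normalized + 1.0
      kv_cache = None                   # placeholder for the layer's KV cache
      return hidden_states, kv_cache    # tuple returned even when scan_layers=False

    x = 1.0
    x = decoder_layer(x)                # fine once, but x is now (hidden_states, kv_cache)
    # decoder_layer(x) would now fail, because `inputs * 0.5` receives a tuple.

    # With the fix, each layer unpacks tuple inputs before doing any math:
    x = 1.0
    for _ in range(3):
      if isinstance(x, tuple):
        x = x[0]
      x = decoder_layer(x)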

Changes

  • Added a check at the beginning of the __call__ method for all Decoder Layer implementations to handle tuple inputs gracefully:
    # Unpack inputs if it's a tuple (e.g. from a previous layer returning (hidden_states, kv_cache))
    if isinstance(inputs, tuple):
      inputs = inputs[0]
      

  • Updated Qwen3NextScannableBlock to correctly unpack the (output, kv_cache) tuple inside its scan loop (see the sketch after this list).
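
For context, here is a simplified, hypothetical sketch of the scan-side unpacking using jax.lax.scan (layer_fn and the shapes below are stand-ins; the real Qwen3NextScannableBlock body differs): the scan body unpacks the layer's (output, kv_cache) tuple so the carry stays a plain array across iterations.

    import jax
    import jax.numpy as jnp

    def layer_fn(hidden_states, layer_scale):
      # Stand-in for one decoder layer: returns (output, kv_cache).
      output = hidden_states * layer_scale
      kv_cache = jnp.zeros_like(hidden_states)   # placeholder cache
      return output, kv_cache

    def scan_body(carry, layer_scale):
      # Unpack the (output, kv_cache) tuple so the next iteration's carry
      # is the hidden-state array rather than the tuple itself.
      output, kv_cache = layer_fn(carry, layer_scale)
      return output, kv_cache

    hidden_states = jnp.ones((2, 4))
    layer_scales = jnp.arange(1.0, 4.0)          # three "layers"
    final_states, caches = jax.lax.scan(scan_body, hidden_states, layer_scales)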

If the change fixes a bug:
FIXES: b/462751017

Affected Models

This fix covers the following model architectures:

  • DeepSeek (Dense & MoE)
  • Gemma, Gemma 2, Gemma 3
  • GPT-3, GPT-OSS
  • Llama 2, Llama 4
  • Mistral, Mixtral
  • Qwen 3 (Dense, MoE, and Next)
  • Simple Layer (Testing)

Tests

Internal google3 tests. This PR also adds a smoke test for the no-scan (scan_layers=False) path.
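
The sketch below shows the general shape of such a smoke test in plain Python with pytest-style assertions; FakeDecoderLayer and run_sequential_stack are hypothetical stand-ins, not MaxText's real test helpers or model classes.

    import numpy as np

    class FakeDecoderLayer:
      """Mimics a decoder layer that returns (hidden_states, kv_cache)."""

      def __call__(self, inputs):
        # The behavior under test: tolerate a tuple from the previous layer.
        if isinstance(inputs, tuple):
          inputs = inputs[0]
        hidden_states = inputs + 1.0
        kv_cache = None
        return hidden_states, kv_cache

    def run_sequential_stack(x, num_layers=4):
      """Chains layers the way a no-scan sequential decoder would."""
      for layer in [FakeDecoderLayer() for _ in range(num_layers)]:
        x = layer(x)
      return x

    def test_no_scan_stack_handles_tuple_outputs():
      out, _ = run_sequential_stack(np.zeros((2, 3)))
      # Without the tuple check, the second layer would receive a tuple and
      # its arithmetic would fail; with it, we get a plain array back.
      assert isinstance(out, np.ndarray)
      np.testing.assert_allclose(out, np.full((2, 3), 4.0))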

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have added necessary comments to my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

Remove extra spaces.

adding smoke test for no scan.
Collaborator

@NuojCheng NuojCheng left a comment


Thank you Nico! LGTM

Collaborator

@gagika gagika left a comment


Thanks Nico

@copybara-service copybara-service bot merged commit 73e449d into main Dec 3, 2025
167 of 184 checks passed
@copybara-service copybara-service bot deleted the nicogrande/fix-kvcache-unpacking branch December 3, 2025 18:13