
Conversation

@sbhavani (Contributor) commented Jan 7, 2026

Description

The te_llama.py example fails with HF transformers 4.57+ due to a breaking change in how decoder layer outputs are handled. In transformers 4.57+, the LlamaModel forward loop changed, causing TELlamaDecoderLayer to fail because it returned a tuple (tensor,) instead of the tensor directly.
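
For illustration, the shape of the loop change (a simplified sketch, not verbatim transformers source; argument names are illustrative):

    # transformers < 4.57: the LlamaModel loop unpacks a tuple from each layer.
    for decoder_layer in self.layers:
        layer_outputs = decoder_layer(hidden_states, attention_mask=causal_mask)
        hidden_states = layer_outputs[0]

    # transformers >= 4.57: the loop assigns the layer's return value directly,
    # so a layer that still returns (tensor,) makes hidden_states a tuple and
    # later tensor ops (e.g. .contiguous()) fail.
    for decoder_layer in self.layers:
        hidden_states = decoder_layer(hidden_states, attention_mask=causal_mask)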

Fixes #2567

Type of change

  • Documentation change (change only to the documentation, either a fix or a new content)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Infra/Build change
  • Code refactoring

Changes

Please list the changes introduced in this PR:

  • Handle case where hidden_states is passed as a tuple (for backward compatibility with older HF versions)
  • Return tensor directly instead of wrapping in tuple (required for HF transformers >= 4.57; see the sketch after this list)
  • Fix regex SyntaxWarning by using raw string prefix (r"model.layers.\d+.")
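
A minimal sketch of the updated layer along these lines (constructor omitted; signature and argument names are illustrative, not a verbatim copy of te_llama.py):

    import transformer_engine.pytorch as te

    class TELlamaDecoderLayer(te.TransformerLayer):
        def forward(self, hidden_states, *args, attention_mask=None, **kwargs):
            # Backward compatibility: older HF versions may pass the previous
            # layer's tuple output straight through.
            if isinstance(hidden_states, tuple):
                hidden_states = hidden_states[0]
            hidden_states = super().forward(hidden_states, attention_mask=attention_mask)
            # transformers >= 4.57 expects the tensor itself, not (tensor,).
            return hidden_states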

Checklist:

  • I have read and followed the contributing guidelines
  • The functionality is complete
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

Testing

Tested with:

  • transformer_engine 2.5.0+f05f12c
  • transformers 4.57.3
  • nvcr.io/nvidia/pytorch:25.08-py3

…= 4.57

The te_llama.py example was failing with HuggingFace transformers 4.57+
due to API changes in how decoder layer outputs are handled.

Changes:
- Handle case where hidden_states is passed as a tuple (older HF versions)
- Return tensor directly instead of wrapped in tuple (HF 4.57+ expects this)
- Fix regex pattern to use raw string (fixes SyntaxWarning)

Error fixed:
  AttributeError: 'tuple' object has no attribute 'contiguous'

Tested with:
- transformer_engine 2.5.0
- transformers 4.57.3
- PyTorch container nvcr.io/nvidia/pytorch:25.08-py3

Signed-off-by: Santosh Bhavani <[email protected]>
@sbhavani force-pushed the fix/te-llama-hf-transformers-457-compat branch from 2ca056c to a65fa49 on January 7, 2026 16:46
greptile-apps bot (Contributor) commented Jan 7, 2026

Greptile Summary

This PR fixes compatibility issues with HuggingFace transformers >= 4.57 in the te_llama.py example.

The main changes:

  • Added tuple unpacking for hidden_states input to maintain backward compatibility with older transformers versions that passed outputs as tuples
  • Changed return value from (tensor,) to tensor directly, which is required for transformers 4.57+ where the LlamaModel forward loop expects tensors rather than tuple-wrapped outputs
  • Fixed SyntaxWarning in regex pattern by using raw string prefix (r"model.layers.\d+.")

The fix properly addresses the reported AttributeError: 'tuple' object has no attribute 'contiguous' error that occurred when running the example with transformers 4.57.3.

Confidence Score: 4/5

  • This PR is safe to merge with minimal risk
  • The changes are focused and directly address the reported compatibility issue. The fix is well-documented with inline comments. One minor concern is the lack of empty tuple validation when unpacking hidden_states[0], though this is unlikely to cause issues in practice given HuggingFace's API guarantees.
  • No files require special attention

Important Files Changed

| Filename | Overview |
| --- | --- |
| docs/examples/te_llama/te_llama.py | Fixed transformers 4.57+ compatibility by handling tuple inputs and returning tensors directly, plus fixed regex warning |

Sequence Diagram

sequenceDiagram
    participant HF as HuggingFace LlamaModel
    participant TELayer as TELlamaDecoderLayer
    participant TE as TransformerEngine TransformerLayer
    
    Note over HF,TE: Transformers < 4.57 (Old Behavior)
    HF->>TELayer: forward(hidden_states)
    Note over TELayer: hidden_states is tensor
    TELayer->>TE: super().forward(hidden_states, ...)
    TE-->>TELayer: returns tensor
    TELayer-->>HF: returns (tensor,)
    Note over HF: Extracts with layer_outputs[0]
    
    Note over HF,TE: Transformers >= 4.57 (New Behavior)
    HF->>TELayer: forward((hidden_states,))
    Note over TELayer: Check if tuple and unpack [0]
    TELayer->>TELayer: hidden_states = hidden_states[0]
    TELayer->>TE: super().forward(hidden_states, ...)
    TE-->>TELayer: returns tensor
    TELayer-->>HF: returns tensor directly
    Note over HF: Uses tensor directly (no unpacking)

greptile-apps bot (Contributor) left a comment

Additional Comments (1)

  1. docs/examples/te_llama/te_llama.py, lines 77-78

    logic: No check for empty tuple before accessing [0]. If hidden_states is an empty tuple, this will raise an IndexError.
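
A sketch of the guard being suggested (illustrative only; the merged example may reasonably omit it given HF's API guarantees):

    # Defensive unpacking with an explicit empty-tuple check, per the comment above.
    if isinstance(hidden_states, tuple):
        if not hidden_states:
            raise ValueError("expected a non-empty tuple for hidden_states")
        hidden_states = hidden_states[0]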

1 file reviewed, 1 comment


@sudhakarsingh27 self-requested a review on January 7, 2026 22:35
@sudhakarsingh27 (Collaborator)

Thanks for fixing, @sbhavani! LGTM.
(I guess the other example, te_gemma, would also need a change; let me take care of that and fix its other pending issues.)

@sudhakarsingh27 (Collaborator) left a comment

Actually, fixing this for 4.57+ would break it for previous versions then?

@sbhavani (Author, Contributor) commented Jan 8, 2026

Actually, fixing this for 4.57+ would break it for previous versions then?

Nope, it handles both the previous and current versions of transformers. I think we should pin the version to the latest transformers, as both the TE and transformers APIs are constantly changing.

@sudhakarsingh27 (Collaborator) left a comment

Okay I see that it correctly handles version dependencies.

I agree with pinning library versions. Would you be open to:

  1. Creating a requirements.txt file with correct versions for the TE, huggingface, accelerate, peft, and datasets libraries
  2. Adding a small section at the start of the tutorial that mentions installing the prereqs with pip install -r requirements.txt?

:)

(I did that for te_gemma for your reference)
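
For instance, a hypothetical requirements.txt along those lines; only the transformers pin is grounded in this thread (the later Greptile review notes the PR's actual file pins transformers==4.57.0), the rest are left unpinned as placeholders:

    # Sketch of docs/examples/te_llama/requirements.txt; pin the remaining
    # packages to whatever versions you actually test with. transformer_engine
    # came from the NGC container (nvcr.io/nvidia/pytorch:25.08-py3) in the
    # tested setup, so it is not pinned here.
    transformers==4.57.0
    accelerate
    peft
    datasets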

greptile-apps bot (Contributor) left a comment

Greptile Summary

This PR fixes compatibility issues with HuggingFace transformers >= 4.57 by changing how TELlamaDecoderLayer returns outputs. The breaking change in transformers 4.57+ modified the LlamaModel forward loop to expect decoder layers to return tensors directly instead of tuples.

Key changes:

  • Modified TELlamaDecoderLayer.forward() to return tensor directly instead of wrapping in tuple
  • Added defensive tuple unpacking for hidden_states input (backward compatibility safety)
  • Fixed regex SyntaxWarning by using raw string prefix (r"model.layers.\d+.")
  • Added requirements.txt to pin tested dependency versions

Issues found:

  • Minor: Confusing comment about when tuple unpacking is needed
  • The tuple unpacking check appears to be defensive programming rather than necessary for backward compatibility, as the return type change itself handles version compatibility

Confidence Score: 4/5

  • This PR is safe to merge with minimal risk - it addresses a critical compatibility issue with clear, targeted fixes
  • Score reflects that the fix correctly addresses the transformers 4.57+ breaking change by modifying return types. The regex fix is correct. Minor deduction for a misleading comment about the tuple unpacking logic, which doesn't affect functionality but could confuse future maintainers. The changes are well-tested according to PR description and follow a clear pattern.
  • No files require special attention - all changes are straightforward compatibility fixes

Important Files Changed

| Filename | Score | Overview |
| --- | --- | --- |
| docs/examples/te_llama/te_llama.py | 4/5 | Fixed forward method to return tensor directly instead of tuple for transformers >= 4.57, added input tuple handling for backward compatibility, and fixed regex SyntaxWarning with raw string prefix |
| docs/examples/te_llama/requirements.txt | 4/5 | New file pinning dependency versions for the example, including transformers==4.57.0 which contains the breaking change this PR addresses |

Sequence Diagram

sequenceDiagram
    participant HF as HuggingFace LlamaModel
    participant TELayer as TELlamaDecoderLayer
    participant TE as TransformerLayer (TE)
    
    Note over HF,TE: Transformers >= 4.57
    HF->>TELayer: forward(hidden_states=tensor)
    TELayer->>TELayer: Check isinstance(hidden_states, tuple)
    Note over TELayer: False - continue
    TELayer->>TE: super().forward(hidden_states, ...)
    TE-->>TELayer: returns tensor
    TELayer-->>HF: returns tensor directly
    
    Note over HF,TE: Transformers < 4.57 (with old code)
    HF->>TELayer: forward(hidden_states=tensor)
    TELayer->>TE: super().forward(hidden_states, ...)
    TE-->>TELayer: returns tensor
    TELayer-->>HF: returns (tensor,)
    Note over HF: Extracts with layer_outputs[0]

Comment on lines +75 to +78
    # Handle case where hidden_states might be a tuple (from previous layer output)
    # This can happen with older versions of HuggingFace transformers
    if isinstance(hidden_states, tuple):
        hidden_states = hidden_states[0]
greptile-apps bot (Contributor) commented:

Comment is misleading about when tuple unpacking is needed.

With the new code that returns tensor directly, hidden_states should never be a tuple when called from HuggingFace's LlamaModel forward loop (in any version). The old code returned (tensor,) for transformers < 4.57, but HF's loop extracted it with layer_outputs[0] before passing to the next layer.

This check appears to be defensive programming rather than addressing a real backward compatibility scenario. Consider clarifying the comment to explain this is a safety check rather than expected behavior.
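
For example, the comment could be reworded along these lines (a suggestion, not the merged wording):

    # Safety check: in supported HF versions hidden_states should already be a
    # tensor here; unpack defensively in case a caller hands in a tuple.
    if isinstance(hidden_states, tuple):
        hidden_states = hidden_states[0]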

Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!

