Fix GPT2 DynamicCache handling for Transformers 4.55 compatibility by gplutop7 · Pull Request #2342 · huggingface/optimum-habana

gplutop7 · 2025-11-07T14:22:52Z

This PR updates GaudiGPT2LMHeadModel to ensure full compatibility with Transformers v4.55 cache behavior.

In Transformers 4.55, the past_key_values object may still be returned as a legacy tuple when use_cache=True, even though the internal implementation now relies on DynamicCache.

HuggingFaceDocBuilderDev · 2025-11-07T14:27:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

regisss

LGTM

…2342)

GPT2 DynamicCache handling for Transformers 4.55 - fix

560802d

gplutop7 changed the title ~~Fix GPT2 DynamicCache handling for Transformers 4.55 compatibilit~~ Fix GPT2 DynamicCache handling for Transformers 4.55 compatibility Nov 7, 2025

GPT2 DynamicCache handling for Transformers 4.55 - fix - part.2

7d6a294

regisss approved these changes Dec 2, 2025

View reviewed changes

regisss merged commit f87daf5 into huggingface:main Dec 2, 2025
3 of 5 checks passed

astachowiczhabana pushed a commit that referenced this pull request Dec 19, 2025

Fix GPT2 DynamicCache handling for Transformers 4.55 compatibility (#…

d8576f9

…2342)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix GPT2 DynamicCache handling for Transformers 4.55 compatibility#2342

Fix GPT2 DynamicCache handling for Transformers 4.55 compatibility#2342
regisss merged 2 commits intohuggingface:mainfrom
HabanaAI:main-GPT2LMHeadModel_forward_fix

gplutop7 commented Nov 7, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Nov 7, 2025

Uh oh!

regisss left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

gplutop7 commented Nov 7, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Nov 7, 2025

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants