Use float32 for LoRA weights to avoid the risk of underflow and overflow. #22559
james77777778 wants to merge 3 commits into keras-team:master from
Conversation
Code Review
This pull request updates the LoRA implementation across several layers—including Convolutional, Dense, EinsumDense, and Embedding—to ensure that LoRA weights are initialized as float32 to prevent numerical instability. It also introduces explicit casting to the appropriate variable or compute dtypes during kernel composition and forward passes. A critical issue was identified in the EinsumDense layer where a trailing comma incorrectly converts the LoRA update into a tuple, which will cause a TypeError during tensor operations.
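The trailing-comma issue flagged above is easy to reproduce in plain Python: a stray comma after an expression turns it into a one-element tuple, which then fails when combined with a numeric value. This is a minimal illustration of the failure mode, not the actual EinsumDense code:

```python
kernel = 2.0

# A trailing comma silently makes this a 1-element tuple, not a float.
lora_update = 0.5 * 0.1,
assert isinstance(lora_update, tuple)

# Composing the kernel with the "update" now raises a TypeError.
try:
    kernel + lora_update
except TypeError as e:
    print(e)  # unsupported operand type(s) for +: 'float' and 'tuple'

# Correct form: no trailing comma.
lora_update = 0.5 * 0.1
composed = kernel + lora_update
```

Because tuple construction is valid Python, the bug only surfaces at runtime during kernel composition, which is why the review calls it critical.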
Codecov Report
✅ All modified and coverable lines are covered by tests.
@@ Coverage Diff @@
## master #22559 +/- ##
==========================================
+ Coverage 77.33% 83.27% +5.94%
==========================================
Files 596 596
Lines 67828 67835 +7
Branches 10562 10562
==========================================
+ Hits 52452 56487 +4035
+ Misses 12612 8605 -4007
+ Partials 2764 2743 -21
Force-pushed: a9a06aa → 7722450 → 28a8fe9
As reported in keras-team/keras-hub#2629
We should use high precision (float32) for LoRA weights to stabilize finetuning.
References:
- huggingface/peft: `autocast_adapter_dtype` (implementation)
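The idea behind the change can be sketched with a small numpy stand-in (this is an illustration under assumed shapes and dtypes, not the Keras implementation): the base kernel may live in a low-precision compute dtype such as float16, while the LoRA A/B matrices are stored in float32 and the composed update is cast down only once, at use time.

```python
import numpy as np

compute_dtype = "float16"  # assumed mixed-precision compute dtype
rng = np.random.default_rng(0)

# Base kernel in the (low-precision) compute dtype.
kernel = rng.standard_normal((8, 8)).astype(compute_dtype)

# LoRA adapter weights stay in float32: tiny values like 1e-4 and the
# even smaller gradient updates applied to them risk underflowing in fp16.
lora_a = (rng.standard_normal((8, 2)) * 1e-4).astype("float32")
lora_b = np.zeros((2, 8), dtype="float32")  # standard LoRA init: B starts at zero

def effective_kernel():
    # Compose the low-rank update in float32, then cast once.
    update = lora_a @ lora_b
    return kernel + update.astype(compute_dtype)

assert effective_kernel().dtype == np.float16
```

Keeping the adapters in float32 costs little memory (they are low-rank), while the single cast in `effective_kernel` preserves the layer's compute dtype for the forward pass.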