Skip to content

Commit 9326ff2

Browse files
committed
Release 1.0.25
1 parent 75684fa commit 9326ff2

File tree

2 files changed

+9
-1
lines changed

2 files changed

+9
-1
lines changed

README.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,14 @@
1212

1313
## What's New
1414

15+
## Feb 23, 2026
16+
* Add token distillation training support to distillation task wrappers
17+
* Remove some torch.jit usage in prep for official deprecation
18+
* Caution added to AdamP optimizer
19+
* Call reset_parameters() even if meta-device init so that buffers get init w/ hacks like init_empty_weights
20+
* Tweak Muon optimizer to work with DTensor/FSDP2 (clamp_ instead of clamp_min_, alternate NS branch for DTensor)
21+
* Release 1.0.25
22+
1523
## Jan 21, 2026
1624
* **Compat Break**: Fix oversight w/ QKV vs MLP bias in `ParallelScalingBlock` (& `DiffParallelScalingBlock`)
1725
* Does not impact any trained `timm` models but could impact downstream use.

timm/version.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
__version__ = '1.0.25.dev0'
1+
__version__ = '1.0.25'

0 commit comments

Comments
 (0)