You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
a new paper claims there is a free lunch by setting model weights to ema weights every epoch. allow researchers to experiment with this, conveniently already available in EMA-pytorch due to hare and tortoise paper
Copy file name to clipboardExpand all lines: README.md
+11Lines changed: 11 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -483,3 +483,14 @@ docker run -v .:/data --gpus all -it af3
483
483
journal = {bioRxiv}
484
484
}
485
485
```
486
+
487
+
```bibtex
488
+
@article{Li2024SwitchEA,
489
+
title = {Switch EMA: A Free Lunch for Better Flatness and Sharpness},
490
+
author = {Siyuan Li and Zicheng Liu and Juanxi Tian and Ge Wang and Zedong Wang and Weiyang Jin and Di Wu and Cheng Tan and Tao Lin and Yang Liu and Baigui Sun and Stan Z. Li},
0 commit comments