Skip to content

Commit e22d18c

Browse files
authored
Merge pull request #297 from kozistr/update/codes
[Docs] document stuff
2 parents 6610d2f + 9c56620 commit e22d18c

14 files changed

+60
-0
lines changed

README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -355,6 +355,10 @@ With the default bias-correction, Adam may actually make larger than requested g
355355

356356
Correcting the norm of a gradient in each iteration based on the adaptive training history of gradient norm.
357357

358+
### Cautious optimizer
359+
360+
Updates only occur when the proposed update direction aligns with the current gradient.
361+
358362
## Frequently asked questions
359363

360364
[here](docs/qa.md)

docs/changelogs/v3.3.0.md

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -30,3 +30,15 @@
3030
* `pytorch_optimizer.reduce_max_except_dim` -> `pytorch_optimizer.optimizers.sm3.reduce_max_except_dim`.
3131
* `pytorch_optimizer.neuron_norm` -> `pytorch_optimizer.optimizers.nero.neuron_norm`.
3232
* `pytorch_optimizer.neuron_mean` -> `pytorch_optimizer.optimizers.nero.neuron_mean`.
33+
34+
### Docs
35+
36+
* Add more visualizations. (#297)
37+
38+
### Bug
39+
40+
* Add optimizer parameter to `PolyScheduler` constructor. (#295)
41+
42+
### Contributions
43+
44+
thanks to @tanganke

docs/index.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -355,6 +355,10 @@ With the default bias-correction, Adam may actually make larger than requested g
355355

356356
Correcting the norm of a gradient in each iteration based on the adaptive training history of gradient norm.
357357

358+
### Cautious optimizer
359+
360+
Updates only occur when the proposed update direction aligns with the current gradient.
361+
358362
## Frequently asked questions
359363

360364
[here](docs/qa.md)

docs/visualization.md

Lines changed: 40 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -42,6 +42,10 @@
4242

4343
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_AdaMax.png)
4444

45+
### AdamG
46+
47+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_AdamG.png)
48+
4549
### AdaMod
4650

4751
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_AdaMod.png)
@@ -74,6 +78,14 @@
7478

7579
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_AdaSmooth.png)
7680

81+
### AdEMAMix
82+
83+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_AdEMAMix.png)
84+
85+
### ADOPT
86+
87+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_ADOPT.png)
88+
7789
### AggMo
7890

7991
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_AggMo.png)
@@ -134,6 +146,10 @@
134146

135147
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_Fromage.png)
136148

149+
### FTRL
150+
151+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_FTRL.png)
152+
137153
### GaLore
138154

139155
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_GaLore.png)
@@ -250,6 +266,10 @@
250266

251267
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_SM3.png)
252268

269+
### SOAP
270+
271+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_SOAP.png)
272+
253273
### SophiaH
254274

255275
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rastrigin_SophiaH.png)
@@ -316,6 +336,10 @@
316336

317337
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_AdaMax.png)
318338

339+
### AdamG
340+
341+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_AdamG.png)
342+
319343
### AdaMod
320344

321345
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_AdaMod.png)
@@ -348,6 +372,14 @@
348372

349373
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_AdaSmooth.png)
350374

375+
### AdEMAMix
376+
377+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_AdEMAMix.png)
378+
379+
### ADOPT
380+
381+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_ADOPT.png)
382+
351383
### AggMo
352384

353385
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_AggMo.png)
@@ -408,6 +440,10 @@
408440

409441
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_Fromage.png)
410442

443+
### FTRL
444+
445+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_FTRL.png)
446+
411447
### GaLore
412448

413449
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_GaLore.png)
@@ -524,6 +560,10 @@
524560

525561
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_SM3.png)
526562

563+
### SOAP
564+
565+
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_SOAP.png)
566+
527567
### SophiaH
528568

529569
![image](https://raw.githubusercontent.com/kozistr/pytorch_optimizer/main/docs/visualizations/rosenbrock_SophiaH.png)
719 KB
Loading
724 KB
Loading
82.1 KB
Loading
719 KB
Loading
364 KB
Loading
368 KB
Loading

0 commit comments

Comments
 (0)