Commit a6136fd

Update README.md
1 parent 42e3a73 commit a6136fd

1 file changed: +7 -1 lines changed

README.md

Lines changed: 7 additions & 1 deletion
@@ -17,7 +17,7 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
 
 ## Contents:
 
-* Two versions of **Soft Actor-Critic (SAC)** are implemented.
+* Multiple versions of **Soft Actor-Critic (SAC)** are implemented.
 
 **SAC Version 1**:
 
@@ -30,6 +30,12 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
 `sac_v2.py`: using target Q-value function instead of state-value function.
 
 paper: https://arxiv.org/pdf/1812.05905.pdf
+
+**SAC Discrete**
+
+`sac_discrete.py`: for discrete action space.
+
+paper (the author is actually one of my classmates at IC): https://arxiv.org/abs/1910.07207
 
 * **Deep Deterministic Policy Gradient (DDPG)**:
 
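
Note on the `sac_discrete.py` entry added in this commit: with a finite action set, the policy can output a categorical distribution over actions, so the soft state value can be computed as an exact sum over actions rather than estimated from sampled actions, as described in the linked paper. The snippet below is a minimal sketch of that one quantity only. It is not taken from `sac_discrete.py`; the function names, the temperature `alpha`, and the numbers are hypothetical and chosen purely for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-probabilities of a categorical policy."""
    m = max(logits)
    log_total = math.log(sum(math.exp(x - m) for x in logits))
    return [x - m - log_total for x in logits]


def soft_state_value(logits, q_values, alpha):
    """Exact soft value for one state with a finite action set.

    V(s) = sum_a pi(a|s) * (Q(s, a) - alpha * log pi(a|s))
    """
    log_probs = log_softmax(logits)
    return sum(
        math.exp(lp) * (q - alpha * lp)
        for lp, q in zip(log_probs, q_values)
    )


# Hypothetical example: one state, three actions, uniform policy.
value = soft_state_value(
    logits=[0.0, 0.0, 0.0],
    q_values=[1.0, 2.0, 3.0],
    alpha=0.2,
)
print(value)
```

For a uniform policy the result is the mean Q-value plus `alpha` times the policy entropy, which gives a quick sanity check on any real implementation.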
