Skip to content

Commit 96021d5

Browse files
authored
update readme for s-muzero
1 parent ba0b561 commit 96021d5

File tree

1 file changed

+14
-14
lines changed

1 file changed

+14
-14
lines changed

README.md

Lines changed: 14 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Stochastic MuZero
22

3-
Pytorch Implementation of [Stochastic MuZero](https://openreview.net/pdf?id=X6D9bAHhBQ1). Base on [Muzero Unplugged](https://github.com/DHDev0/Muzero-unplugged).
3+
Pytorch Implementation of [Stochastic MuZero](https://openreview.net/pdf?id=X6D9bAHhBQ1). Base on [Muzero Unplugged](https://github.com/DHDev0/Stochastic-muzero).
44

55
It is suggested to refer to Stochastic MuZero as "unplugged," as setting the reanalyze_ratio to 0 is necessary to achieve Stochastic MuZero. This is because the original "Stochastic MuZero" paper highlights online reinforcement learning, however, as an enhancement to "MuZero Unplugged," it also encompasses offline reinforcement learning capabilities.
66

@@ -29,33 +29,33 @@ Getting started
2929
Local Installation
3030
------------------
3131

32-
PIP dependency : [requirement.txt](https://github.com/DHDev0/Muzero-unplugged/blob/main/requirements.txt)
32+
PIP dependency : [requirement.txt](https://github.com/DHDev0/Stochastic-muzero/blob/main/requirements.txt)
3333
~~~bash
34-
git clone https://github.com/DHDev0/Muzero-unplugged.git
34+
git clone https://github.com/DHDev0/Stochastic-muzero.git
3535

3636
cd Muzero-unplugged
3737

3838
pip install -r requirements.txt
3939
~~~
4040

41-
If you experience some difficulty refer to the first cell [Tutorial](https://github.com/DHDev0/Muzero-unplugged/blob/main/tutorial.ipynb) or use the dockerfile.
41+
If you experience some difficulty refer to the first cell [Tutorial](https://github.com/DHDev0/Stochastic-muzero/blob/main/tutorial.ipynb) or use the dockerfile.
4242

4343
Docker
4444
------
4545

4646
Build image: (building time: 22 min , memory consumption: 8.75 GB)
4747
~~~bash
48-
docker build -t muzero_unplugged .
48+
docker build -t stochastic_muzero .
4949
~~~
5050
(do not forget the ending dot)
5151

5252
Start container:
5353
~~~bash
54-
docker run --cpus 2 --gpus 1 -p 8888:8888 muzero_unplugged
54+
docker run --cpus 2 --gpus 1 -p 8888:8888 stochastic_muzero
5555
#or
56-
docker run --cpus 2 --gpus 1 --memory 2000M -p 8888:8888 muzero_unplugged
56+
docker run --cpus 2 --gpus 1 --memory 2000M -p 8888:8888 stochastic_muzero
5757
#or
58-
docker run --cpus 2 --gpus 1 --memory 2000M -p 8888:8888 --storage-opt size=15g muzero_unplugged
58+
docker run --cpus 2 --gpus 1 --memory 2000M -p 8888:8888 --storage-opt size=15g stochastic_muzero
5959
~~~
6060

6161
The docker run will start a jupyter lab on https://localhost:8888//lab?token=token (you need the token) with all the necessary dependency for cpu and gpu(Nvidia) compute.
@@ -69,12 +69,12 @@ Option meaning:
6969

7070
Stop the container:
7171
~~~bash
72-
docker stop $(docker ps -q --filter ancestor=muzero_unplugged)
72+
docker stop $(docker ps -q --filter ancestor=stochastic_muzero)
7373
~~~
7474

7575
Delete the container:
7676
~~~bash
77-
docker rmi -f muzero_unplugged
77+
docker rmi -f stochastic_muzero
7878
~~~
7979

8080
Dependency
@@ -91,7 +91,7 @@ Library :
9191
* matplotlib >=3.0
9292
* numpy 1.21.5
9393

94-
More details at: [requirement.txt](https://github.com/DHDev0/Muzero-unplugged/blob/main/requirements.txt)
94+
More details at: [requirement.txt](https://github.com/DHDev0/Stochastic-muzero/blob/main/requirements.txt)
9595

9696

9797
Usage
@@ -100,17 +100,17 @@ Usage
100100
Jupyter Notebook
101101
---------------
102102

103-
For practical example, you can use the [Tutorial](https://github.com/DHDev0/Muzero-unplugged/blob/main/tutorial.ipynb).
103+
For practical example, you can use the [Tutorial](https://github.com/DHDev0/Stochastic-muzero/blob/main/tutorial.ipynb).
104104

105105

106106
CLI
107107
-----------
108108

109-
Set your config file (example): https://github.com/DHDev0/Muzero-unplugged/blob/main/config/
109+
Set your config file (example): https://github.com/DHDev0/Stochastic-muzero/blob/main/config/
110110

111111
First and foremost cd to the project folder:
112112
~~~bash
113-
cd Muzero
113+
cd Stochastic-muzero
114114
~~~
115115

116116
Construct your dataset through experimentation.

0 commit comments

Comments
 (0)