Skip to content

Commit a25f02c

Browse files
authored
Merge pull request #196 from BDonnot/bd_dev
Fix notebook on RL
2 parents 9e21965 + 74563f0 commit a25f02c

File tree

5 files changed

+1066
-558
lines changed

5 files changed

+1066
-558
lines changed

.gitignore

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -412,7 +412,7 @@ getting_started/env_py38_grid2op110_ray110.ipynb
412412
getting_started/env_py38_grid2op110_ray210.ipynb
413413
grid2op/tests/req_chronix2grid
414414
grid2op/tests/venv_test_chronix2grid/
415-
415+
getting_started/venv_310_ray/
416416

417417
# profiling files
418418
**.prof

CHANGELOG.rst

Lines changed: 16 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -31,15 +31,28 @@ Change Log
3131
- [???] "asynch" multienv
3232
- [???] properly model interconnecting powerlines
3333

34-
34+
Work kind of in progress
35+
----------------------------------
3536
- TODO A number of max buses per sub
3637
- TODO in the runner, save multiple times the same sceanrio
38+
- TODO in the gym env, make the action_space and observation_space attribute
39+
filled automatically (see ray integration, it's boring to have to copy paste...)
40+
41+
Next release
42+
---------------------------------
43+
- TODO Notebook for tf_agents
44+
- TODO Notebook for acme
45+
- TODO Notebook using "keras rl" (see https://keras.io/examples/rl/ppo_cartpole/)
46+
- TODO put the Grid2opEnvWrapper directly in grid2op as GymEnv
47+
- TODO example for MCTS https://github.com/bwfbowen/muax et https://github.com/google-deepmind/mctx
48+
- TODO jax everything that can be: create a simple env based on jax for topology manipulation, without
49+
redispatching or rules
50+
- TODO backend in jax, maybe ?
3751

3852

3953
[1.10.3] - 2024-xx-yy
4054
-------------------------
4155
- TODO Automatic "experimental_read_from_local_dir"
42-
- TODO Notebook for stable baselines
4356

4457
- [BREAKING] `env.chronics_hander.set_max_iter(xxx)` is now a private function. Use
4558
`env.set_max_iter(xxx)` or even better `env.reset(options={"max step": xxx})`.
@@ -59,7 +72,7 @@ Change Log
5972

6073
[1.10.2] - 2024-05-27
6174
-------------------------
62-
- [BREAKING] the `runner.run_one_episode` now returns an extra first argument:
75+
- [BREAKING] the `runner.run_one_episode` now returns an extra argument (first position):
6376
`chron_id, chron_name, cum_reward, timestep, max_ts = runner.run_one_episode()` which
6477
is consistant with `runner.run(...)` (previously it returned only
6578
`chron_name, cum_reward, timestep, max_ts = runner.run_one_episode()`)

0 commit comments

Comments
 (0)