|
3 | 3 | Fast and simple implementation of RL algorithms, designed to run fully on GPU. |
4 | 4 | This code is an evolution of `rl-pytorch` provided with NVIDIA's Isaac GYM. |
5 | 5 |
|
6 | | -| The `algorithms` branch supports additional algorithms (SAC, DDPG, DSAC, and more)! | |
7 | | -| ----------------------------------------------------------------------------------- | |
| 6 | +The main branch supports PPO with additional features from our work. |
| 7 | +These include: |
8 | 8 |
|
9 | | -The main branch only supports PPO for now. |
10 | | -Contributions are welcome. |
| 9 | +* [Random Network Distillation (RND)](https://proceedings.mlr.press/v229/schwarke23a.html) |
| 10 | +* [Symmetry-based Augmentation](https://arxiv.org/abs/2403.04359) |
11 | 11 |
|
12 | 12 | **Maintainer**: Mayank Mittal and Clemens Schwarke <br/> |
13 | 13 | **Affiliation**: Robotic Systems Lab, ETH Zurich & NVIDIA <br/> |
14 | 14 | |
15 | 15 |
|
| 16 | +Environment repositories using the framework: |
| 17 | + |
| 18 | +* `Isaac Lab` (built on top of NVIDIA Isaac Sim): https://github.com/isaac-sim/IsaacLab |
| 19 | +* `Legged-Gym` (built on top of NVIDIA Isaac Gym): https://leggedrobotics.github.io/legged_gym/ |
| 20 | + |
| 21 | +We welcome contributions on the community. Please check our contribution guidelines for more |
| 22 | +information. |
| 23 | + |
| 24 | +> **Note:** The `algorithms` branch supports additional algorithms (SAC, DDPG, DSAC, and more). However, it isn't currently actively maintained. |
| 25 | +
|
| 26 | + |
16 | 27 | ## Setup |
17 | 28 |
|
18 | 29 | The package can be installed via PyPI with: |
@@ -50,17 +61,57 @@ We use the following tools for maintaining code quality: |
50 | 61 |
|
51 | 62 | Please check [here](https://pre-commit.com/#install) for instructions to set these up. To run over the entire repository, please execute the following command in the terminal: |
52 | 63 |
|
53 | | - |
54 | 64 | ```bash |
55 | 65 | # for installation (only once) |
56 | 66 | pre-commit install |
57 | 67 | # for running |
58 | 68 | pre-commit run --all-files |
59 | 69 | ``` |
60 | 70 |
|
61 | | -## Useful Links |
| 71 | +## Citing |
| 72 | + |
| 73 | +**We are working on writing a white paper for this library.** Until then, please cite the following work |
| 74 | +if you use this library for your research: |
| 75 | + |
| 76 | +```text |
| 77 | +@InProceedings{rudin2022learning, |
| 78 | + title = {Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning}, |
| 79 | + author = {Rudin, Nikita and Hoeller, David and Reist, Philipp and Hutter, Marco}, |
| 80 | + booktitle = {Proceedings of the 5th Conference on Robot Learning}, |
| 81 | + pages = {91--100}, |
| 82 | + year = {2022}, |
| 83 | + volume = {164}, |
| 84 | + series = {Proceedings of Machine Learning Research}, |
| 85 | + publisher = {PMLR}, |
| 86 | + url = {https://proceedings.mlr.press/v164/rudin22a.html}, |
| 87 | +} |
| 88 | +``` |
62 | 89 |
|
63 | | -Environment repositories using the framework: |
| 90 | +If you use the library with curiosity-driven exploration (random network distillation), please cite: |
| 91 | + |
| 92 | +```text |
| 93 | +@InProceedings{schwarke2023curiosity, |
| 94 | + title = {Curiosity-Driven Learning of Joint Locomotion and Manipulation Tasks}, |
| 95 | + author = {Schwarke, Clemens and Klemm, Victor and Boon, Matthijs van der and Bjelonic, Marko and Hutter, Marco}, |
| 96 | + booktitle = {Proceedings of The 7th Conference on Robot Learning}, |
| 97 | + pages = {2594--2610}, |
| 98 | + year = {2023}, |
| 99 | + volume = {229}, |
| 100 | + series = {Proceedings of Machine Learning Research}, |
| 101 | + publisher = {PMLR}, |
| 102 | + url = {https://proceedings.mlr.press/v229/schwarke23a.html}, |
| 103 | +} |
| 104 | +``` |
64 | 105 |
|
65 | | -* `Isaac Lab` (built on top of NVIDIA Isaac Sim): https://github.com/isaac-sim/IsaacLab |
66 | | -* `Legged-Gym` (built on top of NVIDIA Isaac Gym): https://leggedrobotics.github.io/legged_gym/ |
| 106 | +If you use the library with symmetry augmentation, please cite: |
| 107 | + |
| 108 | +```text |
| 109 | +@InProceedings{mittal2024symmetry, |
| 110 | + author={Mittal, Mayank and Rudin, Nikita and Klemm, Victor and Allshire, Arthur and Hutter, Marco}, |
| 111 | + booktitle={2024 IEEE International Conference on Robotics and Automation (ICRA)}, |
| 112 | + title={Symmetry Considerations for Learning Task Symmetric Robot Policies}, |
| 113 | + year={2024}, |
| 114 | + pages={7433-7439}, |
| 115 | + doi={10.1109/ICRA57147.2024.10611493} |
| 116 | +} |
| 117 | +``` |
0 commit comments