-
Notifications
You must be signed in to change notification settings - Fork 25
Description
Hi, sorry if feature requests are not accepted (close this if so), but I was wondering if it would be possible to upgrade this repo from gym to gymnasium?
Gymnasium a maintained fork of openai gym and is designed as a drop-in replacement (import gym -> import gymnasium as gym). Beyond just bugfixes, many RL training libraries have also switched (rllib, tianshou, CleanRL), or are planning to switch ((stable-baselines3](https://github.com/DLR-RM/stable-baselines3/blob/e5deeed16efb57c34ccdcb14692439154d970527/docs/guide/install.rst#bleeding-edge-version)). It would be great if users could train agents on Griddly using the latest models and features from these libraries (e.g., scalable distributed training/model serving using Ray/RLlib).
For information about upgrading and compatibility, see migration guide and gym compatibility. The main difference is the API has switched to returning truncated and terminated, rather than done, in order to give more information and mitigate edge case issues (for example, many popular tutorials/implementations of Q learning using gym were actually incorrect because of done, there will be an upcoming blog post explaining more details about this on the Farama site (https://farama.org/blog).
We would love to add Griddly to our list of third party environments, but are only including repositories which are compatible with gymnasium.
We have discussed making a Shimmy wrapper and that is a possibility, but it would be best if it were done here so that more people could see it. Gymnasium doesn't directly support multi-agent environments, so it would be most ideal to make a wrapper to convert that setting into a PettingZoo environment instead. I'm personally not familiar with the Griddly codebase so it's hard for me to tell how difficult it would be to upgrade, just making this to inquire about if there are any plans or if you would be open to accepting PRs or helping with any questions we have if we do end up making a shimmy wrapper, for example.