Skip to content

v2.7.0: Added support for n-step returns for off-policy algorithms

Latest
Compare
Choose a tag to compare
@araffin araffin released this 25 Jul 13:03
33889db

Breaking Changes

  • Upgraded to SB3 >= 2.7.0

New features

  • Add n-step returns support with n_steps parameter

Bug fixes

  • Use the FloatSchedule and LinearSchedule classes instead of lambdas in the ARS, PPO, and QRDQN implementations to improve model portability across different operating systems

New Contributors

Full Changelog: v2.6.0...v2.7.0