Skip to content

SB3-Contrib v2.4.0: New algorithm (CrossQ), Gymnasium v1.0 support

Latest
Compare
Choose a tag to compare
@araffin araffin released this 18 Nov 10:33
· 2 commits to master since this release
d5ac968

Breaking Changes:

  • Upgraded to Stable-Baselines3 >= 2.4.0

New Features:

  • Added CrossQ algorithm, from "Batch Normalization in Deep Reinforcement Learning" paper (@danielpalen)
  • Added BatchRenorm PyTorch layer used in CrossQ (@danielpalen)
  • Added support for Gymnasium v1.0

Bug Fixes:

  • Updated QR-DQN optimizer input to only include quantile_net parameters (@corentinlger)
  • Updated QR-DQN paper link in docs (@corentinlger)
  • Fixed a warning with PyTorch 2.4 when loading a RecurrentPPO model (You are using torch.load with weights_only=False)
  • Fixed loading QRDQN changes target_update_interval (@jak3122)

Others:

  • Updated PyTorch version on CI to 2.3.1
  • Remove unnecessary SDE noise resampling in PPO/TRPO update
  • Switched to uv to download packages on GitHub CI

New Contributors

Full Changelog: v2.3.0...v2.4.0