SB3-Contrib v2.4.0: New algorithm (CrossQ), Gymnasium v1.0 support

Latest

Latest

araffin released this 18 Nov 10:33

· 2 commits to master since this release

Breaking Changes:

Upgraded to Stable-Baselines3 >= 2.4.0

New Features:

Added CrossQ algorithm, from "Batch Normalization in Deep Reinforcement Learning" paper (@danielpalen)
Added BatchRenorm PyTorch layer used in CrossQ (@danielpalen)
Added support for Gymnasium v1.0

Bug Fixes:

Updated QR-DQN optimizer input to only include quantile_net parameters (@corentinlger)
Updated QR-DQN paper link in docs (@corentinlger)
Fixed a warning with PyTorch 2.4 when loading a RecurrentPPO model (You are using torch.load with weights_only=False)
Fixed loading QRDQN changes target_update_interval (@jak3122)

Others:

Updated PyTorch version on CI to 2.3.1
Remove unnecessary SDE noise resampling in PPO/TRPO update
Switched to uv to download packages on GitHub CI

New Contributors

@corentinlger made their first contribution in #252
@jak3122 made their first contribution in #259
@danielpalen made their first contribution in #243

Full Changelog: v2.3.0...v2.4.0

Contributors

jak3122, danielpalen, and corentinlger

Assets 2