Breaking Changes:
- Upgraded to Stable-Baselines3 >= 2.4.0
New Features:
- Added
CrossQ
algorithm, from "Batch Normalization in Deep Reinforcement Learning" paper (@danielpalen) - Added
BatchRenorm
PyTorch layer used inCrossQ
(@danielpalen) - Added support for Gymnasium v1.0
Bug Fixes:
- Updated QR-DQN optimizer input to only include quantile_net parameters (@corentinlger)
- Updated QR-DQN paper link in docs (@corentinlger)
- Fixed a warning with PyTorch 2.4 when loading a
RecurrentPPO
model (You are using torch.load with weights_only=False) - Fixed loading QRDQN changes
target_update_interval
(@jak3122)
Others:
- Updated PyTorch version on CI to 2.3.1
- Remove unnecessary SDE noise resampling in PPO/TRPO update
- Switched to uv to download packages on GitHub CI
New Contributors
- @corentinlger made their first contribution in #252
- @jak3122 made their first contribution in #259
- @danielpalen made their first contribution in #243
Full Changelog: v2.3.0...v2.4.0