OpenSpiel 1.2

lanctot released this 21 Aug 23:05

· 1105 commits to master since this release

This release adds several games and algorithms, improvements, bug fixes, and documentation updates.

Support and Process changes

Upgrade support for newer versions of dependencies
Add dependency to pybind11_abseil

Games

2048
Checkers
Dynamic routing game
Euchre
Mancala
Nim
Phantom Go

Algorithms

Asymmetric Q-learning
Magnetic Mirror Descent (MMD)
NeuRD (PyTorch)
Policy gradients (JAX)
Sample-based NeuRD loss (PyTorch)
Stackelberg solver
WoLF-PHC

Improvements and other additions

Blackjack: add observation tensor
C++ DQN: in-memory target net, saving + loading of model
Core API reference
Remove hard-coded inclusion of Hanabi and ACPC in setup.py

Fixes

Colored Trails: fix max utility
MCTS handling of chance nodes: properly handle them not just at the root
Nash averaging optimization fix
Othello: fix the max game length
Policy aggregator, surface copy -> deep copy
pybind11: change game references to shared pointers

Several other miscellaneous fixes and improvements.

Acknowledgments

Thanks to DeepMind for continued support of development and maintenance of OpenSpiel.

Thanks to all of our contributors:

Core Team: https://github.com/deepmind/open_spiel/blob/master/docs/authors.md
All Contributors: https://github.com/deepmind/open_spiel/graphs/contributors

Assets 2