instadeepai/Mava
A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
5 Releases
Latest: 3y ago
0.1.3 (Latest, Pre-release)
Systems
- Updates to acme, reverb and tensorflow.
- Working centralised and state based architectures.
- Recurrent and Multiple Trainer PPO.
Environments
- (none listed)
What's Changed
- Bugfix/ Release aren't triggering pypi push job. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/466
- Feature / Release 0.1.2 v2 by @KaleabTessera in https://github.com/instadeepai/Mava/pull/467
- fix: Update black version. by @DriesSmit in https://github.com/instadeepai/Mava/pull/470
- Bugfix/ Update PZ Version and new jax dockerfiles by @KaleabTessera in https://github.com/instadeepai/Mava/pull/480
- Feature/recurrent and multiple trainer MAPPO by @DriesSmit in https://github.com/instadeepai/Mava/pull/326
- Feat/maddpg obs optim by @AsadJeewa in https://github.com/instadeepai/Mava/pull/459
- feat: Add fixed sampler capability + bugfixes by @DriesSmit in https://github.com/instadeepai/Mava/pull/475
- Feature/fix sampler madqn by @EdanToledo in https://github.com/instadeepai/Mava/pull/477
- (4 more entries not shown)
0.1.2 (Pre-release)
Systems
- Fixed an observation network bug in mappo and changed the implementation to use two optimisers.
- Fixes to the maddpg/mad4pg loss calculation.
- Began work on JAX system implementations.
Environments
- (none listed)
What's Changed
- Fix/add loss mask to ppo by @EdanToledo in https://github.com/instadeepai/Mava/pull/441
- Mainetenance: Fix tf examples issues by @AsadJeewa in https://github.com/instadeepai/Mava/pull/444
- fix: shared weights with agent type by @AsadJeewa in https://github.com/instadeepai/Mava/pull/428
- Fix broken readme links and neaten up formatting by @AsadJeewa in https://github.com/instadeepai/Mava/pull/446
- Feature/jax abstract builder class by @arnupretorius in https://github.com/instadeepai/Mava/pull/433
- docs: updated docs to better represent available options by @sash-a in https://github.com/instadeepai/Mava/pull/448
- Feature/jax general system class by @arnupretorius in https://github.com/instadeepai/Mava/pull/425
- Bugfix/Mypy Inconsistency Issue by @KaleabTessera in https://github.com/instadeepai/Mava/pull/458
- (7 more entries not shown)
0.1.1 (Pre-release)
Systems
- Stable versions of all systems, notably `mappo`, `vdn` and `qmix`.
- Multiple trainer implementations for `maddpg` and `mad4pg`.
- Removed the `dial` system.
Environments / Environment Wrappers
- Added [Melting Pot](https://github.com/deepmind/meltingpot) support.
What's Changed
- Feature/Enforce docstring code coverage. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/271
- Chore/Resized gifs in readme. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/272
- Feature/Improve Mava agent networks by @DriesSmit in https://github.com/instadeepai/Mava/pull/258
- Feature/upgrade acme version and use new adders by @KaleabTessera in https://github.com/instadeepai/Mava/pull/274
- Chore/Updated makefile and readme for Windows. by @Nashlen in https://github.com/instadeepai/Mava/pull/273
- Fix/supersuit version by @KaleabTessera in https://github.com/instadeepai/Mava/pull/277
- Chore/ Update quickstart by @KaleabTessera in https://github.com/instadeepai/Mava/pull/278
- Feature/New acme adders and tests. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/276
- (62 more entries not shown)
New Contributors
- @Nashlen made their first contribution in https://github.com/instadeepai/Mava/pull/273
- @mnguyen0226 made their first contribution in https://github.com/instadeepai/Mava/pull/286
- @mmorris44 made their first contribution in https://github.com/instadeepai/Mava/pull/319
- @AsadJeewa made their first contribution in https://github.com/instadeepai/Mava/pull/320
- @RuanJohn made their first contribution in https://github.com/instadeepai/Mava/pull/341
- @sash-a made their first contribution in https://github.com/instadeepai/Mava/pull/370
- @EdanToledo made their first contribution in https://github.com/instadeepai/Mava/pull/353
- Full Changelog: https://github.com/instadeepai/Mava/compare/0.1.0...0.1.1
Mava-0.1.0 (0.1.0, Pre-release)
Mava Core
- Components
  - Architectures
    - Added Centralised, Decentralised, Networked and State Based Architectures.
  - Modules
    - Added Broadcast Communication, Epsilon Decay Scheduling, Additive and Monotonic Mixing and Fingerprint Stabilization.
  - Networks
    - Added Additive and Monotonic Mixing Networks, Hypernetworks, Communication Networks, Epsilon Greedy and DiscreteValued head.
- Environment Loops
- (3 more entries not shown)
Systems
- Added feedforward training for maddpg, mad4pg, madqn, mappo, vdn and qmix.
- Added recurrent training for madqn, dial, maddpg and mad4pg.
- Added continuous network heads for maddpg, mad4pg and mappo.
- Added decentralised architecture training for maddpg, mad4pg, madqn, mappo, dial, vdn and qmix.
- Added centralised architecture training for maddpg, mad4pg and mappo.
- Added state based architecture training for maddpg and mad4pg.
- Added networked architecture training for maddpg.
Environments / Environment Wrappers
- Added PettingZoo, SMAC, RoboCup, OpenSpiel, Flatland, Debug Simple Spread, Debug Switch environment and Debug Two-Step game.
Examples
- Added quickstart notebook.
- Added basic examples for sample systems and environments.
MAVA Pre-release (0.0.9, Pre-release)
MAVA Pre-release test before official `0.1.0` release.