instadeepai/Mava

๐Ÿฆ A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

5 Releases
Latest: 3y ago
0.1.3 (Latest, Pre-release)
KaleabTessera · 3y ago · June 15, 2022

📦 Systems

  • Updates to acme, reverb and tensorflow.
  • Working centralised and state based architectures.
  • Recurrent and Multiple Trainer PPO.

📦 Environments

  • -

📋 What's Changed

  • Bugfix/ Release aren't triggering pypi push job. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/466
  • Feature / Release 0.1.2 v2 by @KaleabTessera in https://github.com/instadeepai/Mava/pull/467
  • fix: Update black version. by @DriesSmit in https://github.com/instadeepai/Mava/pull/470
  • Bugfix/ Update PZ Version and new jax dockerfiles by @KaleabTessera in https://github.com/instadeepai/Mava/pull/480
  • Feature/recurrent and multiple trainer MAPPO by @DriesSmit in https://github.com/instadeepai/Mava/pull/326
  • Feat/maddpg obs optim by @AsadJeewa in https://github.com/instadeepai/Mava/pull/459
  • feat: Add fixed sampler capability + bugfixes by @DriesSmit in https://github.com/instadeepai/Mava/pull/475
  • Feature/fix sampler madqn by @EdanToledo in https://github.com/instadeepai/Mava/pull/477
  • + 4 more
0.1.2 (Pre-release)
KaleabTessera · 4y ago · March 28, 2022

📦 Systems

  • Fixed an observation network bug in mappo and changed the implementation to use two optimisers.
  • Fixes in maddpg/mad4pg loss calculation.
  • Began work on JAX system implementations.

📦 Environments

  • -

📋 What's Changed

  • Fix/add loss mask to ppo by @EdanToledo in https://github.com/instadeepai/Mava/pull/441
  • Mainetenance: Fix tf examples issues by @AsadJeewa in https://github.com/instadeepai/Mava/pull/444
  • fix: shared weights with agent type by @AsadJeewa in https://github.com/instadeepai/Mava/pull/428
  • Fix broken readme links and neaten up formatting by @AsadJeewa in https://github.com/instadeepai/Mava/pull/446
  • Feature/jax abstract builder class by @arnupretorius in https://github.com/instadeepai/Mava/pull/433
  • docs: updated docs to better represent available options by @sash-a in https://github.com/instadeepai/Mava/pull/448
  • Feature/jax general system class by @arnupretorius in https://github.com/instadeepai/Mava/pull/425
  • Bugfix/Mypy Inconsistency Issue by @KaleabTessera in https://github.com/instadeepai/Mava/pull/458
  • + 7 more
0.1.1 (Pre-release)
KaleabTessera · 4y ago · February 25, 2022

📦 Systems

  • Stable versions of all systems, notably `mappo`, `vdn` and `qmix`.
  • Multiple trainer implementations for `maddpg` and `mad4pg`.
  • Removed the `dial` system.

📦 Environments / Environment Wrappers

  • Added [Melting Pot](https://github.com/deepmind/meltingpot) support.

📋 What's Changed

  • Feature/Enforce docstring code coverage. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/271
  • Chore/Resized gifs in readme. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/272
  • Feature/Improve Mava agent networks by @DriesSmit in https://github.com/instadeepai/Mava/pull/258
  • Feature/upgrade acme version and use new adders by @KaleabTessera in https://github.com/instadeepai/Mava/pull/274
  • Chore/Updated makefile and readme for Windows. by @Nashlen in https://github.com/instadeepai/Mava/pull/273
  • Fix/supersuit version by @KaleabTessera in https://github.com/instadeepai/Mava/pull/277
  • Chore/ Update quickstart by @KaleabTessera in https://github.com/instadeepai/Mava/pull/278
  • Feature/New acme adders and tests. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/276
  • + 62 more

✨ New Contributors

  • @Nashlen made their first contribution in https://github.com/instadeepai/Mava/pull/273
  • @mnguyen0226 made their first contribution in https://github.com/instadeepai/Mava/pull/286
  • @mmorris44 made their first contribution in https://github.com/instadeepai/Mava/pull/319
  • @AsadJeewa made their first contribution in https://github.com/instadeepai/Mava/pull/320
  • @RuanJohn made their first contribution in https://github.com/instadeepai/Mava/pull/341
  • @sash-a made their first contribution in https://github.com/instadeepai/Mava/pull/370
  • @EdanToledo made their first contribution in https://github.com/instadeepai/Mava/pull/353

Full Changelog: https://github.com/instadeepai/Mava/compare/0.1.0...0.1.1
Mava-0.1.0 (0.1.0, Pre-release)
KaleabTessera · 4y ago · July 6, 2021

📦 Mava Core

  • Components
    • Architectures
      • Added Centralised, Decentralised, Networked and State Based Architectures.
    • Modules
      • Added Broadcast Communication, Epsilon Decay Scheduling, Additive and Monotonic Mixing and Fingerprint Stabilization.
    • Networks
      • Added Additive and Monotonic Mixing Networks, Hypernetworks, Communication Networks, Epsilon Greedy and DiscreteValued head.
  • Environment Loops
  • + 3 more
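
For readers unfamiliar with the two mixing schemes named above, the following is a minimal pure-Python sketch of the general idea, not Mava's implementation. The function names `additive_mix` and `monotonic_mix` are hypothetical, and the monotonic mixer is reduced to a single linear layer with weights passed in directly; in QMIX-style systems those weights would normally come from a hypernetwork conditioned on the global state.

```python
from typing import Sequence


def additive_mix(agent_qs: Sequence[float]) -> float:
    """Additive (VDN-style) mixing: the joint value is the sum of per-agent values."""
    return sum(agent_qs)


def monotonic_mix(
    agent_qs: Sequence[float],
    weights: Sequence[float],
    bias: float = 0.0,
) -> float:
    """Monotonic (QMIX-style) mixing, simplified to one linear layer.

    Taking abs() of every weight keeps the joint value non-decreasing in
    each per-agent value, which is the monotonicity constraint.
    Assumption: weights are supplied by the caller rather than a hypernetwork.
    """
    if len(agent_qs) != len(weights):
        raise ValueError("agent_qs and weights must have the same length")
    return sum(abs(w) * q for w, q in zip(weights, agent_qs)) + bias


agent_qs = [1.0, 2.0, 0.5]
joint_additive = additive_mix(agent_qs)
joint_monotonic = monotonic_mix(agent_qs, weights=[0.5, -1.0, 2.0])
```

Additive mixing is the special case of monotonic mixing in which every weight is 1 and the bias is 0.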

📦 Systems

  • Added feedforward training for maddpg, mad4pg, madqn, mappo, vdn and qmix.
  • Added recurrent training for madqn, dial, maddpg and mad4pg.
  • Added continuous network heads for maddpg, mad4pg and mappo.
  • Added decentralised architecture training for maddpg, mad4pg, madqn, mappo, dial, vdn and qmix.
  • Added centralised architecture training for maddpg, mad4pg and mappo.
  • Added state based architecture training for maddpg and mad4pg.
  • Added networked architecture training for maddpg.
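
As a rough illustration of how the four architectures listed above differ, the sketch below shows which information a critic might receive under each one. It is an assumption-laden simplification and not Mava's API: `critic_input`, the architecture strings and the `neighbours` mapping are all hypothetical names introduced for this example.

```python
from typing import Dict, List, Optional, Sequence

Obs = List[float]


def critic_input(
    architecture: str,
    agent: str,
    observations: Dict[str, Obs],
    global_state: Optional[Obs] = None,
    neighbours: Optional[Dict[str, Sequence[str]]] = None,
) -> Obs:
    """Return the flat input a critic for `agent` would see (illustrative only)."""
    if architecture == "decentralised":
        # Each critic sees only its own agent's observation.
        return list(observations[agent])
    if architecture == "centralised":
        # The critic sees every agent's observation, in a fixed (sorted) order.
        return [x for name in sorted(observations) for x in observations[name]]
    if architecture == "state_based":
        # The critic sees the environment's global state instead of observations.
        if global_state is None:
            raise ValueError("state_based requires a global_state")
        return list(global_state)
    if architecture == "networked":
        # The critic sees its own observation plus those of connected agents.
        if neighbours is None:
            raise ValueError("networked requires a neighbours mapping")
        names = [agent] + list(neighbours.get(agent, []))
        return [x for name in names for x in observations[name]]
    raise ValueError(f"unknown architecture: {architecture}")


observations = {"agent_0": [1.0], "agent_1": [2.0], "agent_2": [3.0]}
own_only = critic_input("decentralised", "agent_1", observations)
everyone = critic_input("centralised", "agent_1", observations)
```

In practice the critic input usually also includes actions; they are omitted here to keep the contrast between the architectures visible.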

📦 Environments / Environment Wrappers

  • Added PettingZoo, SMAC, RoboCup, OpenSpiel, Flatland, Debug Simple Spread, Debug Switch environment and Debug Two-Step game.

📦 Examples

  • Added quickstart notebook.
  • Added basic examples for sample systems and environments.
MAVA Pre-release (0.0.9, Pre-release)
KaleabTessera · 4y ago · June 9, 2021

MAVA Pre-release test before official `0.1.0` release.