instadeepai/Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

5 Releases

Latest: 3y ago

0.1.3·Latest·Pre-release

KaleabTessera·3y ago·June 15, 2022

📦 Systems

📦 Environments

📋 What's Changed

Bugfix/ Release aren't triggering pypi push job. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/466
Feature / Release 0.1.2 v2 by @KaleabTessera in https://github.com/instadeepai/Mava/pull/467
fix: Update black version. by @DriesSmit in https://github.com/instadeepai/Mava/pull/470
Bugfix/ Update PZ Version and new jax dockerfiles by @KaleabTessera in https://github.com/instadeepai/Mava/pull/480
Feature/recurrent and multiple trainer MAPPO by @DriesSmit in https://github.com/instadeepai/Mava/pull/326
Feat/maddpg obs optim by @AsadJeewa in https://github.com/instadeepai/Mava/pull/459
feat: Add fixed sampler capability + bugfixes by @DriesSmit in https://github.com/instadeepai/Mava/pull/475
Feature/fix sampler madqn by @EdanToledo in https://github.com/instadeepai/Mava/pull/477
+ 4 more

0.1.2·Pre-release

KaleabTessera·4y ago·March 28, 2022

📦 Systems

Fixed an observation network bug in mappo and changed the implementation to use two optimisers.
Fixed issues in the maddpg/mad4pg loss calculation.
Began work on JAX system implementations.

📦 Environments

📋 What's Changed

Fix/add loss mask to ppo by @EdanToledo in https://github.com/instadeepai/Mava/pull/441
Mainetenance: Fix tf examples issues by @AsadJeewa in https://github.com/instadeepai/Mava/pull/444
fix: shared weights with agent type by @AsadJeewa in https://github.com/instadeepai/Mava/pull/428
Fix broken readme links and neaten up formatting by @AsadJeewa in https://github.com/instadeepai/Mava/pull/446
Feature/jax abstract builder class by @arnupretorius in https://github.com/instadeepai/Mava/pull/433
docs: updated docs to better represent available options by @sash-a in https://github.com/instadeepai/Mava/pull/448
Feature/jax general system class by @arnupretorius in https://github.com/instadeepai/Mava/pull/425
Bugfix/Mypy Inconsistency Issue by @KaleabTessera in https://github.com/instadeepai/Mava/pull/458
+ 7 more

0.1.1·Pre-release

KaleabTessera·4y ago·February 25, 2022

📦 Systems

📦 Environments/ Environment Wrappers

📋 What's Changed

Feature/Enforce docstring code coverage. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/271
Chore/Resized gifs in readme. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/272
Feature/Improve Mava agent networks by @DriesSmit in https://github.com/instadeepai/Mava/pull/258
Feature/upgrade acme version and use new adders by @KaleabTessera in https://github.com/instadeepai/Mava/pull/274
Chore/Updated makefile and readme for Windows. by @Nashlen in https://github.com/instadeepai/Mava/pull/273
Fix/supersuit version by @KaleabTessera in https://github.com/instadeepai/Mava/pull/277
Chore/ Update quickstart by @KaleabTessera in https://github.com/instadeepai/Mava/pull/278
Feature/New acme adders and tests. by @KaleabTessera in https://github.com/instadeepai/Mava/pull/276
+ 62 more

✨ New Contributors

@Nashlen made their first contribution in https://github.com/instadeepai/Mava/pull/273
@mnguyen0226 made their first contribution in https://github.com/instadeepai/Mava/pull/286
@mmorris44 made their first contribution in https://github.com/instadeepai/Mava/pull/319
@AsadJeewa made their first contribution in https://github.com/instadeepai/Mava/pull/320
@RuanJohn made their first contribution in https://github.com/instadeepai/Mava/pull/341
@sash-a made their first contribution in https://github.com/instadeepai/Mava/pull/370
@EdanToledo made their first contribution in https://github.com/instadeepai/Mava/pull/353
Full Changelog: https://github.com/instadeepai/Mava/compare/0.1.0...0.1.1

Mava-0.1.0·0.1.0·Pre-release

KaleabTessera·4y ago·July 6, 2021

📦 Mava Core

Components
Architectures
Added Centralised, Decentralised, Networked and State Based Architectures.
Modules
Added Broadcast Communication, Epsilon Decay Scheduling, Additive and Monotonic Mixing and Fingerprint Stabilization.
Networks
Added Additive and Monotonic Mixing Networks, Hypernetworks, Communication Networks, Epsilon Greedy and DiscreteValued head.
Environment Loops
+ 3 more
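
For readers unfamiliar with the Additive and Monotonic Mixing entries listed above, the following is a minimal sketch of the two ideas, not Mava's implementation. The function names `additive_mix` and `monotonic_mix` are hypothetical. The fixed `weights` argument is also a simplifying assumption: in QMIX-style systems the mixing weights are typically produced by hypernetworks conditioned on the global state.

```python
from typing import Sequence


def additive_mix(agent_qs: Sequence[float]) -> float:
    """Additive (VDN-style) mixing: the joint value is the sum of per-agent values."""
    return sum(agent_qs)


def monotonic_mix(
    agent_qs: Sequence[float],
    weights: Sequence[float],
    bias: float = 0.0,
) -> float:
    """Single-layer monotonic (QMIX-style) mixing with hypothetical fixed weights.

    Taking the absolute value of each weight keeps the joint value
    non-decreasing in every per-agent value.
    """
    if len(agent_qs) != len(weights):
        raise ValueError("agent_qs and weights must have the same length")
    return sum(abs(w) * q for w, q in zip(weights, agent_qs)) + bias


if __name__ == "__main__":
    qs = [1.0, 2.0, 0.5]
    print(additive_mix(qs))
    print(monotonic_mix(qs, weights=[0.5, -1.0, 2.0]))
```

Because every effective weight is non-negative, increasing any single agent's value can never decrease the mixed value, which is the property that monotonic mixing relies on.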

📦 Systems

Added feedforward training for maddpg, mad4pg, madqn, mappo, vdn and qmix.
Added recurrent training for madqn, dial, maddpg and mad4pg.
Added continuous network heads for maddpg, mad4pg and mappo.
Added decentralised architecture training for maddpg, mad4pg, madqn, mappo, dial, vdn and qmix.
Added centralised architecture training for maddpg, mad4pg and mappo.
Added state based architecture training for maddpg and mad4pg.
Added networked architecture training for maddpg.

📦 Environments/ Environment Wrappers

Added PettingZoo, SMAC, RoboCup, OpenSpiel, Flatland, Debug Simple Spread, Debug Switch environment and Debug Two-Step game.

📦 Examples

MAVA Pre-release·0.0.9·Pre-release

KaleabTessera·4y ago·June 9, 2021

MAVA Pre-release test before official `0.1.0` release.