Coder Social home page Coder Social logo

mdnpg's Introduction

Momentun-based Decentralized Natural Policy Gradient (MDNPG)

Code for paper "Decentralized Natural Policy Gradient with Variance Reduction for Collaborative Multi-Agent Reinforcement Learning"

Basic Requirements

  • Python (3.7)
  • Pytorch (1.4.0)
  • Numpy (1.21.5)
  • OpenAI Gym (0.18.0)

Code Structure

├─ CooperativeNavigation # Experiments on Cooperative navigation.
│  ├─ multiagent # Code for environment [REF].
│  │  ├─ core.py
│  │  ├─ environment.py
│  │  ├─ multi_discrete.py
│  │  ├─ policy.py
│  │  ├─ rendering.py
│  │  ├─ scenario.py
│  │  └─ scenarios
│  ├─ particle_envs # Code for environment [REF].
│  │  └─ make_particleworld.py
│  ├─ topology # Generate connectivity topology [REF].
│  │  ├─ connectivity
│  │  ├─ genconnectivity.py
│  │  └─ load_con.py
│  ├─ tools
│  │  └─ tool.py
│  ├─ nets.py # Policy network and value network.
│  ├─ rl_utils.py
│  ├─ momentum_npg.py # Class of MDNPG method.
│  ├─ mdnpg_main.py # Main file of MDNPG method.
│  ├─ momentum_pg.py # Class of MDPGT method [REF].
│  ├─ mdpgt_main.py # Main file of MDPGT method [REF].
│  ├─ rl_utils_vp.py
│  ├─ value_propagation_one_step.py # Class of one-step-version VP method.
│  ├─ value_propagation_multi_step.py # Class of multi-step-version VP method.
│  └─ value_propagation_main_on_policy.py # Main file of on-policy-version VP method.
├─ GridWorld # Experiments on GridWorld.
│  ├─ envs # Code for environment.
│  │  ├─ grid_maps
│  │  ├─ gridworld.py
│  │  └─ gridworld_4_test.py
│  ├─ multi_agents # Multi-agent GridWorld.
│  │  ├─ topology # Generate connectivity topology [REF].
│  │  ├─ nets.py # Policy network and value network.
│  │  ├─ rl_utils.py
│  │  ├─ momentum_npg.py # Class of MDNPG method.
│  │  ├─ mdnpg_main.py # Main file of MDNPG method.
│  │  ├─ momentum_pg.py # Class of MDPGT method [REF].
│  │  ├─ mdpgt_main.py # Main file of MDPGT method [REF].
│  │  ├─ pg_entropy.py # Class of PG with entropy method.
│  │  └─ pg_entropy_main.py # Main file of PG with entropy method.
│  ├─ single_agent # Single-agent GridWorld.
│  │  ├─ momentum_npg_discrete.py # Momentum-based NPG.
│  │  ├─ momentum_pg_discrete.py # Momentum-based PG.
│  │  ├─ pg_entropy_discrete.py # PG with entropy.
│  │  ├─ ppo_discrete.py # PPO.
│  │  └─ srvrnpg_discrete.py # SRVR-NPG.
│  └─ tools
│     └─ tool.py
├─ MountainCar # Experiments on MountainCarContinuous (single agent).
│  ├─ momentum_npg_continuous.py # Momentum-based NPG.
│  ├─ momentum_pg_continuous.py # Momentum-based PG.
│  ├─ ppo_continuous.py # PPO.
│  └─ srvrnpg_continuous.py # SRVR-NPG.
└─ README.md

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.