alibaheri / rl-examples

This project forked from anjum48/rl-examples

Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow

Python 99.23% Shell 0.77%

rl-examples

Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow. Most of my research is in the continuous domain, and I haven't spent much time testing these in discrete domains such as Atari.

BipedalWalker-v2 solved using PPO with an LSTM layer

Algorithms Implemented

Thanks to DeepMind and OpenAI for making their research openly available. Big thanks also to the TensorFlow community.

Algorithm   arXiv link                         Paper
DDPG        https://arxiv.org/abs/1509.02971   Continuous control with deep reinforcement learning
A3C         https://arxiv.org/abs/1602.01783   Asynchronous Methods for Deep Reinforcement Learning
PPO         https://arxiv.org/abs/1707.06347   Proximal Policy Optimization Algorithms
DPPO        https://arxiv.org/abs/1707.02286   Emergence of Locomotion Behaviours in Rich Environments
GAE         https://arxiv.org/abs/1506.02438   High-Dimensional Continuous Control Using Generalized Advantage Estimation

  • GAE was used in all algorithms except for DDPG
  • Where possible, I've added an LSTM layer to the policy and value functions. This usually made training on the more complex environments more stable (but slower)
  • DPPO is currently a bit unstable and is a work in progress
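
For readers unfamiliar with GAE, here is a minimal sketch of the advantage calculation described in the GAE paper linked above. It is not taken from this repository's scripts: the function name `compute_gae`, its arguments, and the default `gamma` and `lam` values are assumptions chosen for illustration, and it uses plain Python lists so it runs without TensorFlow. It assumes a single trajectory segment with no episode boundary inside it.

```python
def compute_gae(rewards, values, bootstrap_value, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation for one trajectory segment.

    rewards[t] and values[t] are the reward and value estimate at step t.
    bootstrap_value is the value estimate for the state after the last
    step (use 0.0 if the episode ended there).
    Returns (advantages, value_targets) as lists of floats.
    """
    n = len(rewards)
    advantages = [0.0] * n
    running = 0.0
    # Walk backwards so each step can reuse the advantage of the next step.
    for t in reversed(range(n)):
        next_value = values[t + 1] if t + 1 < n else bootstrap_value
        # One-step TD residual: delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
        delta = rewards[t] + gamma * next_value - values[t]
        # A_t = delta_t + gamma * lam * A_{t+1}
        running = delta + gamma * lam * running
        advantages[t] = running
    # Regression targets for the value function: A_t + V(s_t)
    value_targets = [a + v for a, v in zip(advantages, values)]
    return advantages, value_targets
```

With `lam=0` this reduces to the one-step TD residual, and with `lam=1` it becomes the discounted return minus the value baseline, so `lam` trades bias against variance.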

Training

Each Python script is standalone. Run it as you would any single Python file, from a terminal or in your IDE. The models and TensorBoard summaries are saved in the same directory as the script. DPPO has a helper script to start the worker threads.

Requirements

  • Python 3.5+
  • OpenAI Gym
  • TensorFlow 1.4
  • NumPy 1.13+

DPPO was tested on a 16-core machine using CPU only, so the helper script will need to be updated for your particular setup. On my setup, training on the GPU (GTX 1080) was usually no faster than training on the CPU, but your performance may differ.
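
When adapting the worker count to a different machine, one option is to derive it from the number of logical cores instead of hard-coding 16. The snippet below is only a sketch of that idea: `pick_num_workers` and its `reserved_cores` parameter are hypothetical names, not part of the DPPO helper script.

```python
import os


def pick_num_workers(reserved_cores=1):
    """Suggest a worker count: logical cores minus a few kept free, never below 1."""
    # os.cpu_count() can return None when the count cannot be determined.
    total = os.cpu_count() or 1
    return max(1, total - reserved_cores)


if __name__ == "__main__":
    print("Suggested number of workers:", pick_num_workers())
```

Keeping one core free leaves room for the coordinating process and the operating system; whether that helps depends on your machine.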
