kapkapas / a-introduction-to-reinforcement-learning

This project forked from xubo92/an-introduction-to-reinforcement-learning

RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>

Python 100.00%

A-introduction-to-reinforcement-learning

Instructions

- Why do I maintain this repository?

It was hard for me to find a complete, working set of reinforcement learning algorithm implementations, which would be of vital help to newcomers who want to get into this field.

Now that I have taken detailed reading notes on the excellent book "Reinforcement Learning: An Introduction" and implemented most of its algorithms, why not share them with the public?

I believe in the quote: 'Talk is cheap. Show me the code.'

- What is in this repository?

This repository contains the algorithms from the first 9 chapters of the book Reinforcement Learning: An Introduction (2012 version).

[Figure: on-policy MC]

The algorithms from after the 9th chapter will be added over time, following another version of this book.

  • chapter4 : Dynamic Programming

    Includes two exercises:

    1. The Gambler
    2. Jack's Car Rental
  • chapter5 : Monte Carlo Methods

    Includes an exercise called "racetrack" and experimental results on it.

    1. Racetrack.py

    Note: the Monte Carlo algorithms from chapter 5 are all implemented in the single file Racetrack.py, specifically in the function update_policy(episode).

  • chapter6 : Temporal-Difference Learning

    Includes TD-related algorithms and experimental results on racetrack.

    1. td.py

    Note: From this chapter on, I stopped implementing a separate environment for each exercise, since almost every exercise has a different one. If I ran different algorithms on different environments, one could hardly compare them. Therefore, I decided to show the performance of the different algorithms on a single, shared environment: racetrack.

    [Figure: naive-Q-lambda]

  • chapter7 : Eligibility Traces

    Includes TD(lambda)-related algorithms and experimental results on racetrack.

    1. td_lambda.py
  • chapter8 : Planning and Learning with Tabular Methods

    Includes the Dyna_Q algorithm.

    1. Dyna_Q
  • RLF : A reinforcement learning algorithm library that pulls together all the algorithms mentioned above, plus some deep reinforcement learning algorithms such as DQN and DDPG, so that they can be called conveniently from external code.
    Note: One notable feature of this library is the separation of 'environment' and 'agent algorithm' via Python classes. This way, you can add or modify your own environment without touching the agent part. You can also connect the agent algorithms (TD, Monte Carlo, DP, ...) directly to your environment.
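
The environment/agent separation described in the RLF note can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration, not the actual RLF API: `CorridorEnv`, `QLearningAgent`, `train`, and the `reset`/`step`/`act`/`learn` methods are all hypothetical. The agent is plain tabular Q-learning (one-step TD control), and the environment is a toy corridor rather than the racetrack used in this repository.

```python
import random
from collections import defaultdict


class CorridorEnv:
    """Hypothetical toy environment: start at cell 0, reach the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.actions = [0, 1]  # 0 = move left, 1 = move right
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        move = 1 if action == 1 else -1
        # Walls at both ends: the position is clipped to the corridor.
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        return self.state, -1.0, done  # every step costs -1


class QLearningAgent:
    """Tabular Q-learning; it only sees states, actions, and rewards."""

    def __init__(self, actions, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # (state, action) -> value, default 0.0
        self.rng = random.Random(seed)

    def greedy(self, state):
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def act(self, state):
        # Epsilon-greedy behaviour policy.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return self.greedy(state)

    def learn(self, state, action, reward, next_state, done):
        # One-step Q-learning update toward reward + gamma * max_a Q(s', a).
        best_next = 0.0 if done else max(
            self.q[(next_state, a)] for a in self.actions
        )
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


def train(env, agent, episodes=500, max_steps=100):
    """Generic loop: works with any env/agent pair exposing the same methods."""
    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            action = agent.act(state)
            next_state, reward, done = env.step(action)
            agent.learn(state, action, reward, next_state, done)
            state = next_state
            if done:
                break
    return agent


env = CorridorEnv(length=5)
agent = train(env, QLearningAgent(env.actions))
policy = [agent.greedy(s) for s in range(env.length - 1)]
print(policy)  # greedy action for each non-terminal cell
```

Because `train` touches the environment only through `reset` and `step`, and the agent only through `act` and `learn`, either side can be swapped without changing the other. In this sketch the learned greedy policy should prefer action 1 (right) in every non-terminal cell.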

- How to utilize them for your projects?
