Coder Social home page Coder Social logo

yingfan-bot / mbpsrl Goto Github PK

View Code? Open in Web Editor NEW
5.0 1.0 2.0 87 KB

Code for "Model-based Reinforcement Learning for Continuous Control with Posterior Sampling", ICML 2021

Python 100.00%
reinforcement-learning model-based-rl posterior-sampling

mbpsrl's Introduction

Code for paper: Model-based Reinforcement Learning for Continuous Control with Posterior Sampling (https://arxiv.org/abs/2012.09613)

Please see requirements.txt for package dependencies.

Directly run files with configuration for each environment:

Stochastic Cartpole:

python run_cartpole.py --with-reward True

(with oracle rewards)

python run_cartpole.py --with-reward False

(without oracle rewards)

Stochastic Pendulum:

python run_pendulum.py --with-reward True

(with oracle rewards)

python run_pendulum.py --with-reward False

(without oracle rewards)

Reacher:

python run_reacher --with-reward True

(with oracle rewards)

python run_reacher.py --with-reward False

(without oracle rewards)

Pusher:

python run_pusher.py --with-reward True

(with oracle rewards)

python run_pusher.py --with-reward False

(without oracle rewards)

Cumulative rewards are saved as envname_log.txt files.

If you find the code useful, please cite:

@InProceedings{pmlr-v139-fan21b,
  title = 	 {Model-based Reinforcement Learning for Continuous Control with Posterior Sampling},
  author =       {Fan, Ying and Ming, Yifei},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {3078--3087},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/fan21b/fan21b.pdf},
  url = 	 {https://proceedings.mlr.press/v139/fan21b.html}

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.