Coder Social home page Coder Social logo

bcf's Introduction

Bayesian Controller Fusion

QUT Centre for Robotics Open Source License: MIT

A hybrid control strategy for combining deep RL and classical robotic controllers. We provide two environments for both navigation and reaching tasks. For each task, we additionally provide traditional handcrafted controllers that can solve part of the task, however are not the optimal solution. Visit our project page for more information and video demonstrations.

Installation

The only requirement is that you have Conda installed on your system, and NVIDIA drivers installed if you want CUDA acceleration. To install all the required python dependencies run the following command within conda.

git clone https://github.com/krishanrana/bcf.git
cd bcf
conda env create --file env_requirements.yml/

For the manipuliabity maximising reacher task, you will additionally require PyRep and CoppeliaSim. Head to the PyRep github page for installation instructions for this environment.

Usage

The complete training pipeline for BCF and the baselines compared in this work are provides in 'main.py'. The file allows for several input arguments to allow the user to specify the task, algorithm, prior controller, and the respective hyperparameters.

python3 main.py --task "navigation" --method "BCF" --prior_controller "APF" --sigma_prior 0.4 --num_agents 10

Logging

All results are logged using Weights and Biases. An account and initial login is required to initialise logging as described on thier website.

Citation

  @article{rana2021bayesian,
    title={Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics},
    author={Rana, Krishan and Dasagi, Vibhavari and Haviland, Jesse and Talbot, Ben and Milford, Michael and S{\"u}nderhauf, Niko},
    journal={arXiv preprint arXiv:2107.09822},
    year={2021}
  }

bcf's People

Contributors

krishanrana avatar

Stargazers

 avatar  avatar Weijie Wang avatar Jialiang Fan avatar zcy avatar  avatar Antonin RAFFIN avatar  avatar  avatar  avatar  avatar Yi-Chen Li avatar Matt Shaffer avatar Jack Lindsey avatar Alexander avatar Curt Henrichs avatar  avatar Konstantinos Chatzilygeroudis avatar Eko Rudiawan Jamzuri avatar Diego Ferigo avatar jiawei avatar Michael avatar Nicola Piga avatar Silvio Traversaro avatar Ekansh Gupta avatar Jesse Haviland avatar  avatar  avatar

Watchers

James Cloos avatar Silvio Traversaro avatar  avatar Kostas Georgiou avatar Korbinian avatar Matt Shaffer avatar

bcf's Issues

Can't reproduce the results of paper.

I find something strange in the results of the code. My results of navigation are as follows. The hyper-parameters are same with yours (Num_Agents =5).

3AA0E43E-8901-49A4-9436-5D08A49AA57E
Besides, I have a question for the BCF. I see your only evaluate the ensemble policy during training. Do we need the prior controller to do evaluation ? Or just use the ensemble policy to give action?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.