bmenendez / deeprl-navigation Goto Github PK

View Code? Open in Web Editor NEW

Repository for exploring navigation (collecting yellow bananas) through Deep Reinforcement Learning algorithms.

License: GNU General Public License v3.0

Jupyter Notebook 84.83% Python 15.17%

deeprl-navigation's Introduction

Deep Reinforcement Learning project: navigation

Project details

In this project, an agent is trained to solve an environment in which it should collect as many yellow bananas avoiding the blue ones as possible.

A reward of +1 is provided for collecting a yellow banana, and a reward of -1 is provided for collecting a blue banana. Thus, the goal of the agent is to collect as many yellow bananas as possible while avoiding blue bananas.

The state space has 37 dimensions and contains the agent's velocity, along with ray-based perception of objects around the agent's forward direction. Given this information, the agent has to learn how to best select actions. Four discrete actions are available, corresponding to:

0 - move forward.
1 - move backward.
2 - turn left.
3 - turn right.

The task is episodic, and in order to solve the environment, the agent must get an average score of +13 over 100 consecutive episodes.

Getting started

Follow the instructions given here to install all the dependencies.
Download the environment for your OS:
- Linux: click here
- Mac OS: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
Place the file in the root of the folder and unzip it.
Run it! Consider to change the cell at point 2 of the notebook to match with your folder env = UnityEnvironment(file_name='Banana_Linux/Banana.x86_64').

Instructions

If you want to train the agent, run the cells from the point 1 to the point 6 at Deep_Q_Network.ipynb. If you just want to execute the trained agent, run the cells from the point 1 to the point 4 plus the 7 at the notebook, since they load the weights of the network (checkpoint.pth) for the trained agent. After running whatever you want, you can close the environment by running the cell at point 8.

Description of the implementation is in Report.md, but for more technical details, see the code at the notebook provided before.

Recommend Projects

bmenendez / deeprl-navigation Goto Github PK

deeprl-navigation's Introduction

Deep Reinforcement Learning project: navigation

Project details

Getting started

Instructions

deeprl-navigation's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent