nikhilbarhate99 / td3-pytorch-bipedalwalker-v2

Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environments

License: MIT License

Python 100.00%

ddpg td3 deep-reinforcement-learning openai-gym bipedalwalker pytorch pytorch-implmention reinforcement-learning openai-gym-environments lunar-lander

td3-pytorch-bipedalwalker-v2's Issues

Consistent results?

Hey! Thanks for open-sourcing your implementation, it's good to see RL examples in PyTorch.

I've been using your work for some tests and I wanted to know how consistent your policy is. I mean, it sometimes trains successfully, while in other runs it fails completely.

What do you think ?

Thanks (:

Lunar Lander hyperparameters

May I ask what your hyperparameters are for Lunar Lander?

Solved Bipedal Walker Environment

I noticed that your code only checks whether the average of the last 10 episodes is above 300. But the leaderboard page requires that the average over the last 100 episodes is above 300. Did you test it with such a large averaging window? I ask because, from the figures in #2, the reward signal does not appear stable enough to reach such a high average. I've been trying to solve the same environment with DDPG, and the agent will master it but make a few mistakes in between, which makes it hard to get the 100-episode average above 300.

Also, did you ever experience the agent forgetting what it had learned after training the model longer?
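
The difference between the two averaging windows discussed in this issue can be sketched roughly as follows. This is only an illustration, not the repository's code: `SolvedTracker`, its parameters, and the sample returns are hypothetical names and made-up numbers, and the strict "above 300" comparison is an assumption taken from the wording of the issue.

```python
from collections import deque


class SolvedTracker:
    """Rolling mean of episode returns (hypothetical helper, not from the repo)."""

    def __init__(self, window=100, threshold=300.0):
        self.window = window          # number of most recent episodes to average
        self.threshold = threshold    # assumed "solved" score from the issue text
        self.returns = deque(maxlen=window)  # old returns drop out automatically

    def add(self, episode_return):
        self.returns.append(float(episode_return))

    def average(self):
        # Mean over the episodes currently in the window (0.0 if empty).
        return sum(self.returns) / len(self.returns) if self.returns else 0.0

    def solved(self):
        # Only report success once the window is completely filled.
        return len(self.returns) == self.window and self.average() > self.threshold


# Made-up returns: 95 good episodes followed by 5 failures.
sample_returns = [310.0] * 95 + [-100.0] * 5

short = SolvedTracker(window=10)
long = SolvedTracker(window=100)

short_ever_solved = False
for r in sample_returns:
    short.add(r)
    long.add(r)
    short_ever_solved = short_ever_solved or short.solved()

print(short_ever_solved)  # True: ten good episodes in a row pass the 10-episode check
print(long.average())     # 289.5
print(long.solved())      # False: five failures pull the 100-episode mean below 300
```

With these illustrative numbers, a 10-episode check would stop training early, while the 100-episode check would not be satisfied, which is the gap the issue is asking about.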
