
Add new algorithms (minimalrl, open, 7 comments)

seungeunrho commented on August 22, 2024
Add new algorithms

from minimalrl.

Comments (7)

rahulptel commented on August 22, 2024

Agreed. We can go with RAINBOW.


BDEvan5 commented on August 22, 2024

MuZero would also be a cool algorithm. It is a bit more complicated because of the MCTS, but it works very well.


seungeunrho commented on August 22, 2024

Hi!
I think A2C (the synchronous-update version of A3C) is a good choice.
What about implementing RAINBOW rather than Double and Dueling DQN separately?
I think standalone code for Double and Dueling DQN would add little, because in terms of implementation they are small variations of DQN.
In contrast, a simple implementation of RAINBOW might be helpful for many people.
(In fact, Dueling and Double DQN are two of the six components of RAINBOW.)
https://arxiv.org/abs/1710.02298
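To illustrate why Double and Dueling DQN are small variations of DQN, here is a minimal sketch in plain Python. It is not taken from minimalRL; the function names, the list-based Q-values, and the default `gamma` are assumptions made for illustration only.

```python
def dqn_target(reward, next_q_target, gamma=0.98, done=False):
    """Vanilla DQN: the target network both selects and evaluates the next action."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.98, done=False):
    """Double DQN: the online network selects the action, the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


def dueling_q_values(state_value, advantages):
    """Dueling head: Q(s, a) = V(s) + A(s, a) - mean over actions of A(s, .)."""
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + adv - mean_adv for adv in advantages]
```

The only difference between the two targets is which network chooses the next action, and the dueling change is confined to how the final layer combines its two outputs. The other RAINBOW components (prioritized replay, multi-step returns, distributional RL, and noisy networks) are not shown here.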


seungeunrho commented on August 22, 2024

Awesome!


BDEvan5 commented on August 22, 2024

Also, thanks so much for sharing.
These are great simple implementations for learning and have been very useful.

If you want to try something else, you could also implement them in TensorFlow.


ADGEfficiency commented on August 22, 2024

How about SAC?


Mahesha999 commented on August 22, 2024

How about Phasic Policy Gradient (PPG), since it gives better results than PPO?
Also, it would help to have an example of using these algorithms in a non-gaming environment, such as one whose observations are lists or dicts instead of image frames. I guess that would be easy, since we would use a plain NN instead of a CNN. Still, a simple example might be useful.
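As a rough sketch of the non-image case, a list or dict observation can be flattened into a fixed-order vector of floats before it is fed to a fully connected network. The helper `flatten_observation` and the example keys below are hypothetical, not part of minimalRL, and the sketch assumes every leaf value is numeric.

```python
def flatten_observation(obs):
    """Recursively flatten nested dicts, lists, and tuples of numbers into a list of floats."""
    if isinstance(obs, dict):
        flat = []
        for key in sorted(obs):  # sorted keys keep the feature order stable across steps
            flat.extend(flatten_observation(obs[key]))
        return flat
    if isinstance(obs, (list, tuple)):
        flat = []
        for item in obs:
            flat.extend(flatten_observation(item))
        return flat
    return [float(obs)]  # leaf: assumed to be a plain number


# Hypothetical observation from a non-gaming environment.
observation = {"position": [1, 2], "fuel": 0.5}
features = flatten_observation(observation)
```

The length of `features` would then set the input size of the first linear layer, in place of the convolutional layers used for image frames.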
