
Comments (4)

HYDesmondLiu commented on August 23, 2024

Also, which version of D4RL were you using (and which one was used in COMBO)?
I ask because the buffer quality differs considerably across v0 to v2 (see the TD3+BC paper for details).
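To make the version question concrete, here is a minimal sketch of how the same task can be requested under different D4RL dataset versions. The helper names `d4rl_env_id` and `load_dataset` are hypothetical, made up for illustration; the `gym.make(...)` and `env.get_dataset()` calls are the usual D4RL loading pattern and need `gym` and `d4rl` installed. The halfcheetah-medium choice is just an example, not necessarily the setting used in the paper.

```python
def d4rl_env_id(task: str, quality: str, version: int) -> str:
    """Build a D4RL Gym-MuJoCo environment id such as 'hopper-medium-v2'.

    Hypothetical helper, only here to make the version suffix explicit.
    """
    return f"{task}-{quality}-v{version}"


def load_dataset(env_id: str):
    """Load a D4RL dataset by environment id (requires gym and d4rl)."""
    import gym
    import d4rl  # noqa: F401  (importing registers the D4RL environments)

    env = gym.make(env_id)
    # Returns a dict with keys such as 'observations', 'actions', 'rewards'.
    return env.get_dataset()


# The same task and quality level under two dataset versions.
ids = [d4rl_env_id("halfcheetah", "medium", v) for v in (0, 2)]
print(ids)
```

Calling `load_dataset` on each id and comparing, for example, the mean of `dataset["rewards"]` is one quick way to see whether two versions really contain different data.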

from mopo.

weihongwei0586 commented on August 23, 2024

Hi, I really appreciate your open-source code. My question is how the performance numbers in the paper are reported.

For example, in Table 1, do you report the maximum evaluation return during training or the last evaluation return? The policy's return varies widely across iterations.

image

Thanks,
Yue

I have the same problem. When I run the demo on a dataset other than mixed, the results have large variance.
image
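For anyone comparing numbers, the gap between "max" and "last" reporting can be sketched as below. This is only an illustration of the two conventions asked about above, plus a last-k average as a third option; it is not a statement of how the paper computed Table 1. The function name `summarize_returns`, the `last_k` parameter, and the return values are all made up for the example.

```python
from statistics import mean, pstdev


def summarize_returns(eval_returns, last_k=5):
    """Summarize a sequence of per-evaluation returns from one training run.

    Hypothetical helper: reports the max, the final value, and the
    mean/std over the last `last_k` evaluations.
    """
    tail = eval_returns[-last_k:]
    return {
        "max": max(eval_returns),
        "last": eval_returns[-1],
        "last_k_mean": mean(tail),
        "last_k_std": pstdev(tail),
    }


# Made-up evaluation returns with high variance across iterations.
returns = [10.0, 50.0, 20.0, 60.0, 30.0, 40.0]
summary = summarize_returns(returns, last_k=3)
print(summary)
```

With noisy curves like this, the max is always at least as large as the last-k mean, so the choice of convention can change the reported number noticeably.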


typoverflow commented on August 23, 2024

@HYDesmondLiu The config file in this repo says they used the '-v0' datasets for MOPO. But I'm still curious about the dataset version used in COMBO. Has COMBO's source code even been released?
I am also having trouble stabilizing MOPO's performance. The variance in performance across epochs is quite large.


HYDesmondLiu commented on August 23, 2024

@typoverflow
AFAIK, COMBO's source code has not been released. As I recall, they used the D4RL v2 buffers; performance differs enough between v0 and v2 that the difference is easy to spot.
"Some" DRL methods are notorious for being unreproducible.
You could refer to this paper and other related research for more information.

