
jacobandreas / psketch


Modular multitask reinforcement learning with policy sketches

Home Page: https://arxiv.org/abs/1611.01796

License: Apache License 2.0

Shell 0.85% Python 19.44% Jupyter Notebook 79.71%

psketch's People

Contributors

jacobandreas


psketch's Issues

Question on algorithms used to build baseline

May I ask which algorithm you used to train the joint policy and the independent policies ('Joint' and 'Indep' in Figure 4 of your paper)? Also, which algorithms are implemented in the models 'attentive.py' and 'reflex.py'?

cliff environment

Hi,

Sorry, I cannot find the cliff environment mentioned in your paper anywhere in the experiments repo. Can you tell me how to run the cliff experiments?

Regards

Are the positions of subgoals (in craft, for example: iron, grass, wood, gold, ...) fixed?

May I ask whether the positions of subgoals are fixed in all experiment environments, in both the multitask and zero-shot settings?
At first, having read only the paper, I thought that each subpolicy pi is given only the state s_i, so the model could not handle environments in which subgoal positions vary. After reading the code, my guess is that the positions are fixed there too.

Sorry for asking this so late (2021).
Thank you in advance :)

Cannot run main.py

Hi Jacob,

I recently started following your work.

I just ran main.py, but it fails with "No module named reflex". When I last checked, there was a reflex.py in the models folder.

Alternatively, could you please show me how to run the whole program to reproduce the craft experiment results?

Thx
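
One possible cause of an error like this, assuming the code was written for Python 2 and is being run under Python 3, is an implicit relative import inside a package. The sketch below does not use the repository's files: it builds a throwaway package (`demo_models`, with a hypothetical module `demo_reflex.py`) and shows that the Python 2 style import fails under Python 3 while the explicit relative form works. Whether this is what happens in psketch is an assumption, not something the issue confirms.

```python
import importlib
import os
import shutil
import sys
import tempfile


def write_demo_package(root, import_line):
    """Create a throwaway package `demo_models` whose __init__.py runs import_line."""
    pkg_dir = os.path.join(root, "demo_models")
    os.makedirs(pkg_dir)
    # Hypothetical stand-in for a module that lives next to __init__.py.
    with open(os.path.join(pkg_dir, "demo_reflex.py"), "w") as f:
        f.write("class ReflexModel(object):\n    pass\n")
    with open(os.path.join(pkg_dir, "__init__.py"), "w") as f:
        f.write(import_line + "\n")


def import_error_for(import_line):
    """Return None if the package imports cleanly, else the ImportError message."""
    root = tempfile.mkdtemp()
    write_demo_package(root, import_line)
    sys.path.insert(0, root)
    importlib.invalidate_caches()
    try:
        importlib.import_module("demo_models")
        return None
    except ImportError as err:
        return str(err)
    finally:
        # Undo every side effect so the two runs do not influence each other.
        sys.path.remove(root)
        for name in ("demo_models", "demo_models.demo_reflex", "demo_reflex"):
            sys.modules.pop(name, None)
        shutil.rmtree(root, ignore_errors=True)


# Python 2 style implicit relative import: not resolved by Python 3.
implicit = import_error_for("from demo_reflex import ReflexModel")
# Explicit relative import: resolved relative to the package itself.
explicit = import_error_for("from .demo_reflex import ReflexModel")

print("implicit:", implicit)
print("explicit:", explicit)
```

If this is the cause, the usual remedies are to run the code with the Python version it targets or to change the import inside the package to the explicit relative form.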

No GPU needed when running the experiments?

Hi Jacob,

I was running your light_modular experiment and found that GPU-Util (as reported by nvidia-smi) is always zero. Do you see the same thing? Or is it the case that your algorithm has no GPU part and runs on the CPU only? Sorry if this question sounds a little naive.
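
To make an observation like this easier to reproduce, utilization can be sampled from a script while the experiment runs. The following is a minimal sketch, not part of the repository: it calls `nvidia-smi` with its CSV query options and parses one percentage per GPU. The helper names `parse_utilization` and `gpu_utilization` are hypothetical. A constant zero may also simply mean that the installed TensorFlow build has no GPU support.

```python
import subprocess

# nvidia-smi query that prints one bare utilization percentage per GPU.
QUERY = [
    "nvidia-smi",
    "--query-gpu=utilization.gpu",
    "--format=csv,noheader,nounits",
]


def parse_utilization(output):
    """Turn nvidia-smi CSV output into a list of ints (None for non-numeric rows)."""
    values = []
    for line in output.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            values.append(int(line))
        except ValueError:
            # Some devices report a placeholder instead of a number.
            values.append(None)
    return values


def gpu_utilization():
    """Return per-GPU utilization percentages, or None if nvidia-smi cannot be run."""
    try:
        raw = subprocess.check_output(QUERY)
    except (OSError, subprocess.CalledProcessError):
        return None
    return parse_utilization(raw.decode("utf-8"))


if __name__ == "__main__":
    print(gpu_utilization())
```

Calling `gpu_utilization()` every few seconds during training gives a log that can be attached to a report like the one above.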

PyTorch version

Hi,
Do you have a PyTorch version of this method?
Best regards

Question on adaptation & zero-shot

May I ask which part of the source code produces the adaptation (learning the form of a suitable sketch) and zero-shot results? All I can see is the training process.
Thanks for the response!

System configuration

I've tried different versions of Python and TensorFlow but always get this error:

  File "/usr/local/lib/python2.7/site-packages/tensorflow_core/__init__.py", line 40, in <module>
    from tensorflow.python.tools import module_util as _module_util
ImportError: No module named tools

What versions am I supposed to use?

Cannot reproduce the result in the paper

Hi, I tried to run your code on my laptop and it runs fine. The problem is that I cannot reproduce the result on the light-joint-feat-plan task. I changed the experiment name to "test" in config.yaml and ran main.py directly. The final result is around 0.3. Are there any parameters I need to change to reproduce the result?
