The pi-prl from ru-automated-reasoning-group

Problems when running the program

Hello! I'm trying to run the example in the README by running "python3 pi_PRL.py". However I got error like

not enough values to unpack (expected 5, got 4)
Timeout Error raised... Trying again
Traceback (most recent call last):
  File "pi_PRL.py", line 664, in <module>
    plot_keys=['stoc_pol_mean', 'running_score'])
  File "/home/wyx/Documents/Analogy/code/pi-PRL/mjrl/utils/train_agent.py", line 153, in train_agent_flip
    stats = agent.train_step(**args)
  File "/home/wyx/Documents/Analogy/code/pi-PRL/mjrl/algos/batch_reinforce.py", line 84, in train_step
    paths = trajectory_sampler.sample_paths(**input_dict)
  File "/home/wyx/Documents/Analogy/code/pi-PRL/mjrl/samplers/core.py", line 144, in sample_paths
    for result in results:
TypeError: 'NoneType' object is not iterable

Would you like to help me with that?

And I also noticed that if I run the code directly, I'll get error like

'numpy.random._generator.Generator' object has no attribute 'randn'

I tried to fix it by substitute all "self.np_random" by "np.random". Maybe this cause the first error. Would you like to share with me the numpy version you used in the project?

Question about PID controllers on LunarLander

Hello,

I have a question about the results that you obtained with LunarLander-v2 in the appendix of the paper. Specifically, I want to know about the criteria used for selecting observations with PID controller. Were all observations utilized, or was there a specific method for their selection in the LunarLander task?

ru-automated-reasoning-group / pi-prl Goto Github PK

pi-prl's People

Contributors

Stargazers

Watchers

Forkers

pi-prl's Issues

Problems when running the program

Question about PID controllers on LunarLander

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent