
PyTorch + Stable-Baselines3 Implementation of RoboCLIP

This repository contains the implementation for the NeurIPS 2023 paper, RoboCLIP: One Demonstration is Enough to Learn Robot Policies.

Setting up the env

We recommend using conda and provide a .yml file for setting up the environment.

git clone https://github.com/sumedh7/RoboCLIP.git --recursive
cd RoboCLIP
conda env create -f environment_roboclip.yml
conda activate roboclip
pip install -e mjrl
pip install -e Metaworld
pip install -e kitchen_alt
pip install -e kitchen_alt/kitchen/envs
wget https://www.rocq.inria.fr/cluster-willow/amiech/howto100m/s3d_howto100m.pth
wget https://www.rocq.inria.fr/cluster-willow/amiech/howto100m/s3d_dict.npy
mv S3D_HowTo100M/s3dg.py ./

If you run into download issues with the S3D weights (the two wget commands above), the two files can also be obtained from our Google Drive:
https://drive.google.com/file/d/1DN8t1feS0t5hYYTfXQfZd596Pu9yXJIr/view?usp=sharing
https://drive.google.com/file/d/1p_CMGppAQk3B0_Z0i1r2AN_D1FM3kUFW/view?usp=sharing
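
For reference, the two downloaded files are the S3D video-language checkpoint and its word dictionary. A minimal loading sketch, assuming the s3dg.py interface from the S3D_HowTo100M repository (double-check the names against that file):

import torch
from s3dg import S3D

# S3D video-language model with the HowTo100M word dictionary and a 512-d joint embedding.
net = S3D('s3d_dict.npy', 512)
net.load_state_dict(torch.load('s3d_howto100m.pth'))
net.eval()

# Videos are Batch x 3 x T x H x W, normalized to [0, 1].
video = torch.rand(1, 3, 32, 224, 224)
with torch.no_grad():
    video_emb = net(video)['video_embedding']
    text_emb = net.text_module(['robot opening a drawer'])['text_embedding']

# RoboCLIP's reward is derived from the similarity of such video and text/demo embeddings.
similarity = torch.matmul(video_emb, text_emb.t())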

How to use it?

To run experiments on the Metaworld environment suite with the sparse learnt reward, first specify the demonstration to be used. For textual input, uncomment line 222, comment out line 223, and set the prompt you would like to use in the text_string param. To use a human demonstration instead, uncomment line 223 and pass the path to the gif of the demonstration you would like to use. For a Metaworld video demo, set human=False and set the video_path. A sketch of these options is shown below.
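
The exact wrapper and argument names live in metaworld_envs.py; the sketch below is illustrative only (the MetaworldSparse name, the prompt text, and the gif filenames are placeholders based on the description above), and the script itself should be treated as authoritative:

# Text prompt (line 222 uncommented, line 223 commented out) -- placeholder names:
env = MetaworldSparse(env_id="drawer-open-v2-goal-hidden",
                      text_string="robot opening a drawer", time=True)

# Human demonstration gif (line 223 uncommented instead):
# env = MetaworldSparse(env_id="drawer-open-v2-goal-hidden",
#                       video_path="gifs/human_opening_drawer.gif", human=True, time=True)

# Metaworld video demonstration:
# env = MetaworldSparse(env_id="drawer-open-v2-goal-hidden",
#                       video_path="gifs/drawer-open-v2.gif", human=False, time=True)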

We provide the gifs used in our experiments in the gifs/ directory. Then run:

python metaworld_envs.py --env-type sparse_learnt --env-id drawer-open-v2-goal-hidden --dir-add <add experiment identifier>

To run the Kitchen experiments, similarly specify the gif path on line 345 and then run the following command with --env-id set to Kettle, Hinge, or Slide.

python kitchen_env_wrappers.py --env-type sparse_learnt --env-id Kettle --dir-add <add experiment identifier>

These runs produce default TensorBoard logs and save to disk the best eval policy obtained by training on the RoboCLIP reward. The plots in the paper are produced by finetuning these policies for a handful of episodes. To replicate the Metaworld finetuning, run:

python metaworld_envs.py --env-type dense_original --env-id drawer-open-v2-goal-hidden --pretrained <path_to_best_policy> --dir-add <add_experiment_identifier>  
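
Internally, --pretrained points the script at a Stable-Baselines3 checkpoint saved by the sparse_learnt run. Purely as a sketch, assuming the policy was trained with PPO (the checkpoint path, environment wrapping, and finetuning budget below are placeholders rather than the paper's exact settings), finetuning such a checkpoint looks roughly like:

from metaworld.envs import ALL_V2_ENVIRONMENTS_GOAL_HIDDEN
from stable_baselines3 import PPO

# Dense-reward, goal-hidden Metaworld env; metaworld_envs.py wraps this further
# (time limits, evaluation callbacks, etc.).
env = ALL_V2_ENVIRONMENTS_GOAL_HIDDEN["drawer-open-v2-goal-hidden"]()

# Load the best eval policy saved during RoboCLIP-reward training and finetune it briefly.
model = PPO.load("path/to/best_model.zip", env=env)  # placeholder path
model.learn(total_timesteps=50_000)                  # illustrative budget only
model.save("finetuned_drawer_open")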

FAQ for Debugging

Please use the older version of Metaworld, i.e., the pre-Farama-Foundation release (the Metaworld submodule installed above). Rendering can also be an issue at times, so setting the right renderer is necessary; we found egl to work well:

export MUJOCO_GL=egl
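
Setting the variable in the shell as above is the documented fix; setting it from Python should be equivalent as long as it happens before the MuJoCo bindings are imported (a small convenience sketch, not from the repo):

import os

# Must be set before mujoco / metaworld are imported, or the default renderer is picked up.
os.environ["MUJOCO_GL"] = "egl"

import metaworld  # noqa: E402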


roboclip's Issues

Missing the "d4rl_alt" folder

Hi authors,
Congrats on this great work! I noticed the "d4rl_alt" folder might be missing while I was trying out the code. Could you kindly upload this?

from d4rl_alt.kitchen.kitchen_envs import KitchenMicrowaveHingeSlideV0, KitchenKettleV0

Cannot train successfully on drawer-open-v2-goal-hidden task

Hi authors,

I tried to reproduce the performance reported in the paper, using the default parameters and the models this repo provides. Following the instructions in the README, I got these results:
[Two screenshots of the training curves, dated 2024-03-26, were attached to the original issue.]

Success-rate logging was not included in your version, so I added the following lines to the env wrapper so that Stable-Baselines3 records the success rate:


if "is_success" not in info:
info["is_success"] = bool(info["success"])

Even after 1.2M steps the agent still has not learned to open the drawer, and the run has already taken more than 15 hours on a 4090.

Did I do anything wrong? And what is the typical number of training steps when using RoboCLIP?

Thanks,
Best regards,
Yuhui
