Comments (4)
Hi,
Thanks for the interest in our work. Yes you are right, changing img_size in mvt config might work (https://github.com/NVlabs/RVT/blob/master/rvt/mvt/config.py#L10). But one caveat is to accordingly adjust the patch_size (https://github.com/NVlabs/RVT/blob/master/rvt/mvt/config.py#L26) so that the img_size is divisible by it.
You might also try using a img_size of 253 so that it is divisible by the current patch_size of 11.
I am not sure if any other error because of corner cases like adjusting convolution size might creep in. Let me if you face any issue.
Best,
Ankit
from rvt.
Thank you for your answer. I will try it out as soon as possible. Also, I have recreated the dataset for the open_drawer
task with 400
episodes from RLBench
, and the images within the episodes are saved at a resolution of 224
pixels. I modified the IMAGE_SIZE
in rvt.utils.peract_utils.py
from the original 128
to 224
and successfully ran train.py
. May I ask if this change won't have any significant impact on the experimental results? Additionally, if I train on a single task, can I reduce the number of epochs
to around 2
?
Thanks in advance!
from rvt.
Oh I see. To clarify, there are two image sizes. One is the input image size
(which you changed in rvt.utils.peract_utils.py
). The other one is virtual image size
(which I had referred to in my first respose in mvt_config
).
I am unsure if changing the input image size
would affect performance as in my experiments I assumed it to be fixed and same as PerAct. If I had to guess, it might not affect a lot on tasks like open drawer which need not be very precise. For other tasks it might affect performance.
I suppose you can reduce epochs (not a very clear name as it is same as steps) if you train on a single task. Ideally, I would suggest trying some set of epochs like 2
, 4
, 6
to make sure the training has converged.
from rvt.
Thank you very much for your answer! I'm currently trying to train on other tasks.
from rvt.
Related Issues (20)
- Does the line 150 give a Cartesian coordinate of a pixel? HOT 6
- Why is time an essential part of low_dim_state? HOT 2
- Can I run eval.py with multiple processes? HOT 2
- Questions about inference speed HOT 2
- About models training HOT 2
- About model training in the real-world HOT 2
- add_final and repeat in replay HOT 4
- [Training] How to evaluate when training? HOT 2
- Questions about the details HOT 5
- About the tp1 in replay HOT 3
- Does the RVT model control Franka robot in Isaac-sim? HOT 4
- Data collection issues HOT 4
- question about camera calibration in real world HOT 8
- AttributeError: 'CustomMultiTaskRLBenchEnv2' object has no attribute 'get_ground_truth_action' HOT 1
- Rendered image with color noise in the background HOT 4
- Question about the up direction of "front" and "back" camera in renderer.py HOT 2
- Details about the real-world experiments HOT 4
- Questions about the scene bounds and img aug in real-robot experiments HOT 4
- Issues while running the training script
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from rvt.