nqanh / video2command Goto Github PK
View Code? Open in Web Editor NEWTranslating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks - ICRA18
License: MIT License
Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks - ICRA18
License: MIT License
Hello,
Again thank you for providing the ResNet50 feature files last time. However I'd actually be more helpful if you can provide the missing script that you used to generate all those feature files. That is, the script that you used to generate all:
One of the keys for training you mentioned during your paper is the way you augment the frames with synthetic imagenet frames. I think this part of the code is also included in your missing script. (This is also the part that interests me the most)
Thank you again in advance!
Hello, Prof. Nguyen.
I want to use the TCN Module of V2C net. It would be very beneficial to me if you can provide the code of the same.
Hello, first of all, this is a very interesting work combing captioning into the robotic application.
I am wondering if any pre-trained models are available for inference? Or would you mind providing the missing pre-extracted features with ResNet50 so that it will be easier to train a model on my own?
Thanks in advance!
Edit: From my review of the code, it seems that the part of the code for extracting image features and packing up (images, captions) into a single pkl file is missing.
Data_io.py file reported an error. Is there any missing file?
IndexError:list index out of range
Hello, Prof. Nguyen. I want to follow this work. My access was denied on Google drive.
How do I get the IIT-V2C dataset? Thanks.
Hello,
Thank you for your answer to the last question. The model has been successfully run.Now I am confused about the video frame extraction method and supplementary frame method. I hope to get your help. Thank you again.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.