Coder Social home page Coder Social logo

Comments (31)

gariel-google avatar gariel-google commented on April 29, 2024 3

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024 1

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024 1

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

Hi @gariel-google, are the models that you provide trained on images 416x128? When I tried inference with other resolutions it doesn't work well at all.

If it's indeed 416x128, have you tried training with higher resolutions? I know some previous work use 416x128 for training, but recently most methods use higher resolutions and experiments have demonstrated higher resolutions lead to better results. Is it something related to the GPU memory issue?

Hi, did you retrain for higher resolution? If so, can you share the training script? Thanks.

from google-research.

kwea123 avatar kwea123 commented on April 29, 2024

no, I just tested the inference.

from google-research.

kwea123 avatar kwea123 commented on April 29, 2024

@gariel-google Could you release the full paper that you submitted to ICCV? I downloaded the paper from ICCV QR code but can't find the supplementary matrials. Thank you.

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

@gariel-google Can you release the code to train the model from image sequence only. Thanks in advance.

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

@gariel-google thanks for your quick reply. Looks like this current training code will only work if we provide the following three items.

  1. a png file which is a stitched image from three consecutive frames.
  2. a camera matrix file.
  3. a png file is a mask image for moving objects.

But, I have only image frames.

In the paper, I find
@ 5.1 EuRoC Micro Aerial Vehicle Dataset:
.............. Vicon scene 3d scans, and camera calibration, we only used the monocular videos for
training....
So, I thought you have some other training code for only image frames. Does it make sense?

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

Yes. That's the code. :) I would be grateful if you release them. Thanks.
By this time, I made segmentation masks (easy for me, as there is no moving objects in my case) and fake camera matrix (by guessing the focal length and the principal point).

Unfortunately, it's not working. The error message I get

I1106 11:13:31.460314 140258611402496 train.py:164] Attempting to resume training from depth_from_video_in_the_wild/kitti_learned_intrinsics/model-248900...
I1106 11:13:31.460670 140258611402496 train.py:166] Last checkpoint found: None
I1106 11:13:31.460769 140258611402496 train.py:173] Training...
INFO:tensorflow:Error reported to Coordinator: <class 'AttributeError'>, 'dict' object has no attribute 'iteritems'
I1106 11:14:45.539946 140258611402496 coordinator.py:224] Error reported to Coordinator: <class 'AttributeError'>, 'dict' object has no attribute 'iteritems'

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

Ok. I will check with python 2. Thanks.

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

Yeah. It works with python2. Thanks again @gariel-google

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

@gariel-google I going to train on my images, but not sure exactly how to make samples.
I have sequence, lets say, img1, img2, img3, img4, img5, img6...
sample1: img1, img2, img3
sample2: img2, img3, img4
sample3: img3, img4, img5
and so on

I appreciate your reply.
Thanks

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

studennis911988 avatar studennis911988 commented on April 29, 2024

Yes, it does. It helps a lot. We can release the code for training on EuRoC, which would read from time-sorted sequences of files, will use no masks and will learn the camera matrix. Please confirm that this is what you'd need. It may take a week or two to release.

@gariel-google Hi, thanks for your open-source code and can I know is there any schedule on releasing the code for training EuRoc dataset? Thanks in advanced.

from google-research.

Beniko95J avatar Beniko95J commented on April 29, 2024

@gariel-google Hi, thanks for your work! I am also looking forward to seeing your training code for the EuRoC dataset. BTW, is it possible for you to release the trained model for the EuRoC dataset too?

Best regards

from google-research.

studennis911988 avatar studennis911988 commented on April 29, 2024

@gariel-google Really appreciate for your hard work, however I have a little problem here.
Since I wish to get the real depth data from the disparity map produced by the model, I'm wondering is it right to calculate the depth by the traditional stereo disparity to depth formula ? Or there is another way to do that.
Thanks in advanced!

from google-research.

cognitiveRobot avatar cognitiveRobot commented on April 29, 2024

@gariel-google Hi, any update on releasing the code and the checkpoint file? Thanks

from google-research.

studennis911988 avatar studennis911988 commented on April 29, 2024

@gariel-google Thanks for all your hard work on releasing the code for Euroc dataset, but could you provide some instructions about it in readme section?
Thanks in advanced!

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

studennis911988 avatar studennis911988 commented on April 29, 2024

@gariel-google Thanks for the explanation, but I still have two questions about render the depth from EuRoc dataset.
First of all, how can I get the real depth value from the depth image produced by the render_euroc_depth.py?
Second, I had noticed that I don't have to provide model (checkpoints) to run the script(render_euroc_depth.py), though it still give the depth images, so I'm wondering if I use the wrong script ?
Thanks for your attention!

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

studennis911988 avatar studennis911988 commented on April 29, 2024

@gariel-google Thanks for the detailed explanation!

If I understand correctly, the render_euroc_depth.py is just a script that provides groundtruth depth map for evaluating the learned depth map which is produced by trained model as mentioned in the paper.

As now I'm wondering how can I use the downloaded EuRoC cam0 data to produced the depth map from the model(checkpoint you provide for all 11 sequences) , moreover, is the depth map produced by model gives as the true real world depth value ?( since I have noticed that the groundtruth depth gives us some strange depth value like nagative 26 or positive 198)

Thanks again!

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

studennis911988 avatar studennis911988 commented on April 29, 2024

@gariel-google
Thanks for the clarification, and I'll look deep in to the code to find the bug.
By the way, which script should I run to use the checkpoints you provided for all 11 sequences to produce learned depth images?

from google-research.

studennis911988 avatar studennis911988 commented on April 29, 2024

@gariel-google Sorry for bothering you, I'm wondering is there father update for euroc dataset ?

from google-research.

gariel-google avatar gariel-google commented on April 29, 2024

from google-research.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.