Coder Social home page Coder Social logo

Comments (9)

ruoshiliu avatar ruoshiliu commented on August 19, 2024

Hi @xyyeah , thank you for your interest in our work. I wouldn't say GSO is a subset of RTMV. A subset of RTMV, named Google Scanned Object, is composed of 300 scenes, each with 20 GSO objects. Therefore, each scene's geometry and texture is much more complicated than any single object in GSO. The experiments section of the paper as well as the appendix section have covered a lot of details regarding the evaluation process. Feel free to let me know if there's still anything unclear.

from zero123.

xyyeah avatar xyyeah commented on August 19, 2024

Thank you for your quick reply, I figured out the difference between GSO dataset and subset in RTMV. But I found that there are 2.5T data in the RTMV dataset, did you evaluate all the datasets or only a subset? If it is a subset, which subset did you evaluate?

from zero123.

ruoshiliu avatar ruoshiliu commented on August 19, 2024

We used 20 scenes randomly sampled from the Google Scanned Obejct subset, each scene with 17 novel views (1 for input and 16 for evaluation). Zero123 is very fast at inference time so we can easily evaluate hundreds of scenes. The difficulty in including more scenes for evaluation is that it's very expensive to train DietNeRF (one of our baseline). It takes around 24 hours to train one scene for DietNeRF on 1 GPU.

from zero123.

taeyeopl avatar taeyeopl commented on August 19, 2024

@ruoshiliu Thanks for your explanation.

If I want to reproduce the same results of yours in Table 1 and Table 2 as you did, Can you give some information on which 20 scenes and 17 novel views are used and 1 for the input? If datasets are not large, sharing both GSO and RTMV information will be really helpful.

from zero123.

xyyeah avatar xyyeah commented on August 19, 2024

@ruoshiliu Thanks for your quick reply.

I would like to reproduce the same evaluation results showed in the paper. I have downloaded the GSO dataset but cannot find any paired image/camera poses. Could you share more information about the preprocess method used to obtain the test datasets for the results in Table 1 and Table 2? Or as @taeyeopl suggested, could you share the preprocessed GSO and RTMV datasets ?

from zero123.

XinyangHan avatar XinyangHan commented on August 19, 2024

@ruoshiliu Looking for code for GSO and RTMV code as well~ It would be great if these preprocessing codes are released!!

from zero123.

ruoshiliu avatar ruoshiliu commented on August 19, 2024

Here's the script to render 25 views for GSO: gso_render.zip. I believe the RTMV dataset already contains 150 rendered views for each scene.

from zero123.

XinyangHan avatar XinyangHan commented on August 19, 2024

Thanks so much for your help!!!

from zero123.

jwlee-vcl avatar jwlee-vcl commented on August 19, 2024

@ruoshiliu Thanks for your rendering scripts.

It looks like the model names are in the test.json in your script, if you could share the test.json I would appreciate it.

from zero123.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.