Coder Social home page Coder Social logo

Comments (8)

dllu avatar dllu commented on May 18, 2024 6

OK, after 50k iterations with the above settings, it is looking pretty reasonable.

image

image

from gaussian-splatting.

Snosixtyboo avatar Snosixtyboo commented on May 18, 2024 1

That makes it hard... I would suggest reducing the scaling, initial, and final position learning rate, e.g., divide them all by a factor 3 for starters.

from gaussian-splatting.

Snosixtyboo avatar Snosixtyboo commented on May 18, 2024

Hi, would it be possible to get access to the dataset (your colmap input) so we can get an idea what's going on?

from gaussian-splatting.

dllu avatar dllu commented on May 18, 2024

Unfortunately the input dataset contains confidential proprietary data.

from gaussian-splatting.

Snosixtyboo avatar Snosixtyboo commented on May 18, 2024

40

We'd be happy to hear how it goes! There's a few things that could happen at this scale we did not observe, our evaluation was mainly room-sized or smaller outdoor areas. The fact your first run ended in 6 minutes on what seems like an urban data set seems curious.

from gaussian-splatting.

dllu avatar dllu commented on May 18, 2024

reducing learning rate

Thanks for the pro tip!

image

Indeed, with the following parameters

--iterations 50000 --scaling_lr 0.0002 --rotation_lr 0.0002 --position_lr_init 0.00003 --opacity_lr 0.01 --feature_lr 0.001

it seems the loss is decreasing healthily. Let's see what it ends up at after 50k iterations!

from gaussian-splatting.

grgkopanas avatar grgkopanas commented on May 18, 2024

How is it number-wise compared to the taichi implementation? Do you think there is room for improvement considering the hyper-params?

from gaussian-splatting.

Snosixtyboo avatar Snosixtyboo commented on May 18, 2024

OK, after 50k iterations with the above settings, it is looking pretty reasonable.

image

Great, thanks for letting us know!

A short word on urban data sets (there is also a section on it in the FAQ): The approach considers the distribution of the cameras and scales the learning rate accordingly (spatial_lr_scale). I.e., Gaussians will move faster if the cameras are 1,000 units apart than if it's only 1, since by default we have no absolute scaling information for how much one unit is. If cameras are distributed over kilometers or on a long straight path, you can therefore mess it up quite a bit, since Gaussians will zip off quickly. If you know the extent of your dataset, then extending the approach such that it sets learning rates from this information should be trivial (constant * spatial_lr_scale / real_extent). Or, if the coordinates of the point data sets are always at a known size (1unit = 1m), then replacing spatial_lr_scale by a constant should also do it.

from gaussian-splatting.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.