Coder Social home page Coder Social logo

Comments (11)

yilundu avatar yilundu commented on June 14, 2024 1

Yes the attached network should be trained on the full dataset.

from cross_attention_renderer.

yilundu avatar yilundu commented on June 14, 2024

Hi,

Can you post the error you are getting when loading the pretrained weights as well as the associated command you are running?

Unfortunately, using a small subset of the dataset will get a decent bit poorer results -- but should still result in something that can render intermediate views.

Best,
Yilun

from cross_attention_renderer.

SunghwanHong avatar SunghwanHong commented on June 14, 2024

Thanks for the quick reply!

Sure I will attach the screenshot very soon.
In the meanwhile, so when you trained the network, did you use the full dataset?

from cross_attention_renderer.

SunghwanHong avatar SunghwanHong commented on June 14, 2024

This is what I got when I just ran the code without changing anything and with this command :
python experiment_scripts/eval_realestate10k.py --experiment_name vis_realestate --batch_size 3 --gpus 1 --checkpoint_path logs/realestate/checkpoints/model_current.pth

image

After I change the --views to 2 (which was 1 for the default value),

I got this error :
image

I'm guessing it's the number of token size that causes the problem. Maybe I should do something to the image size?

from cross_attention_renderer.

yilundu avatar yilundu commented on June 14, 2024

Can you add the argument --views=2 to the command you are running?

from cross_attention_renderer.

SunghwanHong avatar SunghwanHong commented on June 14, 2024

I still get the same error
image

Probably because I already changed it to 2 in the codes

from cross_attention_renderer.

yilundu avatar yilundu commented on June 14, 2024

Hmm that's a bit bizarre -- can you confirm that the colab runs normally for you? Can you also try to make a fresh install of the repo? I've never seen the above error before

from cross_attention_renderer.

SunghwanHong avatar SunghwanHong commented on June 14, 2024

actually, I tried to run the training codes, it works perfectly fine.
Hmm.. maybe the weights were trained using higher resolution? I'm not sure.. Because the error is when loading the positional embedding, It should be related to image size

from cross_attention_renderer.

yilundu avatar yilundu commented on June 14, 2024

Hmm what images are you using at the test resolution? I don't think that should be the cause -- in principle the weights should be initialized independent of image resolution. If the model arguments are the same as training, they should be loaded correctly. You can also check to the colab to see how the weights of the model should be loaded.

from cross_attention_renderer.

SunghwanHong avatar SunghwanHong commented on June 14, 2024

Oh right, I will take a look at those parts. Thanks!

One last question is that does 'cvpr2023_wide_baseline_data.tar.gz' file contains poses for subset of ACID and RealEstate or poses for full ACID and RealEstate?

From the codes, it seems that we need .mat pose files, but from the generate_ACID or RealEstate codes, I can't find lines that convert txt -> mat. Maybe I'm missing something.

from cross_attention_renderer.

yilundu avatar yilundu commented on June 14, 2024

Cool thanks! Feel free to send another message if there are any additional problems. And let me know what changes to the code (if any) were needed to get things to run. I can also try running the evaluation code later on my end.

The tar.gz file should have poses for the full ACID and RealEstate.

from cross_attention_renderer.

Related Issues (10)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.