Comments (11)
Yes the attached network should be trained on the full dataset.
from cross_attention_renderer.
Hi,
Can you post the error you are getting when loading the pretrained weights as well as the associated command you are running?
Unfortunately, using a small subset of the dataset will get a decent bit poorer results -- but should still result in something that can render intermediate views.
Best,
Yilun
from cross_attention_renderer.
Thanks for the quick reply!
Sure I will attach the screenshot very soon.
In the meanwhile, so when you trained the network, did you use the full dataset?
from cross_attention_renderer.
This is what I got when I just ran the code without changing anything and with this command :
python experiment_scripts/eval_realestate10k.py --experiment_name vis_realestate --batch_size 3 --gpus 1 --checkpoint_path logs/realestate/checkpoints/model_current.pth
After I change the --views to 2 (which was 1 for the default value),
I'm guessing it's the number of token size that causes the problem. Maybe I should do something to the image size?
from cross_attention_renderer.
Can you add the argument --views=2
to the command you are running?
from cross_attention_renderer.
Probably because I already changed it to 2 in the codes
from cross_attention_renderer.
Hmm that's a bit bizarre -- can you confirm that the colab runs normally for you? Can you also try to make a fresh install of the repo? I've never seen the above error before
from cross_attention_renderer.
actually, I tried to run the training codes, it works perfectly fine.
Hmm.. maybe the weights were trained using higher resolution? I'm not sure.. Because the error is when loading the positional embedding, It should be related to image size
from cross_attention_renderer.
Hmm what images are you using at the test resolution? I don't think that should be the cause -- in principle the weights should be initialized independent of image resolution. If the model arguments are the same as training, they should be loaded correctly. You can also check to the colab to see how the weights of the model should be loaded.
from cross_attention_renderer.
Oh right, I will take a look at those parts. Thanks!
One last question is that does 'cvpr2023_wide_baseline_data.tar.gz' file contains poses for subset of ACID and RealEstate or poses for full ACID and RealEstate?
From the codes, it seems that we need .mat pose files, but from the generate_ACID or RealEstate codes, I can't find lines that convert txt -> mat. Maybe I'm missing something.
from cross_attention_renderer.
Cool thanks! Feel free to send another message if there are any additional problems. And let me know what changes to the code (if any) were needed to get things to run. I can also try running the evaluation code later on my end.
The tar.gz file should have poses for the full ACID and RealEstate.
from cross_attention_renderer.
Related Issues (10)
- Error in colab HOT 10
- Regarding square crop HOT 3
- Train / test splits of ACID dataset HOT 6
- File missing HOT 1
- Failed to reproduce the results HOT 9
- Mat file for realestate10K poses HOT 3
- Mapping not support in python3.10 in colab HOT 4
- Experimental details HOT 2
- RealEstate10K download error HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cross_attention_renderer.