Coder Social home page Coder Social logo

Comments (13)

wgcban avatar wgcban commented on September 14, 2024 2

@Crawlling Did you specify the correct time steps (t) here corresponding to the fine-tuned model you use in the test config:

"t": [50, 100, 400, 650]

from ddpm-cd.

wgcban avatar wgcban commented on September 14, 2024 1

Are you using the correct pre-trained CD model? There are different configurations (t).

from ddpm-cd.

Crawlling avatar Crawlling commented on September 14, 2024 1

Are you using the correct pre-trained CD model? There are different configurations (t).

thx for reply!i've check the pre-trained CD model and config,it is correct,i don't know why this alway happen,i've already try it on 2 PC.
but i try to use pre-trained Diffusion model to train CD head model,in epoch 0,it still got pretty nice F1 and iou.thank you.

from ddpm-cd.

wgcban avatar wgcban commented on September 14, 2024 1

I see. I will look into this problem. The thing is, it is working fine with my machine. I am happy that at least you managed to fine-tune it and got reasonable iou. I will look into this problem and update you if I find the reason. In case if you find the reason, pls post it here so that it will benefit others. Thank you.

from ddpm-cd.

wgcban avatar wgcban commented on September 14, 2024 1

@Crawlling np.

from ddpm-cd.

Crawlling avatar Crawlling commented on September 14, 2024

help please

from ddpm-cd.

Crawlling avatar Crawlling commented on September 14, 2024

All the pretrained CD head models i can not use..but i can use the diffusion model.

from ddpm-cd.

Crawlling avatar Crawlling commented on September 14, 2024

I see. I will look into this problem. The thing is, it is working fine with my machine. I am happy that at least you managed to fine-tune it and got reasonable iou. I will look into this problem and update you if I find the reason. In case if you find the reason, pls post it here so that it will benefit others. Thank you.

thank you for your patience,i will.

from ddpm-cd.

Crawlling avatar Crawlling commented on September 14, 2024

specify

OMG,i forgot this,my bad, sorry for wasting your time.thank you again!

from ddpm-cd.

sitongzhen avatar sitongzhen commented on September 14, 2024

How much memory does this method take? Is 24GB not enough by default?

from ddpm-cd.

wgcban avatar wgcban commented on September 14, 2024

Hi @sitongzhen
Do you mean pre-training or fine-tuning?
I used one 48GB GPU for my experiments, and I believe a 24GB GPU should be fine. You just need to reduce the batch size in the config file. However, you can significantly save time by initializing your model with the pre-trained models provided here, as that will make the training faster even on new datasets.

Feel free to reach out to me know if you have more questions.

Also, the code works on multiple GPUs as well.

from ddpm-cd.

sitongzhen avatar sitongzhen commented on September 14, 2024

Yes, I understand. Thanks for your answer. When the batch is 8, I find the 24 G memory is insufficient in the training process.

from ddpm-cd.

wgcban avatar wgcban commented on September 14, 2024

@sitongzhen I don't expect results to vary significantly, and it is fine given the memory constraint.
I see, make it down to 4 and see the results.
Good luck!

from ddpm-cd.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.