Coder Social home page Coder Social logo

Comments (10)

pixeli99 avatar pixeli99 commented on September 6, 2024 1

@pixeli99 Thanks for your work. I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!

(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)

Thanks in advance.

image

Hello, as @Muon2 mentioned, this is the stratege of simple diffusion. You can look at section 3.1 of the paper for more detailed information (since I don't fully understand it either, it might be more reliable to read the original paper directly😢).

from svd_xtend.

m-muaz avatar m-muaz commented on September 6, 2024

@pixeli99 Thanks for your work.
I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!

(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)

Thanks in advance.

image

from svd_xtend.

pixeli99 avatar pixeli99 commented on September 6, 2024

Thank you very much for raising this question,
This was due to an oversight on my part, as I originally thought that sigma in the code followed a simple log-normal distribution, but I uploaded an incorrect version;

The use of simple diffusion was merely an immature attempt of mine, because my intent was to try mixing videos of different resolutions for training. I wanted to use the strategy of simple diffusion to apply noise of varying distribution (in fact, I am not sure if my understanding is correct,). So, I am actually very eager to ask everyone for their understanding of p_train(σ).

from svd_xtend.

pixeli99 avatar pixeli99 commented on September 6, 2024

Regarding the second question, it is solely because I have been lazy that I will complete this section of the code.

from svd_xtend.

m-muaz avatar m-muaz commented on September 6, 2024

@pixeli99 Thanks for your work. I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!
(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)
Thanks in advance.
image

Hello, as @Muon2 mentioned, this is the stratege of simple diffusion. You can look at section 3.1 of the paper for more detailed information (since I don't fully understand it either, it might be more reliable to read the original paper directly😢).

I see. I'll look into the Simple Diffusion paper.

BTW, I am curious have you tried other noise scheduling techniques besides the one mentioned in the Simple Diffusion paper?

from svd_xtend.

Muon2 avatar Muon2 commented on September 6, 2024

@pixeli99 Thanks for your work. I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!
(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)
Thanks in advance.
image

Hello, as @Muon2 mentioned, this is the stratege of simple diffusion. You can look at section 3.1 of the paper for more detailed information (since I don't fully understand it either, it might be more reliable to read the original paper directly😢).

I think you may be wrong about the simplediffusion sigma sampling. If you are using simplediffusion , you should probably change the sampling scheduler as well, since the Euler sampler is using original EDM timesteps formula instead of the one in the simplediffusion .

from svd_xtend.

pixeli99 avatar pixeli99 commented on September 6, 2024

I understand what you're saying, but I think that different sigma distributions correspond to different diffusion paths. In theory, would it be possible to use the same sampler for sampling? I suspect there might be a flaw in my understanding, but I'm not sure where I've gone wrong. When we're training, is our definition of the timestep the same, as in 0.25ln(σ)?

from svd_xtend.

pixeli99 avatar pixeli99 commented on September 6, 2024

@m-muaz I haven't tried it yet, but if I make any progress, I will update here.

from svd_xtend.

Muon2 avatar Muon2 commented on September 6, 2024

I understand what you're saying, but I think that different sigma distributions correspond to different diffusion paths. In theory, would it be possible to use the same sampler for sampling? I suspect there might be a flaw in my understanding, but I'm not sure where I've gone wrong. When we're training, is our definition of the timestep the same, as in 0.25ln(σ)?

Sigma distributions would NOT affect diffusion paths actually. The key in simplediffusion is the change of alpha and beta scheduler, just changing the training sigma distributions would not help in my opinion.

from svd_xtend.

pixeli99 avatar pixeli99 commented on September 6, 2024

I roughly understand what you mean, I might still need to read carefully to grasp the principle here, thank you very much for your reply.

from svd_xtend.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.