Great works! I am wondering: Why not use EDM noise samplin

<a class="user-mention notranslate" data-hovercard-type="user" data-hover

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

<a class="user-mention notranslate" data-hovercard-type="use

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Questions about the noise sampling. about svd_xtend HOT 10 OPEN

Muon2 commented on September 6, 2024

Questions about the noise sampling.

from svd_xtend.

Comments (10)

pixeli99 commented on September 6, 2024 1

@pixeli99 Thanks for your work. I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!

(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)

Thanks in advance.

Hello, as @Muon2 mentioned, this is the stratege of simple diffusion. You can look at section 3.1 of the paper for more detailed information (since I don't fully understand it either, it might be more reliable to read the original paper directly😢).

from svd_xtend.

m-muaz commented on September 6, 2024

@pixeli99 Thanks for your work.
I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!

(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)

Thanks in advance.

from svd_xtend.

pixeli99 commented on September 6, 2024

Thank you very much for raising this question,
This was due to an oversight on my part, as I originally thought that sigma in the code followed a simple log-normal distribution, but I uploaded an incorrect version;

The use of simple diffusion was merely an immature attempt of mine, because my intent was to try mixing videos of different resolutions for training. I wanted to use the strategy of simple diffusion to apply noise of varying distribution (in fact, I am not sure if my understanding is correct,). So, I am actually very eager to ask everyone for their understanding of p_train(σ).

from svd_xtend.

pixeli99 commented on September 6, 2024

Regarding the second question, it is solely because I have been lazy that I will complete this section of the code.

from svd_xtend.

m-muaz commented on September 6, 2024

@pixeli99 Thanks for your work. I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!
(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)
Thanks in advance.

Hello, as @Muon2 mentioned, this is the stratege of simple diffusion. You can look at section 3.1 of the paper for more detailed information (since I don't fully understand it either, it might be more reliable to read the original paper directly😢).

I see. I'll look into the Simple Diffusion paper.

BTW, I am curious have you tried other noise scheduling techniques besides the one mentioned in the Simple Diffusion paper?

from svd_xtend.

Muon2 commented on September 6, 2024

@pixeli99 Thanks for your work. I have a similar question, why you chose the rand_cosine_interpolated noise scheduler instead of the one mentioned in the EDM (Karras et al.) paper. The one highlighted in the following image. Correct me if my understanding is wrong!
(Edit: And what does the variables noise_d_low and noise_d_high correspond to?)
Thanks in advance.

Hello, as @Muon2 mentioned, this is the stratege of simple diffusion. You can look at section 3.1 of the paper for more detailed information (since I don't fully understand it either, it might be more reliable to read the original paper directly😢).

I think you may be wrong about the simplediffusion sigma sampling. If you are using simplediffusion , you should probably change the sampling scheduler as well, since the Euler sampler is using original EDM timesteps formula instead of the one in the simplediffusion .

from svd_xtend.

pixeli99 commented on September 6, 2024

I understand what you're saying, but I think that different sigma distributions correspond to different diffusion paths. In theory, would it be possible to use the same sampler for sampling? I suspect there might be a flaw in my understanding, but I'm not sure where I've gone wrong. When we're training, is our definition of the timestep the same, as in 0.25ln(σ)?

from svd_xtend.

pixeli99 commented on September 6, 2024

@m-muaz I haven't tried it yet, but if I make any progress, I will update here.

from svd_xtend.

Muon2 commented on September 6, 2024

I understand what you're saying, but I think that different sigma distributions correspond to different diffusion paths. In theory, would it be possible to use the same sampler for sampling? I suspect there might be a flaw in my understanding, but I'm not sure where I've gone wrong. When we're training, is our definition of the timestep the same, as in 0.25ln(σ)?

Sigma distributions would NOT affect diffusion paths actually. The key in simplediffusion is the change of alpha and beta scheduler, just changing the training sigma distributions would not help in my opinion.

from svd_xtend.

pixeli99 commented on September 6, 2024

I roughly understand what you mean, I might still need to read carefully to grasp the principle here, thank you very much for your reply.

from svd_xtend.

Questions about the noise sampling. about svd_xtend HOT 10 OPEN

Comments (10)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent