Hi Lisa, Thanks for your wonderful work. May I ask

<a target="_blank" rel="noopener noreferrer nofollow" href="https://user-images.github

How did you derive your sampling algo? about diffusion-lm HOT 4 OPEN

xiangli1999 commented on August 28, 2024

How did you derive your sampling algo?

from diffusion-lm.

Comments (4)

XiangLi1999 commented on August 28, 2024

This is actually quite similar to the DDPM sampling algorithm. Both e-prediction and x_0 prediction will be transformed back to derive p(x_{t-1} | x_t), and both derivation rely on x_{t−1} =\sqrt{\alpha} f_\theta(x_t,t)+ \sqrt{1-\alpha} * N(0,1), where f_\theta(x_t,t) is the predicted x_0.

I think reading the last paragraph of section 4.2 could help.

from diffusion-lm.

jzhang38 commented on August 28, 2024

My confusion is that you appear to rely on the forward process q(x_{t-1}| x_0) to sample, whereas DDPM samples by predicting the mean of backward process p(x_{t-1} | x_t) (which we learn through the closed form solution of q(x_{t-1} | x_t, x_0)). Is there any deduction I can find (perhaps in other papers that also use x_0 prediction) to prove that these two samplings are mathematically equivalent?

In other words, DDPM samples through q(x_{t-1} | x_t, x_0), but Diffusion-LM samples through q(x_{t-1} | f_\theta(x_t,t)).

from diffusion-lm.

XiangLi1999 commented on August 28, 2024

Maybe checkout the last equation on page 17 of the Diffusion-LM ArXiv paper.

from diffusion-lm.

jzhang38 commented on August 28, 2024

Thanks for your prompt reply! Yeah I understand the training loss is essentially the same. My question is regarding the sampling algorithm. I think if we follow DDPM to perform sampling, we are supposed to sample with the mean as defined above, with x_0 predicted by f_\theta(x_t,t)

from diffusion-lm.

Recommend Projects

How did you derive your sampling algo? about diffusion-lm HOT 4 OPEN

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent