Hi, First of all, thanks for your excellent research work. This is p

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Intuition behind choosing step size about attend-and-excite HOT 2 CLOSED

yuval-alaluf commented on May 31, 2024

Intuition behind choosing step size

from attend-and-excite.

Comments (2)

hila-chefer commented on May 31, 2024

Hi @nttung1110, thanks for your interest!

The intuition behind this choice is that, as mentioned in the paper, most of the spatial information (e.g., location of objects) is determined in the first denoising steps, while the last ones mainly influence fine details. Since our goal is to encourage the generation of neglected subjects, we perform larger step sizes in the first denoising steps.
You can also see the impact of applying our method in the last timesteps in Fig. 12 of the paper. The figure demonstrates that these steps mostly add artifacts and do not induce meaningful changes in the resulting image.

Referring to your specific issue- I do not recall a convergence issue with our method. The main issue that I saw before adapting the step size was that the loss decreased very significantly but resulted in OOD latents and OOD images.
Without further details on the objective you are exploring it's hard to come up with an explanation to why this is happening in your case.
I would start with very large step sizes just to make sure that your loss can decrease (without our method).

I hope this helps!

from attend-and-excite.

nttung1110 commented on May 31, 2024

Hi,

Thanks for your detailed explanation. I would try to explore more to fix this issue.

from attend-and-excite.

Recommend Projects

Intuition behind choosing step size about attend-and-excite HOT 2 CLOSED

Comments (2)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent