
fnzhan / unite

192 stars · 11 watchers · 26 forks · 11.1 MB

[CVPR 2022 Oral] Marginal Contrastive Correspondence for Guided Image Generation, [CVPR 2021] Unbalanced Feature Transport for Exemplar-based Image Translation

Python 99.03% Shell 0.97%
gan image-translation

unite's People

Contributors

fnzhan


unite's Issues

What does equation (6) mean?

it is mentioned that "The intuition is that some features cannot be correctly matched if the conditional input contains some parts that do not exist in the exemplar. Thus before injecting the aligned style feature into the generation process, the unmatched feature of conditional input can be effectively corrected according to the accurate semantic information of the conditional input. The... "
why "the unmatched feature of conditional input" can be corrected by "the accurate semantic information of the conditional input"? (the conditional input is corrected by itself? strange... )
can you explain this intuition more clearly and help me figure out this?

COCO-Stuff

Good job!
But why is there no training or testing code for the COCO-Stuff dataset?

Pretrained Model

Hi,

Thank you for your impressive work.
It seems that there is something wrong with the link to the pre-trained models. Could you please share them again?
Thanks a lot for your efforts.

FID score

Do you calculate the FID score by comparing the training set with the generated images? I cannot reproduce the FID reported in the paper. Also, which FID repository did you use to evaluate the results?
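
For reference, one common choice (not necessarily the one used in the paper) is the pytorch-fid package, which compares two folders of images; the equivalent CLI is python -m pytorch_fid real_dir generated_dir. A minimal sketch, assuming pytorch-fid is installed and noting that the exact function signature can vary slightly between versions; the folder paths are placeholders:

  import torch
  from pytorch_fid import fid_score

  # Hedged sketch: FID between a folder of real images and a folder of generated images.
  fid = fid_score.calculate_fid_given_paths(
      ['path/to/real_images', 'path/to/generated_images'],  # placeholder folders
      batch_size=50,
      device='cuda' if torch.cuda.is_available() else 'cpu',
      dims=2048,  # InceptionV3 pool3 features, the standard FID setting
  )
  print('FID:', fid)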

SCM module code of MCL-Net

Hello, and thank you for your work on MCL-Net. I hope the code of the SCM (Self-Correlation Map) module can be released for reference. Thank you very much!

Question about the implementation of log_sinkhorn function

Thanks for sharing such great work and releasing the code.

I have a question about the implementation of the log_sinkhorn function in sinkhorn.py. Should it be v = eps * (a + min_eps(u, v, dim=1)) + v instead of v = eps * min_eps(u, v, dim=1) + v on Line 57?

It would be helpful if you could give a link to the official implementation of this part.

Thanks.
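
For context, below is a generic log-domain Sinkhorn sketch, not the repo's log_sinkhorn; the names C, log_a, log_b, eps are assumptions mirroring the issue. In the standard update the eps * log a (resp. eps * log b) term does appear, which is exactly the term the question asks about; how it maps onto Line 57 depends on how the repo parameterizes u and v.

  import torch

  def log_sinkhorn_sketch(C, log_a, log_b, eps=0.05, n_iters=100):
      # C: (n, m) cost matrix; log_a: (n,), log_b: (m,) log marginals.
      f = torch.zeros_like(log_a)  # row dual potential
      g = torch.zeros_like(log_b)  # column dual potential
      for _ in range(n_iters):
          # f_i = eps * log a_i - eps * logsumexp_j((g_j - C_ij) / eps)
          f = eps * log_a - eps * torch.logsumexp((g[None, :] - C) / eps, dim=1)
          # g_j = eps * log b_j - eps * logsumexp_i((f_i - C_ij) / eps)
          g = eps * log_b - eps * torch.logsumexp((f[:, None] - C) / eps, dim=0)
      # transport plan: T_ij = exp((f_i + g_j - C_ij) / eps)
      return torch.exp((f[:, None] + g[None, :] - C) / eps)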

Queries

@fnzhan Hi, thanks for open-sourcing the code base; it's really great work. I have a few queries:

  1. Can the code be trained on other semantic datasets like BDD100K / Cityscapes? If so, what changes have to be made?
  2. Can the code be trained on a custom fashion dataset for region-wise dressing? If so, what is the procedure?

Thanks in advance

512 input size error occurs

Hi, thank you for sharing your code.
I succeeded in running the code with a custom dataset,
but when I use a larger input size (512 instead of 256), I get this error:

File "UNITE\models\networks\correspondence.py", line 312, in forward
y1 = torch.matmul(f_div_C, ref_)
RuntimeError: batch1 dim 2 must match batch2 dim 1

The f_div_C tensor is doubled in width and height. If I change the tensor size manually, the next line fails with another size mismatch.

I use loadsize=512, crop_size=512, label_nc=2.

Please help me.

Thank you.
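
For reference, torch.matmul on 3-D (batched) tensors requires dim 2 of the first operand to equal dim 1 of the second, so the error means f_div_C and ref_ were flattened at different resolutions. A minimal, purely illustrative sketch of the constraint (the shapes below are assumptions, not the repo's actual tensor sizes):

  import torch

  f_div_C = torch.randn(1, 4096, 4096)  # e.g. an (HW x HW) correspondence matrix at 64x64
  ref_ = torch.randn(1, 4096, 3)         # reference features flattened to (HW x C)
  y1 = torch.matmul(f_div_C, ref_)       # OK: dim 2 of batch1 (4096) == dim 1 of batch2 (4096)

  ref_big = torch.randn(1, 16384, 3)     # at 512 input the flattened HW grows, e.g. 128*128 = 16384
  # torch.matmul(f_div_C, ref_big)       # RuntimeError: batch1 dim 2 must match batch2 dim 1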

About data inputs

Hi @fnzhan !

Thank you for providing your nice implementation.

I have a question about the inputs to the networks, especially for the CelebA edge case.

The correspondence predictor is given RGB images and a seg_map (https://github.com/fnzhan/UNITE/blob/main/models/networks/correspondence.py#L200).

The CelebA segmaps (15 channels) are created via the get_label_tensor function (https://github.com/fnzhan/UNITE/blob/main/data/celebahqedge_dataset.py#L77).
It seems that the CelebA segmaps include not only an edge map but also distance-transformed images.

Why did you use additional information such as semantic maps?
Does your method not work well on a dataset with no additional labels, e.g. AFHQ (the animal face dataset)?

Thanks.
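
For background, a distance transform turns a binary edge map into a per-pixel distance to the nearest edge, which gives the network smoother spatial guidance than a thin edge alone. A minimal illustrative sketch with OpenCV; the actual channels are defined by get_label_tensor in the repo, so this only shows the general idea:

  import cv2
  import numpy as np

  edge = np.zeros((256, 256), dtype=np.uint8)
  edge[100, 50:200] = 255                      # toy edge map
  # distanceTransform measures distance to the nearest zero pixel, so invert the edge mask
  dist = cv2.distanceTransform(255 - edge, cv2.DIST_L2, 5)
  dist = dist / (dist.max() + 1e-8)            # normalize before stacking as an extra channel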

Problems with replication on the ADE20K dataset

Hello, I am trying to reproduce UNITE on the ADE20K dataset, but after training for about 3 epochs the learned correspondences start to converge to a constant. Is this expected? Will it learn the correct correspondence if I continue training, and how many epochs do I need to train for?

Training is very slow, is that normal?

Hi! I'm training UNITE using 4 3090 GPUs with the following settings:
python3 train.py
--name test
--dataset_mode my_custom
--dataroot 'train/'
--correspondence 'ot'
--display_freq 500
--niter 25
--niter_decay 25
--maskmix
--use_attention
--warp_mask_losstype direct
--weight_mask 100.0
--PONO
--PONO_C
--use_coordconv
--adaptor_nonlocal
--ctx_w 1.0
--gpu_ids 0,1,2,3
--batchSize 8
--label_nc 29
--ndf 64
--ngf 64
--mcl
--nce_w 1.0
Yet the speed seems extremely slow. When I print a message each iteration like this:
for i, data_i in enumerate(dataloader, start=iter_counter.epoch_iter):
print("iter", I)
it turns out that each iteration takes about 3 seconds, which seems abnormally slow.
I have trained CoCosNet v1 with batch size 16 and it performs well.
Am I doing something wrong? Could you give me some advice? Thanks!
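
As a debugging aid, one way to tell whether the bottleneck is the dataloader or the model step is to time them separately. A generic sketch, where trainer.run_generator_one_step is only a stand-in for whatever training call the repo's loop actually makes:

  import time
  import torch

  t_prev = time.time()
  for i, data_i in enumerate(dataloader, start=iter_counter.epoch_iter):
      t_data = time.time() - t_prev                # time spent waiting on the dataloader
      trainer.run_generator_one_step(data_i)       # placeholder for the repo's actual training step
      torch.cuda.synchronize()                     # flush async CUDA work so timing is meaningful
      t_step = time.time() - t_prev - t_data       # time spent in the model step
      print(f"iter {i}: data {t_data:.2f}s, step {t_step:.2f}s")
      t_prev = time.time()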

Question about the output size of the warp stage in MCL-Net

Hello! I recently read the MCL-Net paper and have two questions:
1. The correspondence size in the paper is 64x64. Is it computed from the 64 patches sampled from seg and real_img for contrastive learning, or from the downsampled encoder features of the two?
2. What is the size of the image generated in the warp stage? If it matches the correspondence size, should the warp output be 3x8x8? If so, the reference image would have to go from 256x256 down to 8x8; is this done directly by downsampling?

Looking forward to your reply. Thank you!

weight nce_w?

Hello, how is the value of the nce_w weight determined? How does it relate to the dataset?

Question about UOT in the paper

Hello, after reading your paper carefully I have two questions: 1. In Figure 2, is the feature X_new, obtained by transporting the original feature X toward Z, the one that is fed into the green network, or is X itself still fed in? I could not find an explanation of this in the paper. 2. If the aligned feature X_new is fed into the green network, how is X_new obtained? As I understand it, UOT gives the transport plan T and the distance, but it is unclear to me how the mapped features are obtained. Could you explain? Many thanks.
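
For context, a common way (not necessarily the paper's exact formulation) to turn a transport plan into mapped features is the barycentric projection: each source feature is replaced by the T-weighted average of the target features, X_new_i = (sum_j T_ij Z_j) / (sum_j T_ij). A minimal sketch:

  import torch

  def barycentric_map(T, Z, eps=1e-8):
      # T: (n, m) transport plan from UOT/Sinkhorn; Z: (m, d) target features.
      # Returns (n, d) mapped source features.
      return (T @ Z) / (T.sum(dim=1, keepdim=True) + eps)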
