hhb072 / waveletsrnet Goto Github PK

View Code? Open in Web Editor NEW

240.0 7.0 67.0 1.58 MB

A pytorch implementation of Paper "Wavelet-srnet: A wavelet-based cnn for multi-scale face super resolution"

License: MIT License

Python 98.80% Shell 1.20%

face-hallucination super-resolution face

waveletsrnet's People

Contributors

Stargazers

Watchers

Forkers

shubhampachori12110095 oguzhanmeteozturk ehumss ineslyl remyee kittyyinhui q512624756 liningxiao jingang-cv githublzb amwons cosmoshua xmhzz2018 haitmai yummycats 20171130 jedfang merle314 jianqiangren gavin666github ballmdr prasanna1991 hwpengtristin xu-shi andyzhang59 waveletpytorch li-xiaomeng zhuangzhong ncbwct soccergame ahn19 lisongjiang wcy-cs merven96 yanchb3 dickyqi 3d-wuhan lotayou huangbinyang tony109060581 yangtong1989 gertoe joeupwu isunnii hanrui56 yexuehua 2019125078 nouro93 huntkao robotseye maggie84 alexzou14 pro-flynn sunrise6513 buaa-jingxiao road2018 hamidreza-dastmalchi twistedmove jhenma iq-scm chicean

waveletsrnet's Issues

typical texture loss values

What are the typical texture loss values while training... I have done my own implementation of the above network.. the texture loss has a torch.mean its value to 0.0005 range.. where as other losss sr loss, lr loss and full image loss have a torch.sum() component.. and are just divided by the batch size... they are in 10^3 range the total loss is dominated by sr loss,lr loss and full image loss. I dont understand how the texture loss is going to contribute towards the learning.

Terrible results

I trained the model by myself,but the results were terrible,there are always black things around eyes.

The size of tensor a (128) must match the size of tensor b (32) at non-singleton dimension 3

could anyone else help me ,i fail to run this code . it shows in this part of code:
loss_lr = loss_MSE(wavelets_predict[:,0:3,:,:], wavelets_lr, opt.mse_avg)
loss_sr = loss_MSE(wavelets_predict[:, 3:, :, :], wavelets_sr, opt.mse_avg)
loss_textures = loss_Textures(wavelets_predict[:, 3:, :, :], wavelets_sr)
loss_img = loss_MSE(img_predict, target, opt.mse_avg)
the "wavelets_predict[:,0:3,:,:]" tensor.size ([16, 48, 128, 128]), but "wavelets_lr" is([16, 3, 32, 32]), i wanna know did anyone meet the same problem, how did you solve it?

why sgd can't converge?

thx for sharing
in ur paper, u mentioned that u used sgd
but ur code used adam instead
i changed sgd to adam, but the model couldn't get converged, why did it happen?

FileNotFoundError: [Errno 2] No such file or directory: 'img_align_celeba\\002714.jpg'

Hello, I got this error in the python3 version of the code. Can anyone help me?

How could I get the dataset?

I am trying to running the code. It shows an error: No such file or directory: 'path/celeba/train.list'
which mean I have to get the CelebA dataset first. But how should I get the dataset and how should I matching the directories?

####pre-trained model

Hello, could you please provide your training model file or checkpoint ? I want to test it on my data set

why need this

Test result and dicussion

Hello, authors. Thanks for your contribution and good approach.

I have tried to test your code by making test only code and today fully tested you network.
I trained your network (for 950 epoch) and then tested several images by importing the trained model.
Then I found very good result for some images but sometimes I saw very strange noise.
One of the most serious artifact on reconstructed images was like a small snake (let me call it small snake noise, SSN).

When the network has very clean image, the performance was excellent!!
However, when I input a noisy image such as jpeg noise, in the reconstructed images, I could see that noises.
It look like trying to connect every tiny details (some times noises), so very small noises were connected one another and boosted up. In my opinion, they originated from Wavelet method and hence I thought it can be improved by adjusting Wavelet method or something. Could you please tell me how can I or please guide me if I have a wrong point for the problem.

Thank you so much.

Questions about the paper

After reading your Wavelet-SRNet paper,there are three questions in the paper requires consulting you:

How is the channel number 1024 of the output feature map after the embedded network?
Why is the number of channels in the wavelet prediction network 32 and 64?
Can this reconstruction method improve low-resolution face recognition rates?