Coder Social home page Coder Social logo

hhb072 / waveletsrnet Goto Github PK

View Code? Open in Web Editor NEW
240.0 7.0 67.0 1.58 MB

A pytorch implementation of Paper "Wavelet-srnet: A wavelet-based cnn for multi-scale face super resolution"

License: MIT License

Python 98.80% Shell 1.20%
face-hallucination super-resolution face

waveletsrnet's People

Contributors

hhb072 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

waveletsrnet's Issues

typical texture loss values

What are the typical texture loss values while training... I have done my own implementation of the above network.. the texture loss has a torch.mean its value to 0.0005 range.. where as other losss sr loss, lr loss and full image loss have a torch.sum() component.. and are just divided by the batch size... they are in 10^3 range the total loss is dominated by sr loss,lr loss and full image loss. I dont understand how the texture loss is going to contribute towards the learning.

Terrible results

I trained the model by myself,but the results were terrible,there are always black things around eyes.

The size of tensor a (128) must match the size of tensor b (32) at non-singleton dimension 3

could anyone else help me ,i fail to run this code . it shows in this part of code:
loss_lr = loss_MSE(wavelets_predict[:,0:3,:,:], wavelets_lr, opt.mse_avg)
loss_sr = loss_MSE(wavelets_predict[:, 3:, :, :], wavelets_sr, opt.mse_avg)
loss_textures = loss_Textures(wavelets_predict[:, 3:, :, :], wavelets_sr)
loss_img = loss_MSE(img_predict, target, opt.mse_avg)
the "wavelets_predict[:,0:3,:,:]" tensor.size ([16, 48, 128, 128]), but "wavelets_lr" is([16, 3, 32, 32]), i wanna know did anyone meet the same problem, how did you solve it?

why sgd can't converge?

thx for sharing
in ur paper, u mentioned that u used sgd
but ur code used adam instead
i changed sgd to adam, but the model couldn't get converged, why did it happen?

How could I get the dataset?

I am trying to running the code. It shows an error: No such file or directory: 'path/celeba/train.list'
which mean I have to get the CelebA dataset first. But how should I get the dataset and how should I matching the directories?

####pre-trained model

Hello, could you please provide your training model file or checkpoint ? I want to test it on my data set

Test result and dicussion

Hello, authors. Thanks for your contribution and good approach.

I have tried to test your code by making test only code and today fully tested you network.
I trained your network (for 950 epoch) and then tested several images by importing the trained model.
Then I found very good result for some images but sometimes I saw very strange noise.
One of the most serious artifact on reconstructed images was like a small snake (let me call it small snake noise, SSN).

When the network has very clean image, the performance was excellent!!
However, when I input a noisy image such as jpeg noise, in the reconstructed images, I could see that noises.
It look like trying to connect every tiny details (some times noises), so very small noises were connected one another and boosted up. In my opinion, they originated from Wavelet method and hence I thought it can be improved by adjusting Wavelet method or something. Could you please tell me how can I or please guide me if I have a wrong point for the problem.

Thank you so much.

Questions about the paper

After reading your Wavelet-SRNet paper,there are three questions in the paper requires consulting you:

  1. How is the channel number 1024 of the output feature map after the embedded network?
  2. Why is the number of channels in the wavelet prediction network 32 and 64?
  3. Can this reconstruction method improve low-resolution face recognition rates?

pretrained file

I have trained the net with the Celeba datasets,but the result is not so satisfactory. Could you please upload the pretrained file/weight file ? Appreciate your help

data arrangement

Hi, hhb072!

I'm trying to implement your project, but some problems occurred. That is I don't know how the data is arranged in your demo. So would you submit an example for that?

Thank you!

x8 weights

Can someone provide the wavelet weights for x8 upscale? It's not included in the provided pkl file (the key is only up to rec16).

Please tell me how can I ONLY test the network?

Thank you for sharing your code.

But, could you tell me in detail, how can I test the SR network without training?

I'm so sorry but I'm not familiar with these kind of code such as tensorflow and python.

Please describe the way. Thank you in advance.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.