Coder Social home page Coder Social logo

zhaominyiz / c3-stisr Goto Github PK

View Code? Open in Web Editor NEW
58.0 3.0 4.0 653 KB

Official Code for 'C3-STISR: Scene Text Image Super-resolution with Triple Clues' - IJCAI 2022

License: Apache License 2.0

Python 100.00%
scene-text-images scene-text-recognition super-resolution sene-text-image-super-resolution

c3-stisr's People

Contributors

zhaominyiz avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

c3-stisr's Issues

Weird artifact in the restored result

I made the test with your code and get the same performance as you reported in Readme. That's cool, however, when I visualize the reconstruct result with visualize option in your code, I can find it has some unusual visual artifact.

restored picture is as followed, and as you can see, restoration picture shows decreased visual quality and background color is getting a little different.
lr_sr_hr_0006_810ujioz_1880uazo3_188041903_18301903_18301903

Performance is exactly same as you reported, so I ask you whether this artifact a usual one.

Best

Lower experimental precision

I have fused the code into TPGSR, but after 500 epochs of training, the result is about 5% lower than the result in the article.
(The accuracy obtained by loading the parameter file you posted is normal.)

error using the default base.py from TPGSR

hey, it seems if we want to run the test/train using your arch, we need to modify the base.py from interface directory (especially the 293rd line about generator_init.
can you share the modified base.py?

About Pretrained BCN_correct_model.pt

Hi, author!
When I want to load the pretrained model as you have advised in #7 (comment), however, I met problem in the line params = torch.load('ckpt/BCN_correct_model.pt') . I couldn't find this pretrained model BCN_correct_model.pt in your release. I wonder if it is convenient for you to share this? Thank you !

Code

I have given my start!
I also met the same issue, could you please provide me with the full code?
My email is [email protected].
Thanks a lot!

Code

Hi~ I have given my start!
I also met the issue of running the training code, could you please provide me with the full code?
My email is [email protected].
Thank you very much!

单图推理生成超分辨图

如果我只想做单张图像的超分辨率结果,请问我应该怎么改代码?

image_lr = transforms.ToTensor()(Image.open('./val_data/20230317.jpeg').convert('RGB')).unsqueeze(0).cuda()
print(image_lr.shape)
images_sr = model(image_lr) 

但这样会报错,我用的crnn.pth

conv_feature = torch.cat([conv_feature, position2d],1) # batch, 128(64+64), 32, 128
RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 240000 but got size 1024 for tensor number 1 in the list.

About Image Quality Performance

Hi, have a nice day! Thanks for your impressive work. Recently I was trying to reproduce your code. However, I have encountered a problem regarding image quality, i.e. PSNR and SSIM. The avg PSNR reported in the paper is 21.51 dB, but the log below is 20.38 dB. and the same problem is encountered with SSIM. I don't know why this is the case. I would be grateful if you could reply :D

evaling easy
[2022-09-05 14:25:13]	PSNR 22.13 | SSIM 0.8532	
save display images
sr_accuray_iter0: 65.60%
lr_accuray: 37.49%
hr_accuray: 76.41%
best_easy = 65.60%*
evaling medium
[2022-09-05 14:25:29]	PSNR 18.98 | SSIM 0.6449	
save display images
sr_accuray_iter0: 53.44%
lr_accuray: 21.40%
hr_accuray: 75.05%
best_medium = 54.43%
evaling hard
[2022-09-05 14:25:45]	PSNR 19.75 | SSIM 0.7125	
save display images
sr_accuray_iter0: 39.91%
lr_accuray: 21.15%
hr_accuray: 64.56%
best_hard = 39.91%*
saving best model

CODE

您好,通过您的文章,我对您的工作很感兴趣,请问大概什么时候公布代码呢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.