Coder Social home page Coder Social logo

zhaominyiz / rfda-pytorch Goto Github PK

View Code? Open in Web Editor NEW
59.0 2.0 6.0 14.54 MB

Official Code for 'Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction' - ACM Multimedia2021 (ACMMM2021) Accepted Paper Task: Video Quality Enhancement / Video Compression Artifact Reduction

License: Apache License 2.0

Python 73.01% Shell 0.11% C++ 11.00% Cuda 15.89%
acmmm2021 deeplearning video-enhancement video-restoration video-quality video-processing

rfda-pytorch's Issues

关于消融实验的问题

您好,非常感谢您的工作。我在看您的论文的消融实验时我有些疑惑,为什么只加RF模块,指标甚至比不加的效果还要差,这是什么原因?只加dsta,指标好像已经很高了,即便又多加了RF和two-stage,psnr只有0.02的提升。如果用RF和DSTA,是不是two-stage就没那么重要了,我看表中并没有列出RF和DSTA的组合结果,期待您的解答,非常感谢🙏
1CDD3762-11EE-4843-8EE7-A1FBB4985FF4

How to achieve the two-stage training strategy?

In your paper, you train the RFDA using the two-stage training strategy, but we can not find the two stage training files in the release code. So we are very confused that how to achieve the two-stage training strategy. Could you provide the training strategy files in detail?

Thank you very much!

bash build.sh is not workinh

I am trying to run the code by following the instructions
error: '/usr/local/cuda/bin/nvcc' failed: No such file or directory: '/usr/local/cuda/bin/nvcc'

Please help me

TRAINING CODE

hello,I'm interested in your great work,could you release the training code so that i can reproduce the result for my thesis. THX!

关于率失真曲线的问题

您好,我不太懂视频压缩。对您论文里的率失真曲线不太懂。我看您是取了测试集里的十八个视频中的一个,但是您的这个比特率是怎么算出来的?是根据不同的qp设置,得到的几个值吗?如果是这样,那您用的是1s多少帧,这个是不是涉及到和传输相关的码率控制的问题?即便是固定qp,比特率也会受别的影响吧,不同的图片编码出来的比特数也不一致吧?期待您的回复,万分感谢!🙏

关于brf的问题

您好,非常感谢您的工作!您的brf版本我看到又有改进,但是代码是不是还没开源?您的这个改进版本工作是不是准备发论文的?期待您的回复,非常感谢!

Instructions for Training

Hello,

Thank you for releasing the training code. Could you please provide the instructions for training?

I tried to use the instructions for training STDF - python -m torch.distributed.launch train_rf_ft.py --opt_path config/Final_QP37.yml and ran into an error with the STDF module's in_conv layer:
Expected 4-dimensional input for 4-dimensional weight [32, 7, 3, 3], but got 5-dimensional input of size [8, 15, 1, 96, 96] instead
Could you please tell me what radius should be specified in the config file for training? The currently released config file specifies a radius of 7 but from the error log, it seems the training code expects R=3.

Additionally, your paper mentions two-stage training where you train the STDF with radius 3 and then the full network with 15 frame clips. Does your training code support doing that?

Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.