Coder Social home page Coder Social logo

Comments (7)

passerer avatar passerer commented on July 18, 2024

Thank you for your interest. Due to the inclusion of attention modules in HPINet, the training process requires much computational resources and GPU memory. As mentioned in the paper and README, we had to use 8 V100 GPUs with 32GB of memory each to meet the training requirements. I'm unsure if using 4 RTX 3090 GPUs would be sufficient to achieve the desired batch size and patch size as specified in train.py. Could you please provide the reproduction results on BSD100, Urban100, and Manga109?

from hpinet.

wenyang001 avatar wenyang001 commented on July 18, 2024

The configuration remains the same in the train.py with the min_batch_size=8 and max_batch_size=64; min_patch_size = 192 and max_patch_size = 720.

The reproduction results are attached. Thanks.

image

from hpinet.

wenyang001 avatar wenyang001 commented on July 18, 2024

The reproduction results at the 420 epoch are attached.

image

from hpinet.

passerer avatar passerer commented on July 18, 2024

I have tried training with four 24G GPUs like you did, but I encounter the memory exceed issue. For better results, the training utilizes a patch size that increases with each epoch, leading to a gradual increase in GPU memory usage until it exceeds the capacity of 24G. Therefore, I have no idea that why four 24G GPUs worked perfectly for you. If you're interested, we recommend starting with HPINet-S, as it requires less training time and consumes less memory.

We are planning to release a new training framework in the coming weeks that addresses the issue of excessive GPU memory usage. We will create a new branch and notify you once it's available.

from hpinet.

passerer avatar passerer commented on July 18, 2024

Here is another code. It can be accommodated by four GPUs.

from hpinet.

passerer avatar passerer commented on July 18, 2024

If there are further questions, feel free to reopen this issue at any time.

from hpinet.

wenyang001 avatar wenyang001 commented on July 18, 2024

Thank you for your reply. It helps me a lot.

from hpinet.

Related Issues (5)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.