Slow loading time (cudnn.torch issue, open)

Atcold commented on June 10, 2024
Slow loading time

from cudnn.torch.

Comments (8)

Atcold commented on June 10, 2024

Hmm, the other servers here take 10 to 15 seconds, and the one above takes 40 to 45 seconds.
How can I debug this?

clement-masson commented on June 10, 2024

'require cudnn' initializes some state on every visible GPU. If you're on a machine with many GPUs, that may be the cause of the long loading time.

We've got a machine with 4 GPUs. Setting CUDA_VISIBLE_DEVICES=0 (for instance) reduces the loading time by almost a factor of 4. On our machine, it takes under 10 seconds even with all GPUs visible, though...
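
To check the CUDA_VISIBLE_DEVICES effect on a given machine, a rough timing sketch in Python might help. This is not from the thread: `time_command` and `compare_device_sets` are hypothetical helper names, and the `th -e "require 'cudnn'"` invocation in the usage comment assumes the Torch7 `th` launcher is on PATH.

```python
import os
import subprocess
import time


def time_command(cmd, visible_devices=None):
    """Run cmd once and return (wall-clock seconds, exit code).

    If visible_devices is given, it is exported as CUDA_VISIBLE_DEVICES
    for the child process only; the parent environment is left untouched.
    """
    env = os.environ.copy()
    if visible_devices is not None:
        env["CUDA_VISIBLE_DEVICES"] = visible_devices
    start = time.perf_counter()
    result = subprocess.run(
        cmd, env=env, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    return time.perf_counter() - start, result.returncode


def compare_device_sets(cmd, device_sets=(None, "0")):
    """Time cmd once per CUDA_VISIBLE_DEVICES setting (None = inherit)."""
    return {devices: time_command(cmd, devices)[0] for devices in device_sets}


# Hypothetical usage, assuming Torch7's `th` launcher is installed:
#   timings = compare_device_sets(["th", "-e", "require 'cudnn'"])
#   for devices, seconds in timings.items():
#       print(devices, round(seconds, 1))
```

A single run per setting is noisy, since the first load after a reboot can be slower than later ones, so it is worth repeating the measurement a few times before drawing conclusions.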

Atcold commented on June 10, 2024

@clement-masson, right. I just saw that. Still, I believe something must be wrong. I've contacted IT (I don't have sudo here...).

ajhool commented on June 10, 2024

I'm finding that require cudnn on a Volta GPU takes 10 minutes. @clement-masson, any idea how I can profile the require function to see what exactly is taking so long with the Volta architecture?

ajhool commented on June 10, 2024

@nagadomi, I'm using your distro with CUDA 9/10 support. Any ideas why the bindings might be struggling with the Volta architecture?

nagadomi commented on June 10, 2024

@ajhool
If you are using Docker, it may be caused by JIT Caching.
See nagadomi/waifu2x#138,
https://github.com/nagadomi/waifu2x/pull/138/files#diff-04c6e90faac2675aa89e2176d2eec7d8
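
One reading of "JIT Caching" here is the CUDA driver's JIT compilation cache: when a binary ships no native code for the GPU's architecture, the driver compiles PTX at load time and stores the result on disk, and a container with an ephemeral filesystem loses that cache on every start. Whether the linked PR does exactly this is an assumption. A minimal Python sketch of configuring the cache through the documented CUDA_CACHE_PATH, CUDA_CACHE_MAXSIZE, and CUDA_CACHE_DISABLE environment variables follows; the helper name `cuda_jit_cache_env` and the default size are illustrative, not taken from the thread.

```python
import os
from pathlib import Path


def cuda_jit_cache_env(cache_dir, max_bytes=1 << 30, base_env=None):
    """Return a copy of the environment with the CUDA JIT cache configured.

    cache_dir  directory for the compute cache (created if missing); in
               Docker it would need to live on a mounted volume to persist
    max_bytes  cache size limit in bytes (1 GiB here, an arbitrary choice)
    base_env   environment to start from; defaults to os.environ
    """
    env = dict(os.environ if base_env is None else base_env)
    cache_path = Path(cache_dir).expanduser()
    cache_path.mkdir(parents=True, exist_ok=True)
    env["CUDA_CACHE_PATH"] = str(cache_path)
    env["CUDA_CACHE_MAXSIZE"] = str(max_bytes)
    env["CUDA_CACHE_DISABLE"] = "0"  # make sure caching is not switched off
    return env
```

The returned mapping can be passed as `env=` to `subprocess.run` when launching the Torch process. If the cache is working, the first launch would still be slow while later launches should be much faster; if every launch stays slow, the cache directory is probably not being written or not persisting.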

ajhool commented on June 10, 2024

I am using Docker and I'll give that a shot, thanks!

ajhool commented on June 10, 2024

So far, the JIT Caching fix does not appear to be working, although I'm having a hard time debugging Torch/Lua without a debug environment or print statements. I believe I have the cache and cache path configured correctly and the load time is still about 10 minutes.

The fact that the code executes quickly on K80s but takes so much longer on Voltas makes me suspect there's more to it than just LuaJIT. Will continue to try and get to the bottom of this.
