Coder Social home page Coder Social logo

GPU questions about calamari HOT 8 CLOSED

calamari-ocr avatar calamari-ocr commented on June 30, 2024 1
GPU questions

from calamari.

Comments (8)

ChWick avatar ChWick commented on June 30, 2024
  • we never tested training on several GPUs, however, Tensorflow does support multi-GPU training, thus this might work out of the box. Possibly, a high batch size is required.
  • The charts in your link suggest using 16-bit floats for the new RTX cards. However, Calamari uses 32-bit floats. We never tested 16-bit floats for training, and the effect on the accuracies. Thus, I cannot say if the RTX2070 is significantly faster than the older 10XX generations.

from calamari.

 avatar commented on June 30, 2024

@ChWick

  • In-order to train using multi-gpu, what would be the training command?

  • Also, can Calamari handle 5 Terabytes of training data? ( Not joking )

  • What should my settings be when dealing with Terabytes of training data?

from calamari.

ChWick avatar ChWick commented on June 30, 2024
  • By default Tensorflow aquires all available Resources, therefore not additional commands should be required (check by nvidia-smi)
  • Yes and no. Currently, calamari loads all data into the RAM before training, and streams it into tensorflow. If you have 5 TB of RAM this should work (joking). So, no, it will not work. However, one could consider to move some logic to the data streaming pipeline to load/unload/preprocess the data on-the-fly. Another problem is, that calamari automatically computes the codec by loading all text files and computing the set of all characters. This will take a long time for 5 TB of data, which is why the codec should be given by hand. Unfortunately, both are larger changes.
  • Thus, you can currently only train on a small amount of data.

from calamari.

 avatar commented on June 30, 2024

@ChWick Why didn't you use Mixed-Precision? It:

from calamari.

 avatar commented on June 30, 2024

https://www.youtube.com/watch?v=i1fIBtdhjIg

from calamari.

 avatar commented on June 30, 2024

@ChWick
Will Calamari be able to synchronize the batch normalization among the 2 GPUs?

So, Calamari training speed would scale by a factor of 2x using 2 GPUs?
What would be the most important factor in Calamari training: Bandwidth, Flops, or number of GPUs?

from calamari.

 avatar commented on June 30, 2024

@ChWick Thanks man for the on-the-fly branch! 😎

from calamari.

 avatar commented on June 30, 2024

Note that you would need Tensor Cores to fully benefit from half-precision. Also note that the gtx 10x series are mostly optimized for fp32.

111

222

from calamari.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.