Comments (8)
- we never tested training on several GPUs, however, Tensorflow does support multi-GPU training, thus this might work out of the box. Possibly, a high batch size is required.
- The charts in your link suggest using 16-bit floats for the new RTX cards. However, Calamari uses 32-bit floats. We never tested 16-bit floats for training, and the effect on the accuracies. Thus, I cannot say if the RTX2070 is significantly faster than the older 10XX generations.
from calamari.
-
In-order to train using multi-gpu, what would be the training command?
-
Also, can Calamari handle 5 Terabytes of training data? ( Not joking )
-
What should my settings be when dealing with Terabytes of training data?
from calamari.
- By default Tensorflow aquires all available Resources, therefore not additional commands should be required (check by nvidia-smi)
- Yes and no. Currently, calamari loads all data into the RAM before training, and streams it into tensorflow. If you have 5 TB of RAM this should work (joking). So, no, it will not work. However, one could consider to move some logic to the data streaming pipeline to load/unload/preprocess the data on-the-fly. Another problem is, that calamari automatically computes the codec by loading all text files and computing the set of all characters. This will take a long time for 5 TB of data, which is why the codec should be given by hand. Unfortunately, both are larger changes.
- Thus, you can currently only train on a small amount of data.
from calamari.
@ChWick Why didn't you use Mixed-Precision? It:
- Halves the memory requirements.
- Speed-up the training substantially.
- No loss in model accuracy.
- No need to modify hyperparameters.
https://devblogs.nvidia.com/mixed-precision-training-deep-neural-networks/
https://arxiv.org/abs/1710.03740
from calamari.
https://www.youtube.com/watch?v=i1fIBtdhjIg
from calamari.
@ChWick
Will Calamari be able to synchronize the batch normalization among the 2 GPUs?
So, Calamari training speed would scale by a factor of 2x using 2 GPUs?
What would be the most important factor in Calamari training: Bandwidth, Flops, or number of GPUs?
from calamari.
@ChWick Thanks man for the on-the-fly
branch!
from calamari.
Note that you would need Tensor Cores to fully benefit from half-precision. Also note that the gtx 10x series are mostly optimized for fp32.
- NVIDIA/apex#76
- https://en.wikipedia.org/wiki/List_of_Nvidia_graphics_processing_units#GeForce_20_series
from calamari.
Related Issues (20)
- Cannot convert a symbolic Tensor - Cannot even initialize the Predictor object HOT 2
- Characters coordinates HOT 1
- training: Cannot convert a symbolic Tensor to a numpy array HOT 7
- HDF5 dataset format: how to convert HOT 4
- calamari-train: warmstart not working without also giving network spec
- featreq: when warmstart-training, init weights of new chars from existing ones HOT 2
- calamari-eval: skip missing pairs HOT 3
- calamari-eval: unknown arguments HOT 6
- calamari-eval: confusion table miscalculates relative frequency HOT 3
- Error when convert old trained model to latest version model HOT 1
- Got exception during training HOT 4
- calamari-ocr 2.2.2 on ubuntu 22.04 partial success, difficulty with GPU software
- Prediction from calamari trained .pb model HOT 5
- Issue while using the model and json HOT 8
- setup.py on Ubuntu20.04: tensorflow is wrong version HOT 7
- Model very sensitive on PNG input HOT 3
- calamari/1.0: hold Tensorflow and Protobuf dependencies HOT 6
- What is the accuracy on Chinese/Japanese text? HOT 2
- Attention layer
- "No training configuration" for code that should not have one HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from calamari.