Comments (8)
Hi, if you are trying to process this document in the pero-ocr web application (pero-ocr.fit.vutbr.cz), you can use the Complex printed and handwritten layout model. For the baseline detector model, select Universal. For the OCR model you can definitely try Universal Transformer, but it will probably be quite slow. Otherwise, you can try Czech Fraktur Printed or German Fraktur Printed on this document, but without the language model (leave NONE selected). Of course, you can try different settings to see what works best for you.
If your question was not about the pero-ocr web application, then I would need more information to assist you.
from pero-ocr.
Thank you so much for your reply,
My question about the model when i run the test in my local repository, because I use the pero-ocr model that is shared.
With this script how can i select the Universal Transformer model please ??
from pero-ocr.
The Universal Transformer model is not publicly available. It is available only in the pero-ocr web application (pero-ocr.fit.vutbr.cz).
from pero-ocr.
is it possible to share it with me please??
from pero-ocr.
I'm sorry, but we will not share the model with you. If you want to process documents with our latest models, you can use the free web application pero-ocr (pero-ocr.fit.vutbr.cz). In case you need to process a really large amount of documents, we could discuss it and maybe the processing could be done via API, but that would probably be for a fee.
from pero-ocr.
Yes I will process a large amount of documents, for this reason I should have a solution for that.
Thanks for your response!
from pero-ocr.
OK, I would suggest continuing the conversation via email. Could you send me an email to [email protected] so we can discuss this further?
from pero-ocr.
Yes for sure, thank you so much.
from pero-ocr.
Related Issues (20)
- Where does model for region detector place? HOT 1
- training model HOT 2
- Layout analysis crashes HOT 1
- Page color does not switch to DONE if some lines were deleted. HOT 1
- Rename "Download Pages" to "Download transcriptions" HOT 1
- .
- .
- FIX: Switch WIDTH and HIGHT in ALTO export HOT 1
- Layout for one of the pages not displayed on preview
- layout detection not good on exercise books HOT 1
- Can't install through pip HOT 7
- PERO generates 0kB ALTO files HOT 6
- XML headers HOT 1
- OMR transformers produce nonsense transcriptions HOT 1
- Getting KeyError HOT 5
- Add region categories HOT 1
- Music pull request feedback
- problem of numpy version HOT 2
- Problem with the pretrained model not available HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pero-ocr.