Comments (4)
The errormsg is alread fixed. I will implemented the optional image param soon.
from ocrd_pagetopdf.
Sure it is. I will think about how to implement it, because i dont want to lose the option to add processed images e.g. binarized version instead of the original images.
from ocrd_pagetopdf.
In the OCR-D functional model, all PAGE annotations will always refer to the original image. Derived images are under AlternativeImage
only.
You could look at /PcGts/Page/AlternativeImage/@filename
for binarized/dewarped/deskewed etc images. But you have to make sure to re-calculate all coordinates then: any segment's @points
always refer to the original image under /Page/@imageFilename
in PAGE, but AlternativeImage
can be cropped (consistent with Border
), deskewed (consistent with @orientation
) or even dewarped (without information).
from ocrd_pagetopdf.
So maybe you can at least make the second input file group for images optional (and default to @imageFilename
), also avoiding the above strange error message when missing?
from ocrd_pagetopdf.
Related Issues (11)
- Add as transform script to ocr-fileformat? HOT 4
- throw error if input-filegrp doesn't exist HOT 2
- Installation fails on Debian 10 HOT 10
- workaround for pagetopdf.jar exceptions HOT 1
- Usage example for converting page xml to searchable pdf? HOT 1
- does not work on two input fileGrps anymore HOT 5
- run without showing commands executed on stdout HOT 1
- itextpdf installation does not work HOT 16
- allow creating multi-page PDFs HOT 11
- Add license HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ocrd_pagetopdf.