Comments (6)
Needs to be specified ASAP in OCR-D/spec. Send PR for page-wise and line-wise rotation to PAGE-XML for upcoming 2019 version.
@chris1010010 @wrznr @bertsky @tboenig @cneud
Hi all, I might have mentioned in another thread, the full PAGE format collection has a dedicated XML format for this, as deskewing was seen as a pre-processing step that does not need to be reflected in the page content XML. But as it turned out, these other XML formats were never adopted much.
I guess adding the rotation doesn't hurt. Baselines would include the angles implicitly, but that's overkill in many cases I assume.
@wrznr @bertsky @tboenig @cneud
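To illustrate the point that baselines carry the angle implicitly, here is a minimal sketch that derives a line angle from a Baseline `points` string. The function name `baseline_angle` is hypothetical, and the sign convention (counter-clockwise positive, with image y growing downward) is an assumption, not something PAGE-XML is confirmed to prescribe in this thread.

```python
import math


def baseline_angle(points: str) -> float:
    """Return the angle in degrees of the line from the first to the last point.

    `points` uses the PAGE-style "x1,y1 x2,y2 ..." format.
    Assumption: counter-clockwise angles are positive, image y grows downward.
    """
    coords = [tuple(float(v) for v in pair.split(",")) for pair in points.split()]
    (x0, y0), (x1, y1) = coords[0], coords[-1]
    # Flip the y difference because image coordinates grow downward.
    return math.degrees(math.atan2(y0 - y1, x1 - x0))


if __name__ == "__main__":
    print(baseline_angle("0,100 50,50 100,0"))
```

Only the end points are used here, so a curved baseline would be reduced to its overall direction; an explicit rotation attribute avoids that kind of per-line computation.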
Hi @chris1010010, thanks for your quick feedback!
Are you referring to the 2009 subschema deskew? This functionality would be superseded completely if we had @orientation in PageType; we can already have the binary rotated image in AlternativeImage.
I would like to do the PR myself, but looking more closely, I have trouble interpreting the existing Region-level attributes with certainty: @orientation sounds more like referring to the Coords.points, whereas @readingOrientation to the actual image. Is that impression correct? If so, are they mutually exclusive?
BTW, we just have a discussion on AlternativeImage-based annotation, including rotation issues, here.
Hi @bertsky
Yes, I was referring to that subschema.
I always interpreted orientation and readingOrientation as additive.
Orientation is for the block/region in general. ReadingOrientation was intended for cases where the text is at an angle with reference to the block/region. We had examples somewhere, but only using orientation should be sufficient in most cases. In the current format I would probably use nested regions.
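The additive reading described above could be sketched as follows. The function names are hypothetical, and wrapping the sum into the interval (-180, 180] is an assumption made for illustration; the thread itself only states that the two values add up.

```python
def normalize_angle(angle: float) -> float:
    """Wrap an angle in degrees into the interval (-180, 180] (assumed range)."""
    wrapped = (angle + 180.0) % 360.0 - 180.0
    return 180.0 if wrapped == -180.0 else wrapped


def effective_text_angle(orientation: float = 0.0,
                         reading_orientation: float = 0.0) -> float:
    """Combine region orientation and reading orientation additively.

    `orientation` applies to the block/region as a whole;
    `reading_orientation` is the extra angle of the text relative to the region.
    """
    return normalize_angle(orientation + reading_orientation)


if __name__ == "__main__":
    # A region rotated by 90 degrees whose text is tilted a further 5 degrees.
    print(effective_text_angle(90.0, 5.0))
```

With only `orientation` set, the function returns that value unchanged (after wrapping), which matches the remark that using orientation alone should be sufficient in most cases.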
Oh, I see. Thanks for the clarification!
I mentioned the additive semantics in the new PR.
@tboenig can this be closed?