Comments (5)
You surely mean between some element's TextEquiv:Unicode
and its sub-component element's TextEquiv:Unicode
, as in:
- between
TextRegion
and itsTextLine
sequence - between
TextLine
and itsWord
sequence - between
Word
and itsGlyph
sequence
Should the consistency principle not be added to the spec in PAGE.md?
I sometimes use XSL transformations to concatenate sub-components (joining them by whitespace or newline, depending on position) – maybe this is a good starting point for such schematron. If you think those would in fact be useful, where can I put them?
(But then again, why not add a function to WorkspaceValidator.validate()
in core instead?)
from assets.
Should the consistency principle not be added to the spec in PAGE.md?
It should.
Why not add a function to WorkspaceValidator.validate() in core instead?
We could. Preferably define it in the spec first and then implement it referring to it.
from assets.
Shouldn't we at least start an issue on core to support PAGE-related consistency in WorkspaceValidator.validate()
before closing here? The problem is that we might have to actually look at the GT data in order to get the consistency principle right in the details. See this comment.
from assets.
Sure. Closed it because I thought OCR-D/spec#82 would be the fix, but that's just the spec not implementation.
from assets.
Implemented in OCR-D/core#223
from assets.
Related Issues (20)
- 1000pages: Non-annotated handwritten annotation on page 0005 of "hobrecht_polytechnikum_1878" HOT 3
- 1000pages: Inconsistent annotation of separators in "hobrecht_strassenbau_1890" HOT 1
- 1000pages: Incomplete annotation on page 0001 of "immermann_muenchhausen02_1839"" HOT 2
- 1000pages: Separators missing on page 0010 of "immermann_muenchhausen02_1839" HOT 1
- 1000pages: Inconsistent annotation of column separators in "krafft_landwirtschaft02_1876"" HOT 1
- 1000pages: Non-existent separator annotated on page 0018 of "krafft_landwirthschaft03_1876"" HOT 2
- 1000pages: Missing text on page 0003 and 0004 of "lenau_gedichte_1832" HOT 3
- Change the file name in DFKI test data HOT 2
- Most/All workspaces in bag files don't validate HOT 4
- Add references to OCR-D Ground Truth repo. HOT 1
- provide TableRegion/Grid examples HOT 6
- Repository not usable on case insensitive filesystems (like macOS and Windows) HOT 6
- Update scribo-tests with correct `k` parameters for sauvola-ms-fg HOT 1
- Add a METS with lots of files for testing HOT 9
- Lots of XSD validation errors HOT 2
- Self-contained make "update-bagit" target
- zip files broken links
- euler_rechenkunst01_1738 has wrong structLink
- OCR-D GT uses wrong mods:languageTerm/@authority
- wrong image references
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from assets.