Coder Social home page Coder Social logo

no deskewing/orientation in GT about assets HOT 6 OPEN

ocr-d avatar ocr-d commented on July 24, 2024
no deskewing/orientation in GT

from assets.

Comments (6)

kba avatar kba commented on July 24, 2024

Needs to be specified ASAP in OCR-D/spec. Send PR for page-wise and line-wise rotation to PAGE-XML for upcoming 2019 version.

@chris1010010 @wrznr @bertsky @tboenig @cneud

from assets.

chris1010010 avatar chris1010010 commented on July 24, 2024

Hi all, I might have mentioned in another thread, the full PAGE format collection has a dedicated XML format for this, as deskewing was seen as a pre-processing step that does not need to be reflected in the page content XML. But as it turned out, these other XML formats were never adopted much.
I guess adding the rotation doesn't hurt. Baselines would include the angles implicitly, but that's overkill in many cases I assume.
@wrznr @bertsky @tboenig @cneud

from assets.

bertsky avatar bertsky commented on July 24, 2024

Hi @chris1010010, thanks for your quick feedback!

Are you referring to the 2009 subschema deskew? This functionality would be superseeded completely if we had @orientation in PageType – we can already have the binary rotated image in AlternativeImage.

I would like to do the PR myself, but looking more closely, I have trouble interpreting the existing Region level attributes with certainty: @orientation sounds more like referring to the Coords.points, whereas @readingOrientation to the actual image. Is that impression correct? If so, are they mutually exclusive?

BTW, we just have a discussion on AlternativeImage based annotation including rotation issues here.

from assets.

chris1010010 avatar chris1010010 commented on July 24, 2024

Hi @bertsky
Yes, I was referring to that subschema.
I always interpreted orientation and readingOrientation as additive.
Orientation is for the block/region in general. ReadingOrientation was intended for cases where the text is at an angle with reference to the block/region. We had examples somewhere, but only using orientation should be sufficient in most cases. In the current format I would probably use nested regions.

from assets.

bertsky avatar bertsky commented on July 24, 2024

Oh, I see. Thanks for the clarification!

I mentioned the additive semantics in the new PR.

from assets.

EEngl52 avatar EEngl52 commented on July 24, 2024

@tboenig can this be closed?

from assets.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.