Tigrigna Resources (langdata issue, closed)

tesseract-ocr commented on September 2, 2024
Tigrigna Resources

from langdata.

Comments (3)

zmeharen commented on September 2, 2024

Hi Shreeshrii,

I have been working on improving Tesseract's recognition of the Tigrinya language (ti or tir) in PDFs and other image files. I have noticed a few errors and have spent about a week training the software. Thank you for posting the link; it has helped me find the resources I need. If you need help with this language, let me know. I might need some guidance or direction, but I will do what I can to help.

Thank you,

Z


Shreeshrii commented on September 2, 2024


Shreeshrii commented on September 2, 2024

tesseract-ocr/tesseract#654 (comment)

@theraysmith commented 2 days ago
Update: after going back to the web to get fresh data, I believe that my corpus text is now good for:
chr
dzo
iku
snd
syr
tgk
tir
I have put a lot of time into cleaners/filters for languages that use 'virama' characters.
I am not convinced that they are perfect, but I will add the code to the github repo in due course, so experts/native speakers can offer suggestions/fixes to make them better.

