Coder Social home page Coder Social logo

Uyghur Language about easyocr HOT 8 CLOSED

Abdusalamstd avatar Abdusalamstd commented on May 18, 2024
Uyghur Language

from easyocr.

Comments (8)

rkcosmos avatar rkcosmos commented on May 18, 2024 1

According to Wikipedia, it seems like you have 4 set of alphabets.

  1. Uyghur Arabic alphabet or UEY
  2. Uyghur Cyrillic alphabet or USY
  3. The Uyghur New Script or UYY
  4. Uyghur Latin alphabet or ULY

We currently have Latin model. Arabic and Cyrillic are on the way. If Uyghur use all 4 set of alphabet above, then it's not gonna be easy. You can create a pull request to add all characters and words (see #25 ), but I cannot promise to do it in the near future because my priority will have to go to popular language or set of languages that share most of characters together.

from easyocr.

Abdusalamstd avatar Abdusalamstd commented on May 18, 2024 1

According to Wikipedia, it seems like you have 4 set of alphabets.

  1. Uyghur Arabic alphabet or UEY
  2. Uyghur Cyrillic alphabet or USY
  3. The Uyghur New Script or UYY
  4. Uyghur Latin alphabet or ULY

We currently have Latin model. Arabic and Cyrillic are on the way. If Uyghur use all 4 set of alphabet above, then it's not gonna be easy. You can create a pull request to add all characters and words (see #25 ), but I cannot promise to do it in the near future because my priority will have to go to popular language or set of languages that share most of characters together.

Thanks for your reply!
Of the above four model, first model(Uyghur Arabic alphabet or UEY) is the most widely used. So just add first model(Uyghur Arabic alphabet or UEY) . I have finished the alphabet file "ug_char.txt", and now preparing the 'dict/ug.txt' common Uyghur words file.

from easyocr.

Abdusalamstd avatar Abdusalamstd commented on May 18, 2024 1

"ug_char.txt", and now preparing the 'dict/ug.tx

Hello, If it's not too much to ask, could you please share with me the "ug_char.txt", and "dict/ug.tx" files you prepared for Uygur, so that I can use them for my language?

You can download it from this EasyOCR project repository.

from easyocr.

arulrajnet avatar arulrajnet commented on May 18, 2024

Already explained here

#25 (comment)

from easyocr.

rkcosmos avatar rkcosmos commented on May 18, 2024

I'm not a language expert so you have to help me understand Uyghur language a bit. From first look, it looks like Arabic language. Let me ask a few question.

  1. Is Uyghur using the same script as Arabic? Or are there additional script?
  2. I know that in Arabic there is a specific pattern when you write character next to each other to create a word. Is Uyghur using the same pattern?

from easyocr.

Abdusalamstd avatar Abdusalamstd commented on May 18, 2024

I'm not a language expert so you have to help me understand Uyghur language a bit. From first look, it looks like Arabic language. Let me ask a few question.

  1. Is Uyghur using the same script as Arabic? Or are there additional script?
  2. I know that in Arabic there is a specific pattern when you write character next to each other to create a word. Is Uyghur using the same pattern?

Reply: 1.There additional script in Uyghur,Not exactly the same.
2.Yes,Uyghur using the same pattern.

from easyocr.

rkcosmos avatar rkcosmos commented on May 18, 2024

please make sure 'dict/ug.txt' has enough words (other languages has ~30000).

from easyocr.

hilaloytun avatar hilaloytun commented on May 18, 2024

"ug_char.txt", and now preparing the 'dict/ug.tx

Hello, If it's not too much to ask, could you please share with me the "ug_char.txt", and "dict/ug.tx" files you prepared for Uygur, so that I can use them for my language?

from easyocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.