
gtziafas / hebrew-characters-hwr21


Code for Hebrew character and style recognition on the Dead Sea Scrolls dataset, written for the "Handwriting Recognition" course at RUG, 2021.

Python 100.00%

hebrew-characters-hwr21's People

Contributors

andreimiculita, anjalinair012, gtziafas, nielsrocholl


hebrew-characters-hwr21's Issues

Cropping boxes appropriately in sliding window

For each window (without discarding ascenders/descenders):

  • project the window horizontally and vertically
  • get the medians of the two projections
  • draw a box centered on the coordinates of the medians (with a size such as 75x75)
  • run the classifier on that box and take the max probability
  • append the x coordinate of the box's center point and the max probability to an array
  • identify the minima again, this time over the collected max probabilities
  • crop at the minima (either with straight lines or with A*)
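
The steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: it assumes a binarized line image with ink = 1, and `predict_fn` (any callable returning class probabilities for a box), `stride`, and all function names are hypothetical.

```python
import numpy as np

def median_of_projection(profile):
    """Index at which half of the total ink lies on either side."""
    total = profile.sum()
    if total == 0:
        return len(profile) // 2  # empty window: fall back to the geometric centre
    cumulative = np.cumsum(profile)
    return int(np.searchsorted(cumulative, total / 2.0))

def centred_box(window, size=75):
    """Crop a size x size box centred on the ink medians of a binary window."""
    cx = median_of_projection(window.sum(axis=0))  # column sums -> x median
    cy = median_of_projection(window.sum(axis=1))  # row sums -> y median
    half = size // 2
    # Zero-pad so the box never leaves the image; (cy, cx) moves to (cy+half, cx+half).
    padded = np.pad(window, half, mode="constant")
    return padded[cy:cy + size, cx:cx + size], cx

def scan_line(line_img, predict_fn, win_width=75, stride=10, size=75):
    """Slide a window over the line; collect centre x and max probability per window."""
    xs, probs = [], []
    last_start = max(1, line_img.shape[1] - win_width + 1)
    for x0 in range(0, last_start, stride):
        box, cx = centred_box(line_img[:, x0:x0 + win_width], size)
        probs.append(float(np.max(predict_fn(box))))  # hypothetical classifier call
        xs.append(x0 + cx)
    return np.array(xs), np.array(probs)

def local_minima(values):
    """Indices of interior local minima; candidate cut positions."""
    return [i for i in range(1, len(values) - 1)
            if values[i] < values[i - 1] and values[i] <= values[i + 1]]
```

The x coordinates at the indices returned by `local_minima(probs)` would then serve as starting points for the cuts, whether those are straight lines or A* paths.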

char-segm

  • [] Read some papers on character segmentation
  • [] Implement the naive contours + centre of gravity method
  • [] Build templates
  • [] Make smarter cutoffs using probability distributions from the templates

Grid search for best CNN hyperparameters

We need to experiment with the following hyperparameters (10-fold cross-validation):
dropout_rates (in cnn.py): [0.1, 0.1], [0.1, 0.2], [0.1, 0.25], [0.1, 0.33]
learning rates: 1e-04, 5e-04, 1e-03, 5e-03
weight decays: 0.001, 0.005, 0.01, 0.05, 0.1
batch sizes: 32, 64, 128

From the project root, run e.g. python3 -m qumran_seagulls.scripts.train_cnn_monkbrill -e 15 -early 2 -bs 64 -lr 0.001 -wd 0.01 -kfold 10
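
One way to enumerate the grid is to generate one command line per combination. The sketch below only builds the command strings, using the flags from the example command above; it assumes the script accepts every listed value in this form. The dropout rates are not included because, per the issue, they are set in cnn.py rather than on the command line, so the grid would be repeated once per dropout setting. `build_commands` is a hypothetical helper name.

```python
from itertools import product

LEARNING_RATES = [1e-4, 5e-4, 1e-3, 5e-3]
WEIGHT_DECAYS = [0.001, 0.005, 0.01, 0.05, 0.1]
BATCH_SIZES = [32, 64, 128]

def build_commands(epochs=15, early=2, kfold=10):
    """Return one training command per (lr, wd, bs) combination."""
    commands = []
    for lr, wd, bs in product(LEARNING_RATES, WEIGHT_DECAYS, BATCH_SIZES):
        commands.append(
            "python3 -m qumran_seagulls.scripts.train_cnn_monkbrill "
            f"-e {epochs} -early {early} -bs {bs} -lr {lr} -wd {wd} -kfold {kfold}"
        )
    return commands

commands = build_commands()
```

This gives 4 x 5 x 3 = 60 runs per dropout setting, so 240 runs of 10 folds each across the four dropout pairs.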

Implementing A*

From what I've seen, the main advantage of the Arivazhagan et al. method is that it can handle skewed documents. However, none of the scrolls are skewed. Since A* can also move diagonally, I think it is the better choice.
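
To illustrate the diagonal-movement point, here is a minimal A* sketch on a binary grid with 8-connected moves. It is not the project's implementation: it assumes ink pixels (1) are impassable and background pixels (0) are free, uses step costs of 1 and sqrt(2), and uses the octile distance as heuristic. All names are hypothetical.

```python
import heapq
import math

# 8-connected neighbourhood: straight and diagonal steps.
MOVES = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]

def octile(a, b):
    """Admissible distance estimate when diagonal steps cost sqrt(2)."""
    dr, dc = abs(a[0] - b[0]), abs(a[1] - b[1])
    return max(dr, dc) + (math.sqrt(2) - 1) * min(dr, dc)

def astar(grid, start, goal):
    """Return the cheapest path from start to goal as a list of (row, col), or None."""
    rows, cols = len(grid), len(grid[0])
    open_heap = [(octile(start, goal), 0.0, start)]
    best_cost = {start: 0.0}
    parent = {start: None}
    while open_heap:
        _, cost, node = heapq.heappop(open_heap)
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        if cost > best_cost[node]:
            continue  # stale heap entry; a cheaper route was found already
        for dr, dc in MOVES:
            nxt = (node[0] + dr, node[1] + dc)
            if not (0 <= nxt[0] < rows and 0 <= nxt[1] < cols):
                continue
            if grid[nxt[0]][nxt[1]]:
                continue  # assumption: ink is impassable
            new_cost = cost + (math.sqrt(2) if dr and dc else 1.0)
            if new_cost < best_cost.get(nxt, math.inf):
                best_cost[nxt] = new_cost
                parent[nxt] = node
                heapq.heappush(open_heap, (new_cost + octile(nxt, goal), new_cost, nxt))
    return None

# A single ink pixel in the middle row forces the path to step around it diagonally.
grid = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
path = astar(grid, (1, 0), (1, 4))
```

A real segmentation path would probably want a soft cost for crossing ink instead of a hard block, so that touching ascenders and descenders do not make the search fail.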


Data augmentation for the Habbakuk font

Morph.cy currently performs three different augmentations:

  1. affine
  2. perspective
  3. elastic

The folder "augmented_font_samples" contains some augmented examples. The original samples are located in the folder "dump".
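
As a rough illustration of the first augmentation type only, the sketch below applies a random affine transform (rotation, shear, uniform scale) to a character image with plain NumPy and nearest-neighbour sampling. It is not the repository's implementation; the parameter ranges and function names are assumptions, and the perspective and elastic variants are not covered.

```python
import numpy as np

def random_affine_matrix(rng, max_rotation_deg=10.0, max_shear=0.2, scale_range=(0.9, 1.1)):
    """Sample a 2x2 matrix combining a small rotation, shear, and uniform scale."""
    theta = np.deg2rad(rng.uniform(-max_rotation_deg, max_rotation_deg))
    shear = rng.uniform(-max_shear, max_shear)
    scale = rng.uniform(*scale_range)
    rotation = np.array([[np.cos(theta), -np.sin(theta)],
                         [np.sin(theta), np.cos(theta)]])
    shear_m = np.array([[1.0, shear], [0.0, 1.0]])
    return scale * rotation @ shear_m

def warp_affine(image, matrix, fill=0):
    """Warp a 2D image about its centre; matrix acts on (row, col) offsets."""
    h, w = image.shape
    centre = np.array([(h - 1) / 2.0, (w - 1) / 2.0])
    rows, cols = np.mgrid[0:h, 0:w]
    out_coords = np.stack([rows.ravel(), cols.ravel()], axis=1) - centre
    # Inverse mapping: for every output pixel, look up the source pixel it came from.
    src = np.rint(out_coords @ np.linalg.inv(matrix).T + centre).astype(int)
    valid = (src[:, 0] >= 0) & (src[:, 0] < h) & (src[:, 1] >= 0) & (src[:, 1] < w)
    out = np.full(h * w, fill, dtype=image.dtype)
    out[valid] = image[src[valid, 0], src[valid, 1]]
    return out.reshape(h, w)

rng = np.random.default_rng(0)
sample = np.zeros((21, 21), dtype=np.uint8)
sample[6:15, 8:13] = 1  # stand-in for a rendered character
augmented = warp_affine(sample, random_affine_matrix(rng))
```

Inverse mapping is used so that every output pixel receives a value, which avoids the holes a forward mapping would leave.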
