Coder Social home page Coder Social logo

penny4860 / yolo-digit-detector Goto Github PK

View Code? Open in Web Editor NEW
275.0 18.0 115.0 179.3 MB

Implemented digit detector in natural scene using resnet50 and Yolo-v2. I used SVHN as the training set, and implemented it using tensorflow and keras.

License: MIT License

Python 18.84% Jupyter Notebook 81.16%
yolov2 keras svhn-dataset digit-detector

yolo-digit-detector's Introduction

Build Status codecov

SVHN yolo-v2 digit detector

I have implemented a digit detector that applies yolo-v2 to svhn dataset.

Usage for python code

0. Requirement

  • python 3.6
  • tensorflow 1.14.0
  • keras 2.1.1
  • opencv 3.3.0
  • Etc.

I recommend that you create and use an anaconda env that is independent of your project. You can create anaconda env for this project by following these simple steps. This process has been verified on Windows 10 and ubuntu 16.04.

$ conda create -n yolo python=3.6
$ activate yolo # in linux "source activate yolo"
(yolo) $ pip install -r requirements.txt
(yolo) $ pip install -e .

1. Digit Detection using pretrained weight file

In this project, the pretrained weight file is stored in weights.h5.

  • Example code for predicting a digit region in a natural image is described in detection_example.ipynb.
  • Training set evaluation (1000-images) is as follows:
    • fscore / precision / recall: 0.799, 0.791, 0.807

2. Training from scratch

This project provides a way to train digit detector from scratch. If you follow the command below, you can build a digit detector with just two images.

  • First, train all layers through the following command.
    • project/root> python train.py -c configs/from_scratch.json
  • Next, fine tune only the last layer through the following command.
    • project/root> python train.py -c configs/from_scratch2.json
  • Finally, evaluate trained digit detector.
    • project/root> python evaluate.py -c configs/from_scratch.json -w svhn/weights.h5
    • The evaluation results are output in the following manner.
      • {'fscore': 1.0, 'precision': 1.0, 'recall': 1.0}
    • The prediction result images are saved in the project/detected directory.

Now you can add more images to train a digit detector with good generalization performance.

3. SVHN dataset in Pascal Voc annotation format

In this project, I use pascal voc format as annotation information to train object detector. An annotation file of this format can be downloaded from svhn-voc-annotation-format.

Other Results

  • pretrained weight file is stored at raccoon
  • training set evaluation (160-images)
    • fscore / precision / recall: 0.937, 0.963, 0.913
  • test set evaluation (40-images)
    • fscore / precision / recall: 0.631, 0.75, 0.545

Copyright

  • See LICENSE for details.
  • This project started at basic-yolo-keras. I refactored the source code structure of basic-yolo-keras and added the CI test. I also applied the SVHN dataset to implement the digit detector. Thanks to the Huynh Ngoc Anh for providing a good project as open source.

See Also

If you are interested in advanced digit detection algorithm, please refer to the following projects. The following projects can be more stable training than the algorithm of this project.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.