chinese_ocr

YOLOv3 + DenseNet + CTC OCR

setup

see setup

download model

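model performance

The provided model is only suitable for learning purposes. It is the best checkpoint saved after training for many epochs on the generated dataset with the current code, but it is not good enough for commercial use. You can train a better model yourself with this code; Prof. Bai Xiang's CRNN is also a good option to refer to.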

test

python demo.py

you can also see understand_detect

result

train

cd train

python train.py, or use train_with_param to train on a different dataset

dataset format

 ---dataset
    --images
        --xxx.jpg
    --data_train.txt
    --data_test.txt
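
Before training, it can help to confirm that a dataset folder matches the layout above. The following is a minimal sketch, not part of the repository: `check_dataset_layout` is a hypothetical helper, and it only checks the names shown in the tree (`images`, `data_train.txt`, `data_test.txt`). It makes no assumption about the contents of the label files.

```python
from pathlib import Path

# File names taken from the layout above; the helper itself is hypothetical.
REQUIRED_FILES = ("data_train.txt", "data_test.txt")


def check_dataset_layout(root):
    """Return a list of problems found under `root` (empty list means OK)."""
    root = Path(root)
    problems = []
    images_dir = root / "images"
    if not images_dir.is_dir():
        problems.append("missing directory: images")
    elif not any(images_dir.glob("*.jpg")):
        problems.append("no .jpg files in images")
    for name in REQUIRED_FILES:
        if not (root / name).is_file():
            problems.append(f"missing label file: {name}")
    return problems
```

Calling `check_dataset_layout("dataset")` returns an empty list when the folder is complete, or one message per missing item otherwise.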

dataset

This dataset is generated by code.

link:https://pan.baidu.com/s/1JgS1gSRcfnjWF_epU-E2vA password:wigu

The dataset contains 800,000 pictures: 300,000 from Chinese novels, 100,000 from random numbers (0-9), 100,000 from random codes, and 300,000 randomly selected by frequency.
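
One plausible reading of "selected by frequency" is that characters are drawn with probability proportional to how often they occur in some corpus. The sketch below illustrates that reading only; it is not the repository's generator. The `char_freq` table and the `sample_text` function are hypothetical names with made-up values.

```python
import random

# Hypothetical frequency table: character -> relative frequency in a corpus.
char_freq = {"的": 0.05, "一": 0.02, "是": 0.015, "不": 0.012}


def sample_text(freq, length, rng=random):
    """Draw `length` characters, each with probability proportional to its frequency."""
    chars = list(freq)
    weights = [freq[c] for c in chars]
    return "".join(rng.choices(chars, weights=weights, k=length))


rng = random.Random(0)  # fixed seed so repeated runs give the same text
print(sample_text(char_freq, 10, rng))
```

Each sampled string could then be rendered to an image to form one training sample.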

  • Random char space
  • Random font size
  • 10 different fonts
  • Blur
  • Noise (gauss, uniform, salt_pepper, poisson)
  • ...
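
To make the blur and noise variations above concrete, here is a small dependency-free sketch operating on a grayscale image stored as a list of rows of 0-255 integers. It is an illustration under stated assumptions, not the code in train_with_param: all function names and default parameters are hypothetical, a real pipeline would more likely use an array library, and Poisson noise is left out because the Python standard library has no Poisson sampler.

```python
import random


def clip(value):
    """Round a pixel value and clamp it to the 0-255 range."""
    return max(0, min(255, int(round(value))))


def add_gauss_noise(img, sigma=10.0, rng=random):
    """Add zero-mean Gaussian noise with standard deviation `sigma` to each pixel."""
    return [[clip(p + rng.gauss(0.0, sigma)) for p in row] for row in img]


def add_uniform_noise(img, amplitude=10.0, rng=random):
    """Add noise drawn uniformly from [-amplitude, amplitude] to each pixel."""
    return [[clip(p + rng.uniform(-amplitude, amplitude)) for p in row] for row in img]


def add_salt_pepper_noise(img, amount=0.05, rng=random):
    """Set roughly a fraction `amount` of pixels to pure white (salt) or black (pepper)."""
    out = []
    for row in img:
        new_row = []
        for p in row:
            if rng.random() < amount:
                new_row.append(255 if rng.random() < 0.5 else 0)
            else:
                new_row.append(p)
        out.append(new_row)
    return out


def box_blur(img):
    """3x3 mean blur; border pixels average over the neighbours that exist."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [img[j][i]
                      for j in range(max(0, y - 1), min(h, y + 2))
                      for i in range(max(0, x - 1), min(w, x + 2))]
            row.append(clip(sum(window) / len(window)))
        out.append(row)
    return out
```

The functions compose, so a sample can be blurred and then corrupted, for example `add_salt_pepper_noise(box_blur(img))`. Passing a seeded `random.Random` instance as `rng` makes the result repeatable.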

For more detail, see train_with_param.

Or you can train on YCG09's dataset:

url:https://pan.baidu.com/s/1QkI7kjah8SPHwOQ40rS1Pw (passwd:lu7m)

Put your dataset into train/images and update the label files data_test.txt and data_train.txt.

generate your own dataset

or you can generate your own dataset:

update

  1. use a pretrained model to detect words

    • add demo √
    • add densenet training code √
    • test gpu nms √
    • generate my own dataset √
  2. add a framework for easy training on your own dataset

    • add yolo3 train code
    • make the code easy to use on other datasets

Reference

https://github.com/chineseocr/chineseocr

https://github.com/YCG09/chinese_ocr

Contributors

bing1zhi2
