This project forked from chenjun2hao/fots.pytorch

FOTS.pytorch

This is an unofficial implementation of FOTS: Fast Oriented Text Spotting with a Unified Network, a unified end-to-end trainable network for simultaneous text detection and recognition that shares computation and visual information between the two complementary tasks. The code borrows mainly from E2E-MLT, an end-to-end trainable text localization and recognition network.

Requirements

Compile extension file

  • RoIRotate: I've implemented the paper's RoIRotate layer as a PyTorch layer with automatic differentiation. To compile it:
# optional
source activate conda_env
cd $project_path/rroi_align
sh make.sh          # compile

Test

First download the pretrained model, which was trained on ICDAR2015, from Baidu (password: ndav). Put the model in the weights folder; you can then test on some ICDAR2015 test samples:

cd $project_path
python test.py

Some examples:

[Figures 1 to 6]

RoIRotate

RoIRotate applies a transformation to oriented feature regions to obtain axis-aligned feature maps, using bilinear interpolation to compute the output values.
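
To make the idea concrete, here is a minimal pure-Python sketch of sampling an axis-aligned patch from a rotated box with bilinear interpolation. It is not the compiled rroi_align extension from this repository; the function names (`bilinear_sample`, `roi_rotate`), the box parameterisation (centre, size, angle in radians) and the choice to return zero outside the feature map are all assumptions made for illustration.

```python
import math


def bilinear_sample(feat, x, y):
    """Bilinearly interpolate the 2D list `feat` at the real-valued point (x, y)."""
    h, w = len(feat), len(feat[0])
    if x < 0 or y < 0 or x > w - 1 or y > h - 1:
        return 0.0  # assumption: points outside the map contribute zero
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    dx, dy = x - x0, y - y0
    top = feat[y0][x0] * (1 - dx) + feat[y0][x1] * dx
    bottom = feat[y1][x0] * (1 - dx) + feat[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def roi_rotate(feat, cx, cy, box_w, box_h, angle, out_h, out_w):
    """Sample an axis-aligned out_h x out_w patch from a rotated box.

    (cx, cy) is the box centre, box_w and box_h its size, and angle its
    rotation in radians, all in feature-map coordinates (hypothetical
    parameterisation, chosen for this sketch).
    """
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    out = []
    for i in range(out_h):
        # offset along the box height, measured from the box centre
        v = ((i + 0.5) / out_h - 0.5) * box_h
        row = []
        for j in range(out_w):
            # offset along the box width, measured from the box centre
            u = ((j + 0.5) / out_w - 0.5) * box_w
            # rotate the offset and move it to the box centre
            x = cx + u * cos_a - v * sin_a
            y = cy + u * sin_a + v * cos_a
            row.append(bilinear_sample(feat, x, y))
        out.append(row)
    return out
```

Each output cell is mapped back to a point inside the oriented box, and the feature value there is interpolated from its four neighbours, so the output is axis-aligned regardless of the box angle.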

[Figures 1 to 6]

Train

Download the ICDAR2015 data and the train_list from Baidu (password: q1au).

# train_list.txt lists the training image paths, one per line
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_546.jpg
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_277.jpg
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_462.jpg
/home/yangna/deepblue/OCR/data/ICDAR2015/icdar-2015-Ch4/img_237.jpg
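
If the downloaded list points at paths that do not exist on your machine, a list in the same one-path-per-line format can be regenerated. The following is a small sketch, not part of this repository: `write_train_list` is a hypothetical helper, and the set of image extensions is an assumption.

```python
import os


def write_train_list(image_dir, list_path, exts=(".jpg", ".jpeg", ".png")):
    """Write the absolute path of every image in image_dir to list_path, one per line."""
    # keep only files whose extension looks like an image (assumed set of extensions)
    names = sorted(n for n in os.listdir(image_dir) if n.lower().endswith(exts))
    paths = [os.path.abspath(os.path.join(image_dir, n)) for n in names]
    with open(list_path, "w") as f:
        for p in paths:
            f.write(p + "\n")
    return paths
```

For example, `write_train_list("/data/ICDAR2015/icdar-2015-Ch4", "train_list.txt")` (the directory here is a placeholder) would produce a file that can be passed to the training command.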

Training:

python train.py -train_list=$path_to/ICDAR2015.txt

Acknowledgments

Code borrows from MichalBusta/E2E-MLT
