fangaofeng / table-detection-and-document-layout-analysis

This project forked from prakhar-97/table-detection-and-document-layout-analysis

License: MIT License

Table-detection-and-Document-layout-analysis

Introduction

This project applies state-of-the-art techniques to table detection and document layout analysis. Table detection uses MMDetection version 1.2, while document layout analysis uses models developed with MMDetection version 2.0.

Setup

The models are developed in the PyTorch-based MMDetection framework (version 2.0).

git clone https://github.com/open-mmlab/mmdetection.git
cd mmdetection
python setup.py install
python setup.py develop
pip install -r requirements.txt

Image Augmentation

We use dilation and smudge techniques for data augmentation.
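
The README does not spell out either transform, so the following is only a rough sketch. It assumes grayscale pages with dark ink on a light background, reads dilation as a minimum filter that thickens strokes, and approximates smudging with a simple box blur. The function names and the 5x5 sample page are hypothetical, and the project's notebooks may well use a different method (for example, OpenCV morphology).

```python
def _window(img, r, c, radius):
    """Yield pixel values in the square window around (r, c), clipped at the borders."""
    rows, cols = len(img), len(img[0])
    for rr in range(max(0, r - radius), min(rows, r + radius + 1)):
        for cc in range(max(0, c - radius), min(cols, c + radius + 1)):
            yield img[rr][cc]


def dilate_ink(img, radius=1):
    """Thicken dark strokes: each pixel takes the darkest value in its window."""
    return [
        [min(_window(img, r, c, radius)) for c in range(len(img[0]))]
        for r in range(len(img))
    ]


def smudge(img, radius=1):
    """Soften strokes: each pixel becomes the integer mean of its window (box blur)."""
    out = []
    for r in range(len(img)):
        row = []
        for c in range(len(img[0])):
            values = list(_window(img, r, c, radius))
            row.append(sum(values) // len(values))
        out.append(row)
    return out


# Hypothetical 5x5 page: white background (255) with a single ink pixel (0).
page = [
    [255, 255, 255, 255, 255],
    [255, 255, 255, 255, 255],
    [255, 255,   0, 255, 255],
    [255, 255, 255, 255, 255],
    [255, 255, 255, 255, 255],
]
thick = dilate_ink(page)  # the ink pixel grows into a 3x3 block
soft = smudge(page)       # the ink pixel is averaged with its neighbours
```

Each augmented copy keeps the same table positions as the source page, so the original bounding-box annotations can be reused unchanged.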


Model Zoo

Config files for the models:

  1. For table detection: Config_file

  2. For document layout analysis: Config_file

Note: The config paths only need to be changed for training.

Checkpoints of the trained models:

Model Name                   Checkpoint File
Table structure recognition  Checkpoint
Document layout analysis     Checkpoint
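
As an illustration of how a config file and a checkpoint fit together, here is a minimal inference sketch built on MMDetection's `init_detector` and `inference_detector`. The helper names `keep_confident` and `detect`, the default device, the 0.5 score threshold, and any paths passed in are assumptions and do not come from this repository. The sketch also assumes the per-class result layout of MMDetection 1.x/2.x, where each detection row is `[x1, y1, x2, y2, score]`.

```python
def keep_confident(bbox_result, score_thr=0.5):
    """Filter per-class detections; each row is [x1, y1, x2, y2, score]."""
    return [
        [list(box) for box in boxes if box[4] >= score_thr]
        for boxes in bbox_result
    ]


def detect(config_path, checkpoint_path, image_path, device="cuda:0", score_thr=0.5):
    """Run a trained detector on one image and return confident boxes per class."""
    # Imported lazily so keep_confident can be used without MMDetection installed.
    from mmdet.apis import init_detector, inference_detector

    model = init_detector(config_path, checkpoint_path, device=device)
    result = inference_detector(model, image_path)
    # Mask-producing models return (bbox_result, segm_result); keep the boxes only.
    bbox_result = result[0] if isinstance(result, tuple) else result
    return keep_confident(bbox_result, score_thr)
```

A call such as `detect("path/to/config.py", "path/to/checkpoint.pth", "page.png")` would use the files listed above; the paths shown here are placeholders.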

Datasets

  1. Table detection and structure recognition: see Dataset for a better understanding of the dataset.

  2. Document layout analysis: see Dataset for a better understanding of the dataset.

Training

Refer to the two Colab notebooks mentioned above; they walk through the steps that need to be followed. If you are using a custom dataset, go through the MMDetection docs first.
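
For a custom dataset, one common route is to convert the annotations to COCO format, which MMDetection can read. The sketch below builds a minimal COCO-style annotation dictionary. The function name `to_coco`, the input layout, the file name `page_001.png`, and the single `table` class are all hypothetical and serve only to show the structure.

```python
import json


def to_coco(images, class_names):
    """Build a COCO-style dict.

    images: list of dicts with file_name, width, height and
            boxes = [(class_name, x, y, w, h), ...] in pixels.
    """
    cat_ids = {name: i + 1 for i, name in enumerate(class_names)}
    coco = {
        "images": [],
        "annotations": [],
        "categories": [{"id": i, "name": n} for n, i in cat_ids.items()],
    }
    ann_id = 1
    for img_id, img in enumerate(images, start=1):
        coco["images"].append({
            "id": img_id,
            "file_name": img["file_name"],
            "width": img["width"],
            "height": img["height"],
        })
        for name, x, y, w, h in img["boxes"]:
            coco["annotations"].append({
                "id": ann_id,
                "image_id": img_id,
                "category_id": cat_ids[name],
                "bbox": [x, y, w, h],  # COCO boxes are [x, y, width, height]
                "area": w * h,
                "iscrowd": 0,
            })
            ann_id += 1
    return coco


# Hypothetical single-page dataset with one table annotation.
example = to_coco(
    [{"file_name": "page_001.png", "width": 800, "height": 1000,
      "boxes": [("table", 50, 100, 700, 300)]}],
    class_names=["table"],
)
print(json.dumps(example, indent=2))
```

The resulting JSON file would then be referenced from the dataset section of the training config, as described in the MMDetection docs.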

Contributors

prakhar-97, sizhky
