Coder Social home page Coder Social logo

tabprods's Introduction

TabProDS

TabProDS is the accompanying dataset for the paper on "TabProIS: A Transfer Learning-Based Model for Detecting Tables in Product Information Sheets". It consists of 5,600 document images which were converted from Product Information Sheets. Out of these, 4,489 images contain at least one table. A table can have a wide variety of layouts, ranging from full grid-style borders to no borders at all. The layout of the document pages also varies. We tried to ensure that not too many document images with a similar layout bias the dataset.

Example documents

If you want to reuse our work, please cite it. An exemplary BibTex entry is listed at the bottom of this page.

Annotation Format

The images are annotated in the standard COCO format. We also provide automatically generated YOLO annotations.

Splits

The dataset is split into three parts:

  • 3922 (70%) images for training
  • 838 (15%) images for development
  • 840 (15%) images for validation

Resolution

The individual images were converted from PDF files in two different resolutions.

Full Resolution (300 dpi)

The full resolution images typically have the dimensions 2480x3509 pixels (portrait). Some might have a landscape orientation or different dimensions as well as different aspect ratios altogether.

In total, the full resolution dataset is 3.9 GB in size with an average of 670 kB per image.

Reduced Resolution (75 dpi)

The reduced resolution images are a sixteenth the size of their full resolution counterparts. This results in the typical dimensions of 620x877 pixels. Again, these dimensions are not guaranteed as some images might be in landscape orientation or a different size and aspect ratio.

The reduced resolution dataset weighs in at 344 MB with an average of 62 kB per image.

File Structure

.
├── annotations                       # all annotations
│   ├── coco                          # annotations in COCO format
│   │   ├── dev.full.json             # full resolution development set
│   │   ├── dev.reduced.json          # reduced resolution development set
│   │   ├── train.full.json           # full resolution training set
│   │   ├── train.reduced.json        # reduced resolution training set
│   │   ├── val.full.json             # full resolution validation set
│   │   └── val.reduced.json          # reduced resolution validation set
│   └── yolo                          # annotations in YOLO format
│       ├── labels
│       │   ├── full                  # full resolution labels
│       │   │   ├── P000053802-1.txt
│       │   │   ├── ... (5598 more)
│       │   │   └── P000237430-2.txt
│       │   └── reduced               # reduced resolution labels
│       │       ├── P000053802-1.txt
│       │       ├── ... (5598 more)
│       │       └── P000237430-2.txt
│       ├── dev.full.txt              # full resolution development set
│       ├── dev.reduced.txt           # reduced resolution development set
│       ├── train.full.txt            # full resolution training set
│       ├── train.reduced.txt         # reduced resolution training set
│       ├── val.full.txt              # full resolution validation set
│       └── val.reduced.txt           # reduced resolution validation set
├── img                               # all images
│   ├── full                          # full resolution (300 dpi) images  (coming soon)
│   │   ├── P000053802-1.jpg
│   │   ├── ... (5598 more)
│   │   └── P000237430-2.jpg
│   └── reduced                       # reduced resolution (150 dpi) images
│       ├── P000053802-1.jpg
│       ├── ... (5598 more)
│       └── P000237430-2.jpg
├── resources                         # resources for README.md
│   └── ...
└── README.md                         # this README file

Cite This

@article {
    # ... coming soon
}

tabprods's People

Watchers

 avatar Jan Delember avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.