psarpei / multi-type-td-tsr

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

License: MIT License

Python 1.20% Jupyter Notebook 98.80%

image-processing deep-learning table-structure-recognition table-detection table-detection-using-deep-learning ocr ocr-recognition ocr-python natural-language-processing nlp


multi-type-td-tsr's Issues

Issue with running table detection and structure recognition

Hi,
Thanks for the great work. I am trying to run your tdtsr.py script on my custom images, but I am running into an error with the config file.

Any idea how to resolve this would be appreciated. Thanks.

Training code

Hey, this work looks really good. May I have the training code so I can fine-tune the model on my own dataset?

Training Data

Thank you for making your source code public. Could you give access to your training dataset?

Script for CSV conversion possible with OCR?

Hello there, @Psarpei,

Thanks a lot for the work. I have a simple request. I saw in the Google Colab notebook that you have TSR.table_xml and TSR.table_csv, but those are missing here, especially the CSV one.

Is it feasible for you to provide them? CSV output is my main requirement for extraction. Other than that, everything is working very well.

I hope to get an update on this.
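For readers with the same need, here is a minimal sketch of how recognized cells could be written to CSV. It is not the repository's TSR.table_csv module. It assumes each cell is already available as a hypothetical `(x, y, w, h, text)` tuple, with the text coming from whatever OCR step you use, and the `row_tolerance` parameter is likewise an assumption of this example.

```python
import csv
import io


def cells_to_csv(cells, row_tolerance=10):
    """Group cells into rows by their y coordinate and return CSV text.

    `cells` is a list of (x, y, w, h, text) tuples. This tuple layout is
    an assumption for illustration, not the repository's data format.
    """
    rows = []
    # Visit cells top to bottom, then left to right.
    for cell in sorted(cells, key=lambda c: (c[1], c[0])):
        # A cell joins the current row if its y is close to the row's first cell.
        if rows and abs(cell[1] - rows[-1][0][1]) <= row_tolerance:
            rows[-1].append(cell)
        else:
            rows.append([cell])

    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for row in rows:
        # Order the cells of each row by x before writing their text.
        writer.writerow([c[4] for c in sorted(row, key=lambda c: c[0])])
    return buffer.getvalue()


example_cells = [
    (100, 12, 50, 20, "B1"),
    (10, 10, 50, 20, "A1"),
    (10, 50, 50, 20, "A2"),
    (100, 51, 50, 20, "B2"),
]
csv_text = cells_to_csv(example_cells)
```

This simple grouping does not handle merged cells or missing cells within a row; those would need column alignment on top of it.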

Can it recognize merged cells?

Hi, I was trying to run the model on an image in Colab with the following command:
!python /content/Multi_Type_TD_TSR/scripts/tsr.py --folder=/content/images --type="partially_color_inv" --img_output=/content/img_output --xml_output=/content/xml_output
But I got the following error:
Traceback (most recent call last):
  File "/content/Multi_Type_TD_TSR/scripts/tsr.py", line 30, in <module>
    boxes, img_processed = type_dict[args.type].recognize_structure(img)
  File "/content/Multi_Type_TD_TSR/scripts/TSR/table_structure_recognition_all.py", line 208, in recognize_structure
    center = [int(row[index][j][0] + row[index][j][2] / 2) for j in range(len(row[index]))]
IndexError: list index out of range

I am not sure whether I provided the correct value for the 'type' flag here.

Possibility of using Multi-Type-TD-TSR locally

Hi, great repo! I am trying out this repo as part of my project, and I would like to ask whether it can be used in a local Jupyter environment. I ran into an error when trying to run it locally and was unsure how to solve it. Thank you!

How to detect the center region and remove margins, and then deskew the table for correct detection?

ONNX conversion of the model

First of all, thanks a lot for sharing your work. Have you tried converting the model to ONNX, or could you provide some guidance in that direction?

Thanks

Error while running Colab notebook

Hello,

When I try to compile the model using your Colab notebook, I get the following error:

The checkpoint state_dict contains keys that are not used by the model:
  pixel_mean
  pixel_std
  proposal_generator.anchor_generator.cell_anchors.{0, 1, 2, 3, 4}

Any idea what could be going wrong? Cheers!

Is the model available for a Windows CPU or only with CUDA?

I couldn't find torchvision==0.8.1 for Windows.
Can it be replaced with a higher version? And could the model run on CPU?

IndexError: list index out of range

When I ran Multi-Type-TD-TSR-main/scripts/tsr.py with the sample image (Multi-Type-TD-TSR-main/images/bordered_example.png), I got this error:
  File "Multi-Type-TD-TSR-main/scripts/TSR/table_structure_recognition_lines_wol.py", line 211, in recognize_structure
    center = [int(row[index][j][0] + row[index][j][2] / 2) for j in range(len(row[index]))]
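The failing expression indexes `row[index]`, so one plausible cause (an assumption, not confirmed by the traceback alone) is that no rows of cell boxes were detected and `row` is empty. The sketch below shows a guarded version of the same center computation. The function name `column_centers` and the `(x, y, w, h)` box layout are hypothetical and chosen for illustration; this is not the repository's code.

```python
def column_centers(rows):
    """Return the horizontal centers of the boxes in the row with the most boxes.

    `rows` is a list of rows, each a list of (x, y, w, h) boxes. Returns an
    empty list instead of raising IndexError when nothing was detected.
    """
    non_empty = [r for r in rows if r]
    if not non_empty:
        # No cells found: let the caller decide how to handle this image.
        return []
    widest = max(non_empty, key=len)
    # Same arithmetic as the failing line: x + w / 2, truncated to int.
    # Sorting the result is a choice of this sketch.
    return sorted(int(x + w / 2) for x, y, w, h in widest)


centers = column_centers([[(0, 0, 10, 5)], [(0, 10, 10, 5), (20, 10, 10, 5)]])
```

With a guard like this, an image where structure detection finds no cells yields an empty result that can be reported or skipped rather than a crash.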

Provided model config: absolute path for base

When I run with the weights and config linked in the Readme, Detectron2 cannot correctly resolve the base config file:

  File "/lib/python3.7/site-packages/detectron2/config/config.py", line 46, in merge_from_file
    loaded_cfg = self.load_yaml_with_base(cfg_filename, allow_unsafe=allow_unsafe)
  File "/lib/python3.7/site-packages/fvcore/common/config.py", line 103, in load_yaml_with_base
    base_cfg = _load_with_base(base_cfg_file)
  File "/lib/python3.7/site-packages/fvcore/common/config.py", line 93, in _load_with_base
    return cls.load_yaml_with_base(base_cfg_file, allow_unsafe=allow_unsafe)
  File "/lib/python3.7/site-packages/fvcore/common/config.py", line 59, in load_yaml_with_base
    with cls._open_cfg(filename) as f:
  File "/lib/python3.7/site-packages/detectron2/config/config.py", line 34, in _open_cfg
    return PathManager.open(filename, "r")
  File "/lib/python3.7/site-packages/iopath/common/file_io.py", line 1012, in open
    bret = handler._open(path, mode, buffering=buffering, **kwargs)  # type: ignore
  File "/lib/python3.7/site-packages/iopath/common/file_io.py", line 612, in _open
    opener=opener,
FileNotFoundError: [Errno 2] No such file or directory: '/content/Base-RCNN-FPN.yaml'

That's because your config contains

_BASE_: "/content/Base-RCNN-FPN.yaml"

It should instead read

_BASE_: "../configs/Base-RCNN-FPN.yaml"

or simply

_BASE_: "Base-RCNN-FPN.yaml"
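Until the published config is corrected, the `_BASE_` line can be patched locally. The following is a minimal sketch that rewrites that line in the config text; the helper name `rewrite_base` and the sample config string are hypothetical. It assumes `_BASE_` appears at the start of a line, as in the excerpt above, and that Base-RCNN-FPN.yaml sits next to the config file.

```python
import re


def rewrite_base(config_text, new_base="Base-RCNN-FPN.yaml"):
    """Replace the first top-level `_BASE_:` line with a new base path."""
    return re.sub(
        r"^_BASE_:.*$",
        # A function replacement avoids backslash escapes being interpreted.
        lambda match: f'_BASE_: "{new_base}"',
        config_text,
        count=1,
        flags=re.MULTILINE,
    )


# Hypothetical config content for demonstration only.
original = '_BASE_: "/content/Base-RCNN-FPN.yaml"\nMODEL:\n  MASK_ON: False\n'
patched = rewrite_base(original)
```

To apply it to a file, read the config with `pathlib.Path.read_text`, pass the text through `rewrite_base`, and write the result back with `write_text`.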

Could you provide the training code?

Regarding the Multi-Type-TD-TSR model: could you share the implementation code?
