Comments (5)
I have managed to resolve the issue above. However, after running the tdtsr.py script, there were bunch of warnings and no output was generated according to README file. Do I need to do any preprocessing on my table images before running the tdtsr.py module?
from multi-type-td-tsr.
@Kehindeajayi01
how did you solve this error? trying to get a command line for tdtsr on colab.
What application are you working on? I'm applying these to medical documents to extract table with contents to obtain a dataframe in pandas at the end.
from multi-type-td-tsr.
@Kehindeajayi01 how did you solve this error? trying to get a command line for tdtsr on colab. What application are you working on? I'm applying these to medical documents to extract table with contents to obtain a dataframe in pandas at the end.
@salman-moh : I ran the code on google colab. This method mainly generates xml files for the table structures and not the text content.
from multi-type-td-tsr.
Thanks @Kehindeajayi01 . What did you do with the xml files? Also, how are you resolving the text extraction process? (I'm sure there are better algorithms now). Thanks!
from multi-type-td-tsr.
Thanks @Kehindeajayi01 . What did you do with the xml files? Also, how are you resolving the text extraction process? (I'm sure there are better algorithms now). Thanks!
on their collab, there is tesseract ocr which they used for extraction and then popped it to csv. but I can't find the function here.
Did you ?
from multi-type-td-tsr.
Related Issues (14)
- If you provide the training code? HOT 6
- Error while running colab notebook HOT 1
- Is the model available for a windows CPU or only with cuda? HOT 1
- provided model config: absolute path for base
- How to detect the center region and remove margins, and then deskew the table for correct detection?
- Script for CSV conversion possible with OCR?
- IndexError: list index out of range
- Can it recognize merged cells?
- Training Data
- Onnx conversion of the model
- Possibility of using Multi-Type-TD-TSR locally
- Training code HOT 1
- list index out of range error HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from multi-type-td-tsr.