watersink / ocrsegment Goto Github PK
View Code? Open in Web Editor NEWa deep learning model for page layout analysis / segmentation.
a deep learning model for page layout analysis / segmentation.
I downloaded UW3 frame but they do not contain original images.
Could you upload UW3 on drive and share it for me?
Thanks
@watersink How can I generate the .txt
bounding box file for an image?
@watersink I tested training on my other system with 30GB ram and Quadro M4000 graphics card and it worked there.
Thanks!
Originally posted by @ghost in #1 (comment)
I tried it to train using GTX970M 16GB RAM and it failed. I tried Google Colab and it failed too. I also tried using half of the dataset, setting per_process_gpu_memory_fraction=0.7 but it did not work. There is no other process using GPU. I am looking at the code trying to figure out is there something defining the input shape but it seems ok. Have you ever encountered the same problem?
Thanks in advance.
我有几点疑问之处,1.data_pre_process.py是根据txt文件生成对应的gt图片,但是给出的数据集中并没有txt文件,只有gt图片;2.提供的文件中并没有label.txt文件
@watersink
After downloading uw3-framed-lines-degraded-000
, I extracted the images to a folder called uw3
.
Then I ran the training command train_test.py
and got an error message:
home@home-lnx:~/ocrsegment$ python3 train_test.py
Traceback (most recent call last):
File "train_test.py", line 254, in <module>
train()
File "train_test.py", line 194, in train
dataset,one_epoch_num = get_tf_dataset(dataset_text_file="./uw3/label.txt",batch_size=batch_size,channels=batch_channel)
File "train_test.py", line 63, in get_tf_dataset
filenames, labels,one_epoch_num = read_labeled_image_list(dataset_text_file)
File "train_test.py", line 53, in read_labeled_image_list
with open(dataset_text_file,"r",encoding="utf-8") as f_l:
FileNotFoundError: [Errno 2] No such file or directory: './uw3/label.txt'
你好,请问save文件夹下的模型没有公布出来吗
1080ti用了一个卡都不行,您确定训练的代码没问题吗?报错在lstm
@watersink Thank you for your hard work
I have successfully trained a new model using train_test.py
.
ocrseg.ckpt-200.zip
But when using my model for segmentation, an error message appears:
home@home-lnx:~/ocrsegment$ python3 segmentation.py
Traceback (most recent call last):
File "segmentation.py", line 444, in <module>
lines = seg.extract_textlines(image)
File "segmentation.py", line 425, in extract_textlines
if len(image.shape)!=2:
AttributeError: 'NoneType' object has no attribute 'shape'
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.