watersink / ocrsegment

a deep learning model for page layout analysis / segmentation.

Python 100.00%

ocrsegment's Issues

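Network structure

Hello! I am trying to reproduce the paper with your code, but the network structure does not seem to match the one in the paper. Did you obtain your results with this version of the code?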

Cannot download UW3 on the internet?

I downloaded the UW3 framed data, but it does not contain the original images.
Could you upload UW3 to a drive and share it with me?
Thanks

Generating the Bounding Box label

@watersink How can I generate the .txt bounding box file for an image?

CUDA ERROR, run out of memory.

@watersink I tested training on my other system, which has 30 GB of RAM and a Quadro M4000 graphics card, and it worked there.
Thanks!

Originally posted by @ghost in #1 (comment)

I tried training on a GTX 970M machine with 16 GB RAM and it failed. I tried Google Colab and it failed too. I also tried using half of the dataset and setting per_process_gpu_memory_fraction=0.7, but neither helped. No other process is using the GPU. I am reading the code to see whether something fixes the input shape, but it looks fine. Have you ever encountered the same problem?

Thanks in advance.
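
A rough way to see why halving the dataset may not help: per-step GPU memory depends on the batch size and image resolution, not on how many images are on disk. The sketch below is only back-of-envelope arithmetic for a single float32 feature map; the function names and the example sizes (a 2048x2048 page, 64 channels) are illustrative assumptions, not values taken from train_test.py. Note also that per_process_gpu_memory_fraction caps the share of GPU memory TensorFlow may claim, so setting it to 0.7 leaves less memory for the model, not more.

```python
def feature_map_bytes(batch, height, width, channels, bytes_per_value=4):
    """Bytes needed to hold one dense activation tensor (float32 by default)."""
    return batch * height * width * channels * bytes_per_value


def to_gib(num_bytes):
    """Convert a byte count to GiB."""
    return num_bytes / 2**30


# Hypothetical sizes, chosen only to make the arithmetic easy to follow.
full = feature_map_bytes(batch=1, height=2048, width=2048, channels=64)
half = feature_map_bytes(batch=1, height=1024, width=1024, channels=64)

print(f"{to_gib(full):.2f} GiB")  # 1.00 GiB
print(f"{to_gib(half):.2f} GiB")  # 0.25 GiB
```

Training keeps many such tensors alive for backpropagation, so reducing the batch size or downscaling the input images is usually the first thing to try.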

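There is no label.txt file

I have two questions. 1. data_pre_process.py generates the corresponding gt images from txt files, but the provided dataset contains no txt files, only gt images. 2. The provided files do not include a label.txt file.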
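
The traceback reported under "Error when trying to train" shows train_test.py opening ./uw3/label.txt and reading filenames and labels from it. Until the author documents the format, one possible workaround is to build the file yourself. The sketch below is an assumption throughout: it guesses that each line holds an image path and a gt path separated by a space, and that images and gt masks live in two directories and share a file stem. The function name write_label_file and the directory layout are hypothetical; check read_labeled_image_list in train_test.py for the real format before relying on it.

```python
from pathlib import Path


def write_label_file(image_dir, gt_dir, out_path, exts=(".png", ".jpg")):
    """Pair images with gt masks by file stem and write one pair per line.

    ASSUMPTION: the "<image path> <gt path>" line format is a guess,
    not the format confirmed by the repository.
    """
    image_dir, gt_dir = Path(image_dir), Path(gt_dir)
    gt_by_stem = {
        p.stem: p for p in sorted(gt_dir.iterdir()) if p.suffix.lower() in exts
    }
    lines = []
    for img in sorted(image_dir.iterdir()):
        if img.suffix.lower() not in exts:
            continue
        gt = gt_by_stem.get(img.stem)
        if gt is None:
            continue  # skip images that have no matching gt mask
        lines.append(f"{img.as_posix()} {gt.as_posix()}")
    text = "\n".join(lines) + ("\n" if lines else "")
    Path(out_path).write_text(text, encoding="utf-8")
    return len(lines)
```

The function returns the number of pairs written, which makes it easy to spot a directory mismatch (a count of zero) before starting a training run.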

Error when trying to train

@watersink
After downloading uw3-framed-lines-degraded-000, I extracted the images to a folder called uw3.
Then I ran the training command train_test.py and got an error message:

home@home-lnx:~/ocrsegment$ python3 train_test.py 
Traceback (most recent call last):
  File "train_test.py", line 254, in <module>
    train()
  File "train_test.py", line 194, in train
    dataset,one_epoch_num = get_tf_dataset(dataset_text_file="./uw3/label.txt",batch_size=batch_size,channels=batch_channel)
  File "train_test.py", line 63, in get_tf_dataset
    filenames, labels,one_epoch_num = read_labeled_image_list(dataset_text_file)
  File "train_test.py", line 53, in read_labeled_image_list
    with open(dataset_text_file,"r",encoding="utf-8") as f_l:
FileNotFoundError: [Errno 2] No such file or directory: './uw3/label.txt'

But when using my model for segmentation, an error message appears:

home@home-lnx:~/ocrsegment$ python3 segmentation.py 
Traceback (most recent call last):
  File "segmentation.py", line 444, in <module>
    lines = seg.extract_textlines(image)
  File "segmentation.py", line 425, in extract_textlines
    if len(image.shape)!=2:
AttributeError: 'NoneType' object has no attribute 'shape'
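
The AttributeError above means that image was None by the time extract_textlines looked at image.shape. One common cause, assuming the image is read with OpenCV (the traceback does not confirm this), is a wrong or missing file path: cv2.imread returns None instead of raising an error. A minimal guard, with the hypothetical helper name load_image_checked, turns that silent None into a clear message:

```python
import os


def load_image_checked(path, loader):
    """Load an image and fail loudly if the file is missing or unreadable.

    `loader` is any callable that returns None on failure, such as
    cv2.imread (assumed here, not confirmed by the repository).
    """
    if not os.path.isfile(path):
        raise FileNotFoundError(
            f"image not found: {path!r} (current directory: {os.getcwd()})"
        )
    image = loader(path)
    if image is None:
        raise ValueError(f"file exists but could not be decoded: {path!r}")
    return image


# Example use, assuming OpenCV is installed:
#   import cv2
#   image = load_image_checked("page.png", cv2.imread)
```

Printing the current directory in the error helps when a relative path is resolved from a different folder than expected.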

ocrsegment's Issues

Network structure

Blank output!

Cannot download UW3 on the internet?

Generating the Bounding Box label

CUDA ERROR, run out of memory.

There is no label.txt file

Error when trying to train

Model location

Hello, a question: training on the 24 image pairs in uw3 runs out of GPU memory

Segmentation.py [AttributeError: 'NoneType' object has no attribute 'shape']
