Coder Social home page Coder Social logo

yinchangchang / ocr_densenet Goto Github PK

View Code? Open in Web Editor NEW
456.0 456.0 159.0 6.81 MB

第一届西安交通大学人工智能实践大赛(2018AI实践大赛--图片文字识别)第一名;仅采用densenet识别图中文字

Python 100.00%
densenet ocr ocr-recognition python pytorch

ocr_densenet's People

Contributors

yinchangchang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

ocr_densenet's Issues

输出文字有乱序现象。

您好,我在用自己图片训练时,loss到0.01以下后,测试训练集内的数据,发现输出的文字有乱序现象。
训练图片是32*280,每张图至多10个字符。字符集大概400,包括汉字、标点、数字等。
乱序很多是三个字的: 123 会变成 132, 但也有12345变成51234的。
不知道是原因,请不吝赐教!

IndexError: tuple index out of range

您好,我是在window7运行该工程的。运行环境是是torch1.0.0,最后改成可以训练时,出现如题所示的错误。错误如下:
`best_f1score 0

1 train-1
0%| | 0/596 [00:00<?, ?it/s]T
raceback (most recent call last):
File "main.py", line 864, in
if name == 'main':
File "main.py", line 831, in main

File "main.py", line 494, in train_eval
images, labels = [Variable() for x in data[1:3]]
File "main.py", line 471, in get_weight
label_true = (labels>0.5).sum(0)
IndexError: tuple index out of range
0%| | 0/596 [00:16<?, ?it/s]

在get_weight函数中我是没有动的,如下:def get_weight(labels):
labels = labels.data.cpu().numpy()
weights = np.zeros_like(labels)
# weight_false = 1.0 / ((labels<0.5).sum() + 10e-20)
# weight_true = 1.0 / ((labels>0.5).sum() + 10e-20)
weight_false = 1.0 / ((labels<0.5).sum(0) + 10e-20)
label_true = (labels>0.5).sum(0)
for i in range(labels.shape[1]):
label_i = labels[:,i]
weight_i = np.ones(labels.shape[0]) * weight_false[i]
# weight_i = np.ones(labels.shape[0]) * weight_false
if label_true[i] > 0:
weight_i[label_i>0.5] = 1.0 / label_true[i]
weights[:,i] = weight_i
weights *= np.ones_like(labels).sum() / (weights.sum() + 10e-20)
weights[labels<-0.5] = 0
return weights`
给为大佬能给点提示吗?或者大概说一下这个函数的具体作用,labels的数据类型是啥?

about learning rate schedule

F1到0.9以后一直固定是1e-4,为什么继续训练下去学习率不选择继续下降?是出于怎样的考虑呢?谢谢

为什么不能运行啊

为什么我用你的源码运行不了呢 总是提醒module不能带“."

_ _20181203114510

把. 去掉也不行,会有
_ _20181203114831

建议写一下各个模块的版本

这边运行的时候出现很多问题,pytorch的版本需要下调,matplotlibpip2直接安装报错,需要指定2点几的版本,希望楼主可以写一下各个模块的基本版本

test label咨询

你好,可以分享下test label,下载链接只有train label

args.model = resnet

args.model = resnet 什么时候用到呢,训练时,args.model = resnet 还是 densenet呢

有关训练准确率问题

您好,我下载了您的代码和数据,训练很长一段时间准确率一直在80-90徘徊,召回率一直在60-80徘徊,而F最高只有68,和您前面给出的数据差距挺大的,能请问下可能是什么原因吗?谢谢!

请问下怎么使用你最新的预训练模型呢?

我直接使用你的代码和数据,训练了一晚上精度只有70%,后边精度突然变为了0,我怀疑是预训练的模型没有记载你最新的模型,请问我该怎么做呢?多谢你分享的源码!

训练时出现的bug

2018-07-06 00-08-08
在epoch=4时,仍能正常训练,当epoch=5时,发现train1和train2的f1score、recall、precision突然变得很小,eval的三个全变成了0,导致epoch从5开始,eval的三个指标一直都是0,这是什么原因呢

分析数据集的代码analysis_dataset.py好像有问题,具体如下

  1. 第37行对h进行除以10的操作,会产生小数,导致第47行的h_count_dict.keys()全是小数,从而x = range(max(h_count_dict.keys())+1)会报错,因为range里面只能是整数;
  2. 继续看下去,第48行的y = [0 for _ in x]让y初始化为一个列表,51行y[h] = h_count_dict[h]赋值操作,因为h是小数,所以y[h]写法是错误的,会报错;
  3. 请问作者是怎么成功运行analysis_dataset.py的???

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.