Light

rennsax / jaccount_captcha Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 0.0 271 KB

A project for the course "Inter cutting-edge algorithm and technology".

License: MIT License

Python 100.00%

jaccount_captcha's Introduction

Jaccount 验证码识别

说明：该项目是课程 英特尔前沿AI算法与实践 的大作业。

该项目中包含：

一篇 ResNet 网络的原论文，来自何晓明博士
用以获取训练集、搭建及训练模型的构建代码
可视化用户端实现（位于 app 文件夹中）

项目使用说明

注：最要不要擅自更改文件夹的名称和位置！

安装依赖库。
```
pip install -r requirements.txt
```
如果不想体验从头搭建一个模型，由于模型过大未上传 github，前往链接交大云盘下载后存入 app/ 文件夹，然后跳至 11；否则请跳至 3。
运行 data_get.py 爬取若干张原始验证码图片，将创建 original_pic 文件夹并自动存入。
运行 noise_1.py 对原始图片进行二值化处理，将创建 pic/close 文件夹并自动存入。
运行 division.py 对二值化后的图片进行分割操作，形成单个的字符，分割结果将存入 divided 文件夹。
运行 noise_2.py 对 divided 文件夹中的图片进行平滑处理，结果存入 smoothed 文件夹。
运行 recognize.py，调用 Google 开源的 tesseract 进行字符识别，识别结果将存入 result 文件夹。
识别结果必然不是完美的，未能识别出的图片位于 smoothed/unrecognized 文件夹中，有两种情况：
- 纯粹没识别出来
- 分割错误导致未能识别
除此之外，result 文件夹中的识别结果也会存在错误。需要人工对错误的字符进行重识别，放入正确的文件夹中。
运行 transform.py，将 result 中的识别结果转化为一张 pic.csv 数据集。图片数较多的时候，pic.csv 可能会很大，1000 张图片对应的大小大概是 700MB。
运行 resnet.py，读取 pic.csv 数据集，创建训练集、测试集、验证集，创建模型并进行训练。可以对 resnet.py 中的全局变量做出适当调整。训练出的模型将覆盖 app/model.h5。
运行 app/main.py，选择导入验证码图片进行识别。

项目总结

收获

学习了机器学习数据集的获取及制作
对卷积神经网络有了更深的理解，对 ResNet 网络也有了一定的了解
更熟练地使用 opencv 进行图像处理
学习了 GUI 界面的简单实现
一些项目管理的经验

不足之处

在数据集处理过程中，需要人工识别字符，效率低，费时费力
部分代码逻辑不够清晰，未能做到完美 DeBug
GUI 界面有些简陋；以 GUI 界面实现的 Jaccount 验证码识别也不够便利

jaccount_captcha's People

Contributors

Stargazers

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.