Coder Social home page Coder Social logo

vision's Introduction

Vision

Computuer Vision and Image Processing with Python, OpenCV, and Keras.


๐ŸŒ 1. Dehazing to Object Recognition ๐Ÿšง

Human, and fire detection and recognition in indoor foggy environment.

i. Image Processing with OpenCV

A. Dehazing

a. Dark Channel Prior | GitHub | Paper

B. Object Recognition

a. Human detection with RPi | Github
b. HOG detectMultiScale | Blog (KR) | Blog (US)

ii. Image Processing with Deep Learning

๐Ÿ“š A. Dataset | Reference

a. RESIDE: V0 (REalistic Single Image DEhazing) | Homepage | Paper (IEEE) | Paper (arXiv)
b. I-HAZE | Homepage | Paper (arXiv)
c. D-HAZY | Homepage | Paper (IEEE)
d. O-HAZE | Paper (arXiv)

๐Ÿ iii. Challenge

A. NTIRE2018 | Homepage


๐ŸŽฏ 2. Vanishing Point to Control Posture โœ”๏ธ

Using Image Processing with OpenCV


3. Depth to Obstacle Avoidance ๐Ÿšง

i. Image Processing with Deep Learning

๐Ÿ“š A. Dataset

a. NYU Dataset V1 | Homepage | Paper
b. NYU Dataset V2 | Homepage | Paper
c. Sample Code for NYU
> pip install scikit-image
> python -m pip install -U matplotlib
import skimage.io as io

to

import matplotlib
matplotlib.use('TkAgg')
from skimage import io
io.use_plugin('matplotlib')
c. RGBD Dataset | Homepage

B. Model

a. FCRN | Paper (arXiv) | GitHub

4. Optical Character Recognition to Path Planning ๐Ÿšง

i. Tesseract | GitHub | Demo

ii. PyTesseract | GitHub | Install with pip

> pip install pytesseract

A. Install Windows Version | GitHub | Download

B. Tesseract training | GitHub

C. Variable-size Graph Specification Language (VGSL) | GitHub

D. StreetView Tensorflow Recurrent End-to-End Transcription (STREET) | GitHub

E. Korean OCR | Blog (KR)

  • a. Remove spaces | Blog (KR)
  • b. Once you change the route, you need to turn off and restart Pycharm.
  • c. Remove special characters | Blog (KR)
  • d. Tesseract Optimal conditions | Blog (KR)
  • e. Color Reversal | Blog (KR)
image = cv2.bitwise_not(input_image)
image = cv2.resize(input_image, dsize=(0, 0), fx=0.3, fy=0.7, interpolation=cv2.INTER_LINEAR)

๐Ÿ iii. Challenge

A. Robust Reading Competition ICDAR | Homepage

Overview - ICDAR 2019 Robust Reading Challenge on Multi-lingual scene text detection and recognition | Homepage

iv. Korean STR

Korean Scene Text Recognition by Character-level

๐Ÿ“š A. Dataset

a. Dataset Version 1 (.png) | MNIST.zip | PHD08.zip
Dataset Language # of fonts # of characters total # of images
PHD08 Korean 9 2,350 5,139,450
EMNIST (ByClass) Number & English Hand Writing 62 814,255
b. Dataset Version 2 (.png) | 7z
Language # of fonts # of characters total # of images
Korean 70 11,172 782,040
Number 70 10 700
English 70 52 (upper 26, lower 26) 3,640
c. Dataset Version 3 (.png) | 7z | zip
Language # of fonts # of characters total # of images
Korean 69 972 58,320
Number 69 10 690
English 69 52 (upper 26, lower 26) 3,588
d. Dataset Version 4 (.png) | 7z | zip | pkl
Language # of fonts # of characters total # of images
Korean 69 972 116,640
Number 69 10 1,240
English 69 26 3,588
  • Korean and numbers use two font sizes.
  • Incorporate uppercase letters into lowercase letters.
  • Font sizes of Korean and number are 44, and 54, font size of English is 44.
  • Image size is 64.
  • Hangul characters refer to this and calculated the frequency.

Install


  • Image Processing vs. Computer Vision Reference
  • Dehazing - Image Processing/Object Recognition - Computer Vision
  • Vanishing Point - Computer Vision
  • Depth Prediction - Computer Vision
  • Opitcal Character Recognition - Computer Vision

vision's People

Contributors

inyong37 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.