Coder Social home page Coder Social logo

mlnd-capstone-1's Introduction

MLND-Capstone

Capstone project for Udacity's Machine Learning Engineer Nanodegree

Problem Statement

This project will use a deep learning convolutional neural network to classify multi-digit number sequences.

The project is a TensorFlow implementation of "Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks", available here

Dataset

The dataset can be downloaded from The Street View House Numbers (SVHN) Dataset

Download train.tar.gz, test.tar.gz, and extra.tar.gz and extract into this folder.

Scripts

Several Python scripts are available to train the model:

  • preprocess.py: Use this script to preprocess each of the downloaded images. This will detect the bounding boxes around the house numbers, crop out the numbers, and resize the numbers to 64x64 images.
  • split.py: Use this script to split the data into training, validation, and test sets.
  • train.py: Use this script to train the model. Training with an NVIDIA Titan X (Pascal) GPU will take approximately three days to reach 95% accuracy.
  • eval.py: Use this script to periodically evaluate the validation set during training. After finishing training, run with -set=test to calculate accuracy and coverage on the test set.
  • predict.py: Use this script to make predictions on new images in the predict directory. Images should be 64x64 jpegs. Results will be available in predict.csv.
  • export.py: Use this script to extract the trained model in a format suitable for use in external applications.

Two Python utility modules are used for training and evaluation.

  • model.py: This defines the model, optimizer, and loss functions.
  • input.py: This defines the image input and preprocessing data pipelines.

Report

A final report explaining this project and the surrounding problem domain is available as report.pdf.

License

The code for this project is open source and available under the terms of the license in this repository.

mlnd-capstone-1's People

Contributors

rgcottrell avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.