Coder Social home page Coder Social logo

philferriere / aind-projects Goto Github PK

View Code? Open in Web Editor NEW
27.0 9.0 31.0 84.15 MB

Phil Ferriere's Artificial Intelligence Nanodegree Projects (reports and code)

Python 2.32% Batchfile 0.01% Shell 0.01% HTML 50.55% CSS 0.01% JavaScript 0.04% Jupyter Notebook 47.09%

aind-projects's Introduction

Artificial Intelligence Nanodegree Projects

This repo contains our work on Udacity's Artificial Intelligence Nanodegree Projects.

Facial Keypoint Detection + Real-time Filtering [Capstone Project]

This exciting and fun capstone project required combining traditional computer vision techniques and deep learning to build and end-to-end facial keypoint recognition system. Fisrt, we used OpenCV to build a face detector and pre-process video input data. Second, we trained a CNN to detect facial keypoints in the face area of the video frame. Third, we used OpenCV again to apply a template (sunglasses) on top of the facial keypoints in real-time.

facial-keypoints1 facial-keypoints2

Our capstone project can be reviewed here.

CNN-based Dog Breed Classifier

For this computer vision problem, we experimented with two different CNN (Convolutional Neural Network) architectures. In our first approach, we built a CNN from scratch. Second, we used transfer learning to leverage pre-trained models (VGG-16, VGG-19, ResNet-50, Inception, Xception), only training the head of the network (the fully connected classification layer) for our specific application (84.8% test accuracy w. ResNet-50)

dog-project

The notebook for this project can be found here.

Facial Expressions Identification

In this project, we use Affectiva’s Emotion-as-a-Service API to track faces in a video and identify facial expressions. We tag each face with an appropriate emoji next to it. If you own a webcam, you may also play a simple game: mimic a random emoji we display and we will automatically recognize if your expression matches our random selection!

cv-mimic

Our implementation is available here.

Sign Language Recognition

Here, we use HMMs (Hidden Markov Models) to recognize gestures in American Sign Language, from individual words to complete sentences. We train our system on a dataset of videos that have been pre-processed and annotated, and test on novel sequences.

recognizer

Click here to review the IPython notebook for this project.

Planning Search

For this project, we implement a planning search agent to solve deterministic logistics planning problems for an Air Cargo transport system. We use a planning graph and automatic domain-independent heuristics with A search* and compare their results/performance against several uninformed non-heuristic search methods (breadth-first, depth-first, etc.).

Additional details can be found here.

Game-Playing Agent

This game-playing agent uses techniques such as iterative deepening, Minimax, and Alpha-Beta Pruning to beat its opponent in a game of Isolation (a two-ply discrete competitive game with perfect information).

isolation

For details about our implementation, please visit this link.

Diagonal Sudoku Solver

In this introductory project, we use constraint propagation to find solutions to Sudoku puzzles, repeatedly applying game rules (constraints) until the Sudoku puzzle stops changing.

cv-sudoku

For more information about this project, go here.

Contact Info

If you have any questions about this work, please feel free to contact us here:

https://www.linkedin.com/in/philferriere

aind-projects's People

Contributors

philferriere avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.