jiayue-zhou / vqa-system-with-mac-and-clevr


A Visual Question Answering (VQA) system built on the CLEVR dataset and the MAC architecture. It takes an image from the CLEVR dataset and a question written as a sentence, and a deep learning model outputs the answer to the question.

Python 79.64% HTML 20.36%

vqa-system-with-mac-and-clevr's Introduction

demo_vqa

A Backup of the VQA System

This is the code for a Visual Question Answering (VQA) system.

The code has two parts: a simple demo for presentations, and the back end of the VQA system.

Demo for Presentation

Python (Flask), HTML

This is the easy part. I use only very basic features of Flask to provide an application for the presentation.

Basically, I have a server.py that connects the back end to the front end. It is the entry point of the program.

Additionally, I have an HTML page (static/demo2.html) that displayed everything during the presentation.
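The role of server.py described above can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the `/answer` route, the JSON field names, and the `answer_question` helper are all hypothetical, and the helper only returns a placeholder where the PyTorch back end would be called.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)  # serves files from ./static by default


def answer_question(image_name: str, question: str) -> str:
    """Hypothetical stand-in for the PyTorch back end.

    A real implementation would load the image, encode the question,
    run the model, and return the predicted answer word.
    """
    return "unknown"


@app.route("/")
def index():
    # Serve the presentation page kept under static/ (file name from the README).
    return app.send_static_file("demo2.html")


@app.route("/answer", methods=["POST"])
def answer():
    # Hypothetical request format: {"image": "...", "question": "..."}
    payload = request.get_json(silent=True) or {}
    image_name = payload.get("image", "")
    question = payload.get("question", "")
    if not image_name or not question:
        return jsonify(error="both 'image' and 'question' are required"), 400
    return jsonify(answer=answer_question(image_name, question))
```

On recent Flask versions, a file like this could be started with `flask --app server run`, after which the HTML page would send the chosen image name and the question to the hypothetical `/answer` route.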

VQA System

Python (PyTorch)

This is the main part of the program.

As part of a VQA task (https://visualqa.org/), I built a VQA system on the CLEVR dataset (https://cs.stanford.edu/people/jcjohns/clevr/).

The algorithm and network architecture are from "Compositional Attention Networks for Machine Reasoning" (ICLR 2018).

The model uses a hidden size of 512 and 12 iterations of the recurrent memory "cell". Images are preprocessed with ResNet, and the natural-language questions are preprocessed with GloVe embeddings followed by an LSTM.
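To make the iterated memory "cell" more concrete, here is a simplified PyTorch sketch of one reasoning step (control, read, write) applied 12 times. It is an illustration under assumptions, not the code in this repository: the class and function names are hypothetical, the inputs are assumed to be precomputed (LSTM question features and projected ResNet image features), and details of the paper such as per-step question projections, learned initial states, and the optional self-attention and gating in the write unit are left out.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SimplifiedMACCell(nn.Module):
    """One reasoning step: control -> read -> write (simplified sketch)."""

    def __init__(self, dim: int = 512):
        super().__init__()
        # Control unit: decides which question words to attend to.
        self.control_in = nn.Linear(2 * dim, dim)
        self.control_attn = nn.Linear(dim, 1)
        # Read unit: retrieves information from the image features.
        self.read_mem = nn.Linear(dim, dim)
        self.read_kb = nn.Linear(dim, dim)
        self.read_concat = nn.Linear(2 * dim, dim)
        self.read_attn = nn.Linear(dim, 1)
        # Write unit: merges the retrieved information into the memory.
        self.write = nn.Linear(2 * dim, dim)

    def forward(self, control, memory, question, words, knowledge):
        # control, memory, question: (batch, dim)
        # words: (batch, num_words, dim); knowledge: (batch, num_regions, dim)
        cq = self.control_in(torch.cat([control, question], dim=1))
        word_logits = self.control_attn(cq.unsqueeze(1) * words)
        word_weights = F.softmax(word_logits, dim=1)  # attention over words
        control = (word_weights * words).sum(dim=1)

        interaction = self.read_mem(memory).unsqueeze(1) * self.read_kb(knowledge)
        interaction = self.read_concat(torch.cat([interaction, knowledge], dim=2))
        region_logits = self.read_attn(control.unsqueeze(1) * interaction)
        region_weights = F.softmax(region_logits, dim=1)  # attention over regions
        retrieved = (region_weights * knowledge).sum(dim=1)

        memory = self.write(torch.cat([retrieved, memory], dim=1))
        return control, memory


def run_mac(cell, question, words, knowledge, steps: int = 12):
    """Apply the same cell `steps` times, starting from zero states (an assumption)."""
    batch, dim = question.shape
    control = question.new_zeros(batch, dim)
    memory = question.new_zeros(batch, dim)
    for _ in range(steps):
        control, memory = cell(control, memory, question, words, knowledge)
    return memory  # would be fed, with the question, to an answer classifier
```

The same cell weights are reused at every step, so the number of iterations (12 here) changes the depth of reasoning without changing the number of parameters.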

The best accuracy on the test set is 96.95%.
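For reference, accuracy on a dataset like CLEVR is usually the fraction of questions whose predicted answer exactly matches the ground-truth answer. The sketch below assumes that definition; the function name and the toy answers in the usage note are illustrative only and are not taken from this project's evaluation code or results.

```python
def accuracy(predictions, targets):
    """Fraction of predictions that exactly match their target answer."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(p == t for p, t in zip(predictions, targets))
    return correct / len(targets)
```

With toy data, `accuracy(["red", "2", "yes", "cube"], ["red", "3", "yes", "cube"])` gives 0.75, because three of the four answers match.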

Part of this VQA system is based on the VQA system built by the Media Intelligence Lab, Hangzhou Dianzi University.
