Coder Social home page Coder Social logo

img_caption_project's Introduction

Image-Captioning-Project

Instructions

1. Please ensure you install coco api

cd cocoapi/PythonAPI
make

or

python setup.py install

2. Make sure your cocoapi folder in following tree struct (images/ at least with train and val)

cocoapi
├─annotations
│  └─.idea
├─common
├─images
│  ├─test2014
│  ├─train2014
│  └─val2014
├─MatlabAPI
│  └─private
├─PythonAPI
│  ├─build
│  │  ├─lib.win-amd64-3.6
│  │  │  └─pycocotools
│  │  └─temp.win-amd64-3.6
│  │      ├─common
│  │      ├─MatlabAPI
│  │      │  └─private
│  │      └─Release
│  │          └─pycocotools
│  ├─demos
│  └─pycocotools
│      └─__pycache__
└─results

3. Play with the app

  • Move to working dir
cd img_caption
  • Run the app
python TinkerUI.py
  • Click Browse and we do provide some test images for you:
img_caption/user_uploaded
  • Boom! You should have see some good stuff!

4. Model Applied

alt text Our models is in:

img_caption\models\legit_model

(PS: Dont ask how many times failed in training and just take a look in the jupyter notebook log - img_caption\2_Training.ipynb)

5. Jupyter Notebook Workflow

  • Vocab generation -- img_caption\1_Preliminaries.ipynb
  • Training (with Val BLEU score at the bottom of the notebook) -- img_caption\2_Training.ipynb
  • Demo -- img_caption\3_Demo.ipynb

6. Legit and Funny Outputs

  • Perfect captioning:

alt text alt text

  • Weird sentence format or not fully captioning

alt text alt text

  • These are some really funny outcome

alt text alt text

(PS: I can be a giraffe as well!!!)

7. Group Member

1001603 ZHOU XUEXUAN
1001427 WENG YUNFAN
1001417 ZHANG CHENG
1001426 SHANG XIAOSHENG (Play less PUBG plz)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.