Coder Social home page Coder Social logo

bh-an / image-captioning Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 275.19 MB

ML models for Image captoining using CNN+LSTM and ResNet+GRU on the Flickr8k dataset

Jupyter Notebook 100.00%
caption-generation gru-neural-networks language-model lstm-neural-network python resnet

image-captioning's Introduction

Image-captioning

ResNet+GRU results:

Image 1

Actual Captions :- 
The two ladies are riding bicycles near the beach .
Two women in summer wear ride beach cruiser tricycles on the concrete near the beach .
Two women on low riding three-wheeled vehicles with baskets .
two women ride their three wheelers .
Two women riding tricycles .

Predicted Caption : Two women ride a three wheelers .
0.5814307369682193

Image 2

Actual Captions :- 
Two girls arm wrestle as another observes
Two girls arm wrestle while a third girl in a pink shirt and glasses watches .
Two girls arm wrestling , while another looks on .
Two teenage girls arm wrestle while a third girl watches .
Two young girls are arm wrestling in their hotel room while another girl watches .

Predicted Caption : Two girls arm wrestle while a third girl watches .
1.0

Image 3

Actual Captions :- 
A bunch of dogs are competing in a race .
Five greyhounds are racing on a sand track .
Muzzled greyhounds are racing on the track .
several muzzled greyhound dogs racing around a track
The number 2 dog in the blue vest is in the lead at the dog races .

Predicted Caption : Three muzzled greyhounds race around a track while a dog watches .
0.33180774028439436

CNN+LSTM results:

Image 1 Image 2 Image 3

image-captioning's People

Contributors

bh-an avatar

Stargazers

 avatar

Watchers

 avatar  avatar

image-captioning's Issues

Can you improve it please

There is no verification set generator in the code, so there is no verification set loss and training set loss when training the model, and the code for Bleu score evaluation is missing. Can you improve it when you have time?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.