We explore the task of retrieving similar captioned images from a dataset, given a previously unseen captioned image.
Note: the source code in `SpatialPyramid` has some bugs fixed. It is not exactly the same as the original source code from UIUC.
- Add `SpatialPyramid` to the path
- Unzip the Flickr 8k dataset to a `data` subdirectory
- Run `close all; clear all; baseline;`
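
Before launching the baseline, it can help to confirm that both directories are in place. The following is a minimal sketch, assuming `SpatialPyramid` and `data` both sit directly under the directory you run it from; the `missing_dirs` helper is hypothetical and not part of this repository.

```python
from pathlib import Path

# Directory names come from the setup steps above; their location
# directly under the working directory is an assumption.
REQUIRED_DIRS = ("SpatialPyramid", "data")


def missing_dirs(root="."):
    """Return the required directories that do not exist under `root`."""
    root = Path(root)
    return [name for name in REQUIRED_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_dirs()
    if missing:
        print("Missing before running baseline:", ", ".join(missing))
    else:
        print("Layout looks complete; run baseline in MATLAB.")
```

This only checks that the directories exist; it does not verify the contents of the Flickr 8k archive.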
- Add the required search tags to the `searchTags` array on line 13 of `crawler/gallerySearch.py`
- Execute `python crawler/gallerySearch.py`
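
The edit to `searchTags` might look like the sketch below. It assumes `searchTags` is a plain Python list of strings; the tag values and the `clean_tags` helper are hypothetical and are not taken from `crawler/gallerySearch.py`.

```python
def clean_tags(tags):
    """Strip surrounding whitespace and drop empty or duplicate tags, keeping order."""
    seen = set()
    cleaned = []
    for tag in tags:
        tag = tag.strip()
        if tag and tag not in seen:
            seen.add(tag)
            cleaned.append(tag)
    return cleaned


# Hypothetical tag values; replace them with the tags you want to crawl.
searchTags = clean_tags(["dogs", "beach ", "mountains", "dogs"])
print(searchTags)  # ['dogs', 'beach', 'mountains']
```

Removing duplicates up front avoids issuing the same search twice, which matters for a crawl of this size.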
Note: The crawled Imgur dataset (DataM) consists of 32K images and ~110K captions and is about 16 GB in size. It can be provided on request.
See the readme in `lda/`
See the readme in `caffenet/`
The dataset and associated captions can be found at: http://pages.cs.wisc.edu/~ms/CS766-ComputerVision/captioned-image-retrieval/