Coder Social home page Coder Social logo

wanfang13 / melm Goto Github PK

View Code? Open in Web Editor NEW
39.0 2.0 7.0 78 KB

Min-Entropy Latent Model for Weakly Supervised Object Detection

Home Page: https://github.com/Winfrand/MELM

Shell 1.18% Lua 65.68% CMake 3.10% Cuda 25.44% C 4.60%
min-entropy wsod

melm's Introduction

New

A simplified version of MELM with context in PyTorch is released [here] by vasgaowei.

Prerequisites

  • Linux (tested on ubuntu 14.04LTS)
  • NVIDIA GPU + CUDA CuDNN
  • Torch7

Getting started

  1. Install the dependencies

    luarocks install hdf5 matio protobuf rapidjson loadcaffe xml
  2. Download dataset, proposals and ImageNet pre-trained model

    Download VOC2007 from: http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

    Download proposals from: https://dl.dropboxusercontent.com/s/orrt7o6bp6ae0tc/selective_search_data.tgz

    Download VGGF from: http://www.robots.ox.ac.uk/~vgg/software/deep_eval/releases/bvlc/VGG_CNN_F.caffemodel https://gist.githubusercontent.com/ksimonyan/a32c9063ec8e1118221a/raw/6a3b8af023bae65669a4ceccd7331a5e7767aa4e/VGG_CNN_F_deploy.prototxt

    The data folder has the following structure:

    $MELM/data/datasets/VOCdevkit_2007/
    $MELM/data/datasets/VOCdevkit_2007/VOCcode
    $MELM/data/datasets/VOCdevkit_2007/VOC2007
    $MELM/data/datasets/VOCdevkit_2007/...
    $MELM/data/datasets/proposals/
    $MELM/data/models/
    $MELM/data/results/
  3. Install functions

    cd ./MELM
    export DIR=$(pwd)   
    
    cd $DIR/utils/c-cuda-functions
    sh install.sh
    
    cd $DIR/layers
    luarocks make
  4. Train and test

    cd $DIR
    sh Run_MELM.sh 0 VOC2007 VGGF SSW 0.1 None melm

Acknowledgements

This work would not have been possible without prior work: Vadim Kantorov's contextlocnet, Spyros Gidaris's LocNet, Sergey Zagoruyko's loadcaffe, Facebook FAIR's fbnn/Optim.lua.

melm's People

Contributors

wanfang13 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

melm's Issues

ImageNet Det split

Hi,

Thanks for sharing your implementation. I notice that the val1/val2 splits are used for training and testing on ImageNet follow the RCNN.
I would ask do you use the images from train set? Because rbgirshick mentioned that images from val1 and 1K images per class from train are both utilized.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.