Coder Social home page Coder Social logo

hpf's Introduction

Hyperpixel Flow:
Semantic Correspondence with Multi-layer Neural Features

This is the implementation of the paper "Hyperpixel Flow: Semantic Correspondence with Multi-layer Neural Features" by J. Min, J. Lee, J. Ponce and M. Cho. Implemented on Python 3.6 and Pytorch 1.0.1.

For more information, check out project [website] and the paper on [arXiv].

Conda environment settings

conda create -n hpf python=3.6
conda activate hpf

cat /usr/local/cuda/version.txt
conda install pytorch=1.0.1 torchvision cudatoolkit=10.0 -c pytorch (if CUDA 10) 
conda install pytorch=1.0.1 torchvision cudatoolkit=9.0 -c pytorch (if CUDA 9) 

conda install -c anaconda scikit-image
conda install -c anaconda pandas
conda install -c anaconda requests
conda install pillow=6.1
pip install gluoncv-torch

Reproduction

Beam search on SPair-71k validation set:

python beamsearch.py --dataset spair --thres bbox --backbone resnet50
python beamsearch.py --dataset spair --thres bbox --backbone resnet101

Beam search on PF-PASCAL validation set:

python beamsearch.py --dataset pfpascal --thres bbox --backbone resnet50
python beamsearch.py --dataset pfpascal --thres bbox --backbone resnet101  

Results on PF-PASCAL: (PCK: 83.4%, 84.8%, 88.3%)

python evaluate.py --dataset pfpascal --backbone resnet50 --hyperpixel '(2,7,11,12,13)'
python evaluate.py --dataset pfpascal --backbone resnet101 --hyperpixel '(2,17,21,22,25,26,28)'
python evaluate.py --dataset pfpascal --backbone fcn101 --hyperpixel '(2,4,5,18,19,20,24,32)'

Results on PF-WILLOW: (PCK: 74.4%)

python evaluate.py --dataset pfwillow --backbone resnet101 --hyperpixel '(2,17,21,22,25,26,28)'

Results on Caltech-101: (LT-ACC: 0.88, IoU: 0.64)

python evaluate.py --dataset caltech --backbone resnet50 --hyperpixel '(2,7,11,12,13)'

Results on SPair-71k: (PCK: 27.2%, 28.2%)

python evaluate.py --dataset spair --backbone resnet50 --hyperpixel '(0,9,10,11,12,13)'
python evaluate.py --dataset spair --backbone resnet101 --hyperpixel '(0,8,20,21,26,28,29,30)'

To visualize predictions using TPS transformation, add command line argument --visualize:

python evaluate.py --visualize

Bibtex

If you use this code and SPair-71k dataset for your research, please consider citing:

@InProceedings{min2019hyperpixel, 
   title={Hyperpixel Flow: Semantic Correspondence with Multi-layer Neural Features},
   author={Juhong Min and Jongmin Lee and Jean Ponce and Minsu Cho},
   booktitle={ICCV},
   year={2019}
}
@article{min2019spair,
   title={SPair-71k: A Large-scale Benchmark for Semantic Correspondence},
   author={Juhong Min and Jongmin Lee and Jean Ponce and Minsu Cho},
   journal={arXiv prepreint arXiv:1908.10543},
   year={2019}
}

hpf's People

Contributors

juhongm999 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.