Coder Social home page Coder Social logo

wacv2021's Introduction

Computer Vision in Practice - Building an End-to-End Pipeline for Object Segmentation

This repo hosts the resources for the Computer Vision Workshop presented at WACV 2021.

Segmentation Visualized

Slides

You can find our overview slides and our Chapter 1 Tutorial Slides linked in this line.

Presenters

  • Yuanbo (Cody) Wang
  • Osama Sakhi
  • Matthew Hagen
  • Ala Eddine Ayadi

Downloading the Repository

You will need the Git client in a terminal to download all the packages and run the following command in your terminal:

git clone --recursive https://github.com/mosdragon/wacv2021.git

Tutorial

This repo contains all resources used for the workshop, including our hands-on Python-based tutorial.

In this repo, you'll find two versions of our tutorial:

  • Jupyter notebook-only version: This version requires a working Jupyter notebook installation and a CUDA-enabled GPU to run training and inference. This version can be found in the tutorial directory, with each notebook under a different directory in the form Chapter-x.
  • Google Colab Version: This version runs on Google Colab, an online notebook hosted by Google with pre-installed packages and access to a CUDA-enabled GPU. This GPU will allow you to run our training and post-processing code without needing a GPU of your own. This can be found in colab_notebook.

Directory Structure

Here's the layout of our project. When you download the repository, your directory structure will look exactly like this until you download and generate new datasets.

.
├── README.md
├── datasets
│   ├── README.md
├── site
│   ├── Makefile
│   ├── css
│   ├── img
│   ├── index.html
│   ├── js
│   └── sass
└── tutorial
    ├── Chapter-1-Introduction
    ├── Chapter-2-Preprocessing
    ├── Chapter-3-Training-Segmentation-Models
    ├── Chapter-4-Postprocessing
    ├── Chapter-5-Mobile
    ├── Colab_Chapters_3_and_4
    └── README.md

Downloading Datasets

To run through Chapter 2, which generates the COCO and VOC-formatted datasets from the original ADE20K dataset, you'll need to first download the full ADE20K dataset. You can do so by running the following:

cd datasets
wget https://groups.csail.mit.edu/vision/datasets/ADE20K/ADE20K_2016_07_26.zip
unzip -qq ADE20K_2016_07_26.zip

If you want to skip Chapter 2 altogether and just move onto training and post-processing, you can download the generated datasets by running the following:

cd datasets
wget https://storage.googleapis.com/kdd2020hdvisai/static/datasets/bedroom_scenes_coco_final.zip
wget https://storage.googleapis.com/kdd2020hdvisai/static/datasets/bedroom_scenes_voc.zip

unzip -qq bedroom_scenes_coco_final.zip
unzip -qq bedroom_scenes_voc.zip

NOTE: You do not need to run through this step if you're using the Google Colab version of the tutorial, as that version will download the dataset for you as part of the notebook initialization.


wacv2021's People

Contributors

mcdy143 avatar mosdragon avatar ayadiala avatar

Stargazers

 avatar Alexey Sidnev avatar zhiwen avatar  avatar Apoorv Khattar avatar Marcus Valtonen Örnhag avatar Fangwen Shu avatar

Watchers

James Cloos avatar  avatar  avatar  avatar

Forkers

nikky4d

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.