Coder Social home page Coder Social logo

replicate-cct's Introduction

Semi-Supervised Semantic Segmentation with Cross-Consistency Training (CCT)

https://github.com/yassouali/CCT

In this project, I try to improve the result of the above original paper by applying multiple methods like GAN, weakly supervised learning, and trying multiple backbones. You can learn more about the work in the poster.

Requirement

You can create a new conda environment first. Then inside that environment: First, you need to install the compatible pytorch and torchvision version following the offical instruction at https://pytorch.org/ Then, you need to install modules in the requirements.txt file by running: pip install -r requirements.txt

Dataset preparing

The PASCAL VOC dataset can be downloaded at http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar You create a folder name VOCtrainval_11-May-2012 in the CCT folder, and extract the downloaded .tar file in the folder VOCtrainval_11-May-2012

Then, you have to download data label at https://www.dropbox.com/s/oeu149j8qtbs1x0/SegmentationClassAug.zip?dl=0. Extract the downloaded zip file, then move them to the path VOCtrainval_11-May-2012/VOCdevkit/VOC2012

Reproduce our best improvement method

Make sure you activate the created environment.

In your terminal, navigate to CCT folder, then run the following command: python3 CCT_GAN_WSSL.py

There is also a notebook CCT_GAN_WSSL.ipynb. You can install jupyter notebook in the conda environment to run this notebook.

Other Encoder backbone

You may also try different backbones implemented for our project. First, uncomment every commented line in the forward function of file model.py Then comment/uncomment out the lines in some files to try poolformer-m36 and convnext-base-ink22 architectures to produce better results. Please follow the instructions in encoder.py comments (line 58-80) to implement the architecture change correctly.

You should see the best validation mIoU at 73.5 at epoch(4) or after the program finishes running

replicate-cct's People

Contributors

kietngt00 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.