Coder Social home page Coder Social logo

objectdetectionimbalance's Introduction

A Repository of the Papers Addressing Imbalance Problems in Object Detection

This repository provides an up-to-date the list of studies addressing imbalance problems in object detection. It follows the taxonomy provided in the following paper (please cite the paper if you benefit from this repository):

K. Oksuz, B. C. Cam, S. Kalkan, E. Akbas, "Imbalance Problems in Object Detection: A Review", Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.[preprint]

BibTeX entry:

@ARTICLE{imbalance,
       author = {Kemal Oksuz and Baris Can Cam and Sinan Kalkan and Emre Akbas},
        title = "{Imbalance Problems in Object Detection: A Review}",
      journal = {Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
         year = "2020",
        pages = {1-1}
        }

How to request addition of a paper

If you know of a paper that addresses an imbalance problem concerning generic object detection and is not on this repository, you are welcome to request the addition of that paper by submitting a pull request. In your pull request please briefly state which section of your paper is related to which problem.

Following the methodology in our paper, the papers should be designed for the generic object detection problem (i.e. reporting results on generic object detection datasets such as ILSVRC, Pascal VOC, MS-COCO, Open Images, Objects 365 etc.).

Table of Contents (Follows the taxonomy in the paper)

  1. Class Imbalance
    1.1 Foreground-Background Class Imbalance
    1.2 Foreground-Foreground Class Imbalance
  2. Scale Imbalance
    2.1 Object/box-level Scale Imbalance
    2.2 Feature-level Imbalance
  3. Spatial Imbalance
    3.1 Imbalance in Regression Loss
    3.2 IoU Distribution Imbalance
    3.3 Object Location Imbalance
  4. Objective Imbalance

1. Class Imbalance

1.1. Foreground-Background Class Imbalance

  • Hard Sampling Methods
    • Random Sampling
    • Hard Example Mining
      • Bootstrapping, NeurIPS 1996, [paper]
      • SSD, ECCV 2016, [paper]
      • Online Hard Example Mining, CVPR 2016, [paper]
      • IoU-based Sampling, CVPR 2019, [paper]
      • Overlap Sampler, WACV 2020, [paper]
    • Limit Search Space
      • Two-stage Object Detectors
      • IoU-lower Bound, ICCV 2015, [paper]
      • Objectness Prior, CVPR 2017, [paper]
      • Negative Anchor Filtering, CVPR 2018, [paper]
      • Objectness Module, ICCV 2019, [paper]
  • Soft Sampling Methods
    • Focal Loss, ICCV 2017, [paper]
    • Gradient Harmonizing Mechanism, AAAI 2019, [paper]
    • Prime Sample Attention, CVPR 2020, [paper]
    • Unified Sample Weighting Network, CVPR 2020, [paper]
  • Sampling-Free Methods
    • Is Sampling Heuristics Necessary in Training Deep Object Detectors?, arXiv 2019, [paper]
    • Residual Objectness for Imbalance Reduction, arXiv 2019, [paper]
    • AP Loss, CVPR 2019, [paper]
    • DR Loss, CVPR 2020, [paper]
  • Generative Methods
    • Adversarial Faster-RCNN, CVPR 2017, [paper]
    • Task Aware Data Synthesis, CVPR 2019, [paper]
    • PSIS, arXiv 2019, [paper]
    • pRoI Generator, WACV 2020, [paper]

1.2. Foreground-Foreground Class Imbalance

  • Fine-tuning Long Tail Distribution for Obj.Det., CVPR 2016, [paper]
  • PSIS, arXiv 2019, [paper]
  • OFB Sampling, WACV 2020, [paper]
  • Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels, CVPR 2020, [paper]
  • Balanced Group Softmax Loss, CVPR 2020, [paper]

2. Scale Imbalance

2.1. Object/box-level Scale Imbalance

  • Methods Predicting from the Feature Hierarchy of Backbone Features

    • Scale-dependent Pooling, CVPR 2016, [paper]
    • SSD, ECCV 2016, [paper]
    • Multi Scale CNN, ECCV 2016, [paper]
    • Scale Aware Fast R-CNN, IEEE Transactions on Multimedia, 2018 [paper]
  • Methods Based on Feature Pyramids

    • FPN, CVPR 2017, [paper]
    • See feature-level imbalance methods
  • Methods Based on Image Pyramids

  • Methods Combining Image and Feature Pyramids

    • Efficient Featurized Image Pyramids, CVPR 2019, [paper]
    • Enriched Feature Guided Refinement Network, ICCV 2019, [paper]
    • Super-Resolution for Small Objects, ICCV 2019, [paper]
    • Scale Aware Trident Network, ICCV 2019, [paper]

2.2. Feature-level Imbalance

  • Methods Using Pyramidal Features as a Basis

  • Methods Using Backbone Features as a Basis

3. Spatial Imbalance

3.1. Imbalance in Regression Loss

  • Lp norm based

    • Smooth L1, ICCV 2015, [paper]
    • Balanced L1, CVPR 2019, [paper]
    • KL Loss, CVPR 2019, [paper]
    • Gradient Harmonizing Mechanism, AAAI 2019, [paper]
  • IoU based

    • IoU Loss, ACM IMM 2016, [paper]
    • Bounded IoU Loss, CVPR 2018, [paper]
    • Generalized IoU Loss, CVPR 2019, [paper]
    • Distance IoU Loss, AAAI 2020, [paper]
    • Complete IoU Loss, AAAI 2020, [paper]
  • Other Approaches

    • Offset Bin Classification Network, CVPR 2020, [paper]

3.2. IoU Distribution Imbalance

  • Cascade R-CNN, CVPR 2018, [paper]
  • Hierarchical Shot Detector, ICCV 2019, [paper]
  • IoU-uniform R-CNN, arXiv 2019, [paper]
  • pRoI Generator, WACV 2020, [paper]

3.3. Object Location Imbalance

  • Guided Anchoring, CVPR 2019, [paper]
  • FreeAnchor, NeurIPS 2019, [paper]

4. Objective Imbalance

  • Task Weighting
  • Classification Aware Regression Loss, CVPR 2020, [paper]
  • Guided Loss, arXiv 2019, [paper]

Contact

Please contact Kemal Öksüz ([email protected]) for your questions about this webpage.

objectdetectionimbalance's People

Contributors

cancam avatar eakbas avatar sinankalkan avatar chenjoya avatar pedromlsreis avatar

Watchers

James Cloos avatar  avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.