Coder Social home page Coder Social logo

gmalivenko / awesome-computer-vision-models Goto Github PK

View Code? Open in Web Editor NEW
504.0 26.0 92.0 105 KB

A list of popular deep learning models related to classification, segmentation and detection problems

License: Creative Commons Attribution 4.0 International

awesome-list machine-learning machine-learning-algorithms machine-learning-models computer-vision computer-vision-algorithms semantic-segmentation image-classification object-detection deep-learning

awesome-computer-vision-models's Introduction

Awesome Computer Vision Models Awesome

A curated list of popular classification, segmentation and detection models with corresponding evaluation metrics from papers.

Contents

Classification models

Model Number of parameters FLOPS Top-1 Error Top-5 Error Year
AlexNet ('One weird trick for parallelizing convolutional neural networks') 62.3M 1,132.33M 40.96 18.24 2014
VGG-16 ('Very Deep Convolutional Networks for Large-Scale Image Recognition') 138.3M ? 26.78 8.69 2014
ResNet-10 ('Deep Residual Learning for Image Recognition') 5.5M 894.04M 34.69 14.36 2015
ResNet-18 ('Deep Residual Learning for Image Recognition') 11.7M 1,820.41M 28.53 9.82 2015
ResNet-34 ('Deep Residual Learning for Image Recognition') 21.8M 3,672.68M 24.84 7.80 2015
ResNet-50 ('Deep Residual Learning for Image Recognition') 25.5M 3,877.95M 22.28 6.33 2015
InceptionV3 ('Rethinking the Inception Architecture for Computer Vision') 23.8M ? 21.2 5.6 2015
PreResNet-18 ('Identity Mappings in Deep Residual Networks') 11.7M 1,820.56M 28.43 9.72 2016
PreResNet-34 ('Identity Mappings in Deep Residual Networks') 21.8M 3,672.83M 24.89 7.74 2016
PreResNet-50 ('Identity Mappings in Deep Residual Networks') 25.6M 3,875.44M 22.40 6.47 2016
DenseNet-121 ('Densely Connected Convolutional Networks') 8.0M 2,872.13M 23.48 7.04 2016
DenseNet-161 ('Densely Connected Convolutional Networks') 28.7M 7,793.16M 22.86 6.44 2016
PyramidNet-101 ('Deep Pyramidal Residual Networks') 42.5M 8,743.54M 21.98 6.20 2016
ResNeXt-14(32x4d) ('Aggregated Residual Transformations for Deep Neural Networks') 9.5M 1,603.46M 30.32 11.46 2016
ResNeXt-26(32x4d) ('Aggregated Residual Transformations for Deep Neural Networks') 15.4M 2,488.07M 24.14 7.46 2016
WRN-50-2 ('Wide Residual Networks') 68.9M 11,405.42M 22.53 6.41 2016
Xception ('Xception: Deep Learning with Depthwise Separable Convolutions') 22,855,952 8,403.63M 20.97 5.49 2016
InceptionV4 ('Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning') 42,679,816 12,304.93M 20.64 5.29 2016
InceptionResNetV2 ('Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning') 55,843,464 13,188.64M 19.93 4.90 2016
PolyNet ('PolyNet: A Pursuit of Structural Diversity in Very Deep Networks') 95,366,600 34,821.34M 19.10 4.52 2016
DarkNet Ref ('Darknet: Open source neural networks in C') 7,319,416 367.59M 38.58 17.18 2016
DarkNet Tiny ('Darknet: Open source neural networks in C') 1,042,104 500.85M 40.74 17.84 2016
DarkNet 53 ('Darknet: Open source neural networks in C') 41,609,928 7,133.86M 21.75 5.64 2016
SqueezeResNet1.1 ('SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size') 1,235,496 352.02M 40.09 18.21 2016
SqueezeNet1.1 ('SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size') 1,235,496 352.02M 39.31 17.72 2016
ResAttNet-92 ('Residual Attention Network for Image Classification') 51.3M ? 19.5 4.8 2017
CondenseNet (G=C=8) ('CondenseNet: An Efficient DenseNet using Learned Group Convolutions') 4.8M ? 26.2 8.3 2017
DPN-68 ('Dual Path Networks') 12,611,602 2,351.84M 23.24 6.79 2017
ShuffleNet x1.0 (g=1) ('ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices') 1,531,936 148.13M 34.93 13.89 2017
DiracNetV2-18 ('DiracNets: Training Very Deep Neural Networks Without Skip-Connections') 11,511,784 1,796.62M 31.47 11.70 2017
DiracNetV2-34 ('DiracNets: Training Very Deep Neural Networks Without Skip-Connections') 21,616,232 3,646.93M 28.75 9.93 2017
SENet-16 ('Squeeze-and-Excitation Networks') 31,366,168 5,081.30M 25.65 8.20 2017
SENet-154 ('Squeeze-and-Excitation Networks') 115,088,984 20,745.78M 18.62 4.61 2017
MobileNet ('MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications') 4,231,976 579.80M 26.61 8.95 2017
NASNet-A 4@1056 ('Learning Transferable Architectures for Scalable Image Recognition') 5,289,978 584.90M 25.68 8.16 2017
NASNet-A 6@4032('Learning Transferable Architectures for Scalable Image Recognition') 88,753,150 23,976.44M 18.14 4.21 2017
DLA-34 ('Deep Layer Aggregation') 15,742,104 3,071.37M 25.36 7.94 2017
AirNet50-1x64d (r=2) ('Attention Inspiring Receptive-Fields Network for Learning Invariant Representations') 27.43M ? 22.48 6.21 2018
BAM-ResNet-50 ('BAM: Bottleneck Attention Module') 25.92M ? 23.68 6.96 2018
CBAM-ResNet-50 ('CBAM: Convolutional Block Attention Module') 28.1M ? 23.02 6.38 2018
1.0-SqNxt-23v5 ('SqueezeNext: Hardware-Aware Neural Network Design') 921,816 285.82M 40.77 17.85 2018
1.5-SqNxt-23v5 ('SqueezeNext: Hardware-Aware Neural Network Design') 1,953,616 550.97M 33.81 13.01 2018
2.0-SqNxt-23v5 ('SqueezeNext: Hardware-Aware Neural Network Design') 3,366,344 897.60M 29.63 10.66 2018
ShuffleNetV2 ('ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design') 2,278,604 149.72M 31.44 11.63 2018
456-MENet-24ร—1(g=3) ('Merging and Evolution: Improving Convolutional Neural Networks for Mobile Applications') 5.3M ? 28.4 9.8 2018
FD-MobileNet ('FD-MobileNet: Improved MobileNet with A Fast Downsampling Strategy') 2,901,288 147.46M 34.23 13.38 2018
MobileNetV2 ('MobileNetV2: Inverted Residuals and Linear Bottlenecks') 3,504,960 329.36M 26.97 8.87 2018
IGCV3 ('IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks') 3.5M ? 28.22 9.54 2018
DARTS ('DARTS: Differentiable Architecture Search') 4.9M ? 26.9 9.0 2018
PNASNet-5 ('Progressive Neural Architecture Search') 5.1M ? 25.8 8.1 2018
AmoebaNet-C ('Regularized Evolution for Image Classifier Architecture Search') 5.1M ? 24.3 7.6 2018
MnasNet ('MnasNet: Platform-Aware Neural Architecture Search for Mobile') 4,308,816 317.67M 31.58 11.74 2018
IBN-Net50-a ('Two at Once: Enhancing Learning andGeneralization Capacities via IBN-Net') ? ? 22.54 6.32 2018
MarginNet ('Large Margin Deep Networks for Classification') ? ? 22.0 ? 2018
A^2 Net ('A^2-Nets: Double Attention Networks') ? ? 23.0 6.5 2018
FishNeXt-150 ('FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction') 26.2M ? 21.5 ? 2018
Shape-ResNet ('IMAGENET-TRAINED CNNS ARE BIASED TOWARDS TEXTURE; INCREASING SHAPE BIAS IMPROVES ACCURACY AND ROBUSTNESS') 25.5M ? 23.28 6.72 2019
SimCNN(k=3 train) ('Greedy Layerwise Learning Can Scale to ImageNet') ? ? 28.4 10.2 2019
SKNet-50 ('Selective Kernel Networks') 27.5M ? 20.79 ? 2019
SRM-ResNet-50 ('SRM : A Style-based Recalibration Module for Convolutional Neural Networks') 25.62M ? 22.87 6.49 2019
EfficientNet-B0 ('EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks') 5,288,548 414.31M 24.77 7.52 2019
EfficientNet-B7b ('EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks') 66,347,960 39,010.98M 15.94 3.22 2019
ProxylessNAS ('PROXYLESSNAS: DIRECT NEURAL ARCHITECTURE SEARCH ON TARGET TASK AND HARDWARE') ? ? 24.9 7.5 2019
MixNet-L ('MixNet: Mixed Depthwise Convolutional Kernels') 7.3M ? 21.1 5.8 2019
ECA-Net50 ('ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks') 24.37M 3.86G 22.52 6.32 2019
ECA-Net101 ('ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks') 7.3M 7.35G 21.35 5.66 2019
ACNet-Densenet121 ('ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks') ? ? 24.18 7.23 2019
LIP-ResNet-50 ('LIP: Local Importance-based Pooling') 23.9M 5.33G 21.81 6.04 2019
LIP-ResNet-101 ('LIP: Local Importance-based Pooling') 42.9M 9.06G 20.67 5.40 2019
LIP-DenseNet-BC-121 ('LIP: Local Importance-based Pooling') 8.7M 4.13G 23.36 6.84 2019
MuffNet_1.0 ('MuffNet: Multi-Layer Feature Federation for Mobile Deep Learning') 2.3M 146M 30.1 ? 2019
MuffNet_1.5 ('MuffNet: Multi-Layer Feature Federation for Mobile Deep Learning') 3.4M 300M 26.9 ? 2019
ResNet-34-Bin-5 ('Making Convolutional Networks Shift-Invariant Again') 21.8M 3,672.68M 25.80 ? 2019
ResNet-50-Bin-5 ('Making Convolutional Networks Shift-Invariant Again') 25.5M 3,877.95M 22.96 ? 2019
MobileNetV2-Bin-5 ('Making Convolutional Networks Shift-Invariant Again') 3,504,960 329.36M 27.50 ? 2019
FixRes ResNeXt101 WSL ('Fixing the train-test resolution discrepancy') 829M ? 13.6 2.0 2019
Noisy Student*(L2) ('Self-training with Noisy Student improves ImageNet classification') 480M ? 12.6 1.8 2019
TResNet-M ('TResNet: High Performance GPU-Dedicated Architecture') 29.4M 5.5G 19.3 ? 2020
DA-NAS-C ('DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search') ? 467M 23.8 ? 2020
ResNeSt-50 ('ResNeSt: Split-Attention Networks') 27.5M 5.39G 18.87 ? 2020
ResNeSt-101 ('ResNeSt: Split-Attention Networks') 48.3M 10.2G 17.73 ? 2020
ResNet-50-FReLU ('Funnel Activation for Visual Recognition') 25.5M 3.87G 22.40 ? 2020
ResNet-101-FReLU ('Funnel Activation for Visual Recognition') 44.5M 7.6G 22.10 ? 2020
ResNet-50-MEALv2 ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks') 25.6M ? 19.33 4.91 2020
ResNet-50-MEALv2 + CutMix ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks') 25.6M ? 19.02 4.65 2020
MobileNet V3-Large-MEALv2 ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks') 5.48M ? 23.08 6.68 2020
EfficientNet-B0-MEALv2 ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks') 5.29M ? 21.71 6.05 2020
T2T-ViT-7 ('Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet') 4.2M 0.6G 28.8 ? 2021
T2T-ViT-14 ('Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet') 19.4M 4.8G 19.4 ? 2021
T2T-ViT-19 ('Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet') 39.0M 8.0G 18.8 ? 2021
NFNet-F0 ('High-Performance Large-Scale Image Recognition Without Normalization') 71.5M 12.38G 16.4 3.2 2021
NFNet-F1 ('High-Performance Large-Scale Image Recognition Without Normalization') 132.6M 35.54G 15.4 2.9 2021
NFNet-F6+SAM ('High-Performance Large-Scale Image Recognition Without Normalization') 438.4M 377.28G 13.5 2.1 2021
EfficientNetV2-S ('EfficientNetV2: Smaller Models and Faster Training') 24M 8.8G 16.1 ? 2021
EfficientNetV2-M ('EfficientNetV2: Smaller Models and Faster Training') 55M 24G 14.9 ? 2021
EfficientNetV2-L ('EfficientNetV2: Smaller Models and Faster Training') 121M 53G 14.3 ? 2021
EfficientNetV2-S (21k) ('EfficientNetV2: Smaller Models and Faster Training') 24M 8.8G 15.0 ? 2021
EfficientNetV2-M (21k) ('EfficientNetV2: Smaller Models and Faster Training') 55M 24G 13.9 ? 2021
EfficientNetV2-L (21k) ('EfficientNetV2: Smaller Models and Faster Training') 121M 53G 13.2 ? 2021

Segmentation models

Model Year PASCAL-Context Cityscapes (mIOU) PASCAL VOC 2012 (mIOU) COCO Stuff ADE20K VAL (mIOU)
U-Net ('U-Net: Convolutional Networks for Biomedical Image Segmentation') 2015 ? ? ? ? ?
DeconvNet ('Learning Deconvolution Network for Semantic Segmentation') 2015 ? ? 72.5 ? ?
ParseNet ('ParseNet: Looking Wider to See Better') 2015 40.4 ? 69.8 ? ?
Piecewise ('Efficient piecewise training of deep structured models for semantic segmentation') 2015 43.3 71.6 78.0 ? ?
SegNet ('SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation') 2016 ? 56.1 ? ? ?
FCN ('Fully Convolutional Networks for Semantic Segmentation') 2016 37.8 65.3 62.2 22.7 29.39
ENet ('ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation') 2016 ? 58.3 ? ? ?
DilatedNet ('MULTI-SCALE CONTEXT AGGREGATION BY DILATED CONVOLUTIONS') 2016 ? ? 67.6 ? 32.31
PixelNet ('PixelNet: Towards a General Pixel-Level Architecture') 2016 ? ? 69.8 ? ?
RefineNet ('RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation') 2016 47.3 73.6 83.4 33.6 40.70
LRR ('Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation') 2016 ? 71.8 79.3 ? ?
FRRN ('Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes') 2016 ? 71.8 ? ? ?
MultiNet ('MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving') 2016 ? ? ? ? ?
DeepLab ('DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs') 2017 45.7 64.8 79.7 ? ?
LinkNet ('LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation') 2017 ? ? ? ? ?
Tiramisu ('The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation') 2017 ? ? ? ? ?
ICNet ('ICNet for Real-Time Semantic Segmentation on High-Resolution Images') 2017 ? 70.6 ? ? ?
ERFNet ('Efficient ConvNet for Real-time Semantic Segmentation') 2017 ? 68.0 ? ? ?
PSPNet ('Pyramid Scene Parsing Network') 2017 47.8 80.2 85.4 ? 44.94
GCN ('Large Kernel Matters โ€” Improve Semantic Segmentation by Global Convolutional Network') 2017 ? 76.9 82.2 ? ?
Segaware ('Segmentation-Aware Convolutional Networks Using Local Attention Masks') 2017 ? ? 69.0 ? ?
PixelDCN ('PIXEL DECONVOLUTIONAL NETWORKS') 2017 ? ? 73.0 ? ?
DeepLabv3 ('Rethinking Atrous Convolution for Semantic Image Segmentation') 2017 ? ? 85.7 ? ?
DUC, HDC ('Understanding Convolution for Semantic Segmentation') 2018 ? 77.1 ? ? ?
ShuffleSeg ('SHUFFLESEG: REAL-TIME SEMANTIC SEGMENTATION NETWORK') 2018 ? 59.3 ? ? ?
AdaptSegNet ('Learning to Adapt Structured Output Space for Semantic Segmentation') 2018 ? 46.7 ? ? ?
TuSimple-DUC ('Understanding Convolution for Semantic Segmentation') 2018 80.1 ? 83.1 ? ?
R2U-Net ('Recurrent Residual Convolutional Neural Network based on U-Net (R2U-Net) for Medical Image Segmentation') 2018 ? ? ? ? ?
Attention U-Net ('Attention U-Net: Learning Where to Look for the Pancreas') 2018 ? ? ? ? ?
DANet ('Dual Attention Network for Scene Segmentation') 2018 52.6 81.5 ? 39.7 ?
ENCNet ('Context Encoding for Semantic Segmentation') 2018 51.7 75.8 85.9 ? 44.65
ShelfNet ('ShelfNet for Real-time Semantic Segmentation') 2018 48.4 75.8 84.2 ? ?
LadderNet ('LADDERNET: MULTI-PATH NETWORKS BASED ON U-NET FOR MEDICAL IMAGE SEGMENTATION') 2018 ? ? ? ? ?
CCC-ERFnet ('Concentrated-Comprehensive Convolutions for lightweight semantic segmentation') 2018 ? 69.01 ? ? ?
DifNet-101 ('DifNet: Semantic Segmentation by Diffusion Networks') 2018 45.1 ? 73.2 ? ?
BiSeNet(Res18) ('BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation') 2018 ? ? 74.7 28.1 ?
ESPNet ('ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation') 2018 ? ? 63.01 ? ?
SPADE ('Semantic Image Synthesis with Spatially-Adaptive Normalization') 2019 ? 62.3 ? 37.4 38.5
SeamlessSeg ('Seamless Scene Segmentation') 2019 ? 77.5 ? ? ?
EMANet ('Expectation-Maximization Attention Networks for Semantic Segmentation') 2019 ? ? 88.2 39.9 ?

Detection models

Model Year VOC07 (mAP@IoU=0.5) VOC12 (mAP@IoU=0.5) COCO (mAP)
R-CNN ('Rich feature hierarchies for accurate object detection and semantic segmentation') 2014 58.5 ? ?
OverFeat ('OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks') 2014 ? ? ?
MultiBox ('Scalable Object Detection using Deep Neural Networks') 2014 29.0 ? ?
SPP-Net ('Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition') 2014 59.2 ? ?
MR-CNN ('Object detection via a multi-region & semantic segmentation-aware CNN model') 2015 78.2 73.9 ?
AttentionNet ('AttentionNet: Aggregating Weak Directions for Accurate Object Detection') 2015 ? ? ?
Fast R-CNN ('Fast R-CNN') 2015 70.0 68.4 ?
Fast R-CNN ('Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks') 2015 73.2 70.4 36.8
YOLO v1 ('You Only Look Once: Unified, Real-Time Object Detection') 2016 66.4 57.9 ?
G-CNN ('G-CNN: an Iterative Grid Based Object Detector') 2016 66.8 66.4 ?
AZNet ('Adaptive Object Detection Using Adjacency and Zoom Prediction') 2016 70.4 ? 22.3
ION ('Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks') 2016 80.1 77.9 33.1
HyperNet ('HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection') 2016 76.3 71.4 ?
OHEM ('Training Region-based Object Detectors with Online Hard Example Mining') 2016 78.9 76.3 22.4
MPN ('A MultiPath Network for Object Detection') 2016 ? ? 33.2
SSD ('SSD: Single Shot MultiBox Detector') 2016 76.8 74.9 31.2
GBDNet ('Crafting GBD-Net for Object Detection') 2016 77.2 ? 27.0
CPF ('Contextual Priming and Feedback for Faster R-CNN') 2016 76.4 72.6 ?
MS-CNN ('A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection') 2016 ? ? ?
R-FCN ('R-FCN: Object Detection via Region-based Fully Convolutional Networks') 2016 79.5 77.6 29.9
PVANET ('PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection') 2016 ? ? ?
DeepID-Net ('DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection') 2016 69.0 ? ?
NoC ('Object Detection Networks on Convolutional Feature Maps') 2016 71.6 68.8 27.2
DSSD ('DSSD : Deconvolutional Single Shot Detector') 2017 81.5 80.0 ?
TDM ('Beyond Skip Connections: Top-Down Modulation for Object Detection') 2017 ? ? 37.3
FPN ('Feature Pyramid Networks for Object Detection') 2017 ? ? 36.2
YOLO v2 ('YOLO9000: Better, Faster, Stronger') 2017 78.6 73.4 21.6
RON ('RON: Reverse Connection with Objectness Prior Networks for Object Detection') 2017 77.6 75.4 ?
DCN ('Deformable Convolutional Networks') 2017 ? ? ?
DeNet ('DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling') 2017 77.1 73.9 33.8
CoupleNet ('CoupleNet: Coupling Global Structure with Local Parts for Object Detection') 2017 82.7 80.4 34.4
RetinaNet ('Focal Loss for Dense Object Detection') 2017 ? ? 39.1
Mask R-CNN ('Mask R-CNN') 2017 ? ? 39.8
DSOD ('DSOD: Learning Deeply Supervised Object Detectors from Scratch') 2017 77.7 76.3 ?
SMN ('Spatial Memory for Context Reasoning in Object Detection') 2017 70.0 ? ?
YOLO v3 ('YOLOv3: An Incremental Improvement') 2018 ? ? 33.0
SIN ('Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships') 2018 76.0 73.1 23.2
STDN ('Scale-Transferrable Object Detection') 2018 80.9 ? ?
RefineDet ('Single-Shot Refinement Neural Network for Object Detection') 2018 83.8 83.5 41.8
MegDet ('MegDet: A Large Mini-Batch Object Detector') 2018 ? ? ?
RFBNet ('Receptive Field Block Net for Accurate and Fast Object Detection') 2018 82.2 ? ?
CornerNet ('CornerNet: Detecting Objects as Paired Keypoints') 2018 ? ? 42.1
LibraRetinaNet ('Libra R-CNN: Towards Balanced Learning for Object Detection') 2019 ? ? 43.0
YOLACT-700 ('YOLACT Real-time Instance Segmentation') 2019 ? ? 31.2
DetNASNet(3.8) ('DetNAS: Backbone Search for Object Detection') 2019 ? ? 42.0
YOLOv4 ('YOLOv4: Optimal Speed and Accuracy of Object Detection') 2020 ? ? 46.7
SOLO ('SOLO: Segmenting Objects by Locations') 2020 ? ? 37.8
D-SOLO ('SOLO: Segmenting Objects by Locations') 2020 ? ? 40.5
SNIPER ('Scale Normalized Image Pyramids with AutoFocus for Object Detection') 2021 86.6 ? 47.9
AutoFocus ('Scale Normalized Image Pyramids with AutoFocus for Object Detection') 2021 85.8 ? 47.9

awesome-computer-vision-models's People

Contributors

gmalivenko avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.