本贴是对 CVPR2021 已接受论文的粗略汇总,后期会有更详细的总结。期待ing......
官网链接:http://cvpr2021.thecvf.com
开会时间:2021年6月19日-6月25日
论文接收公布时间:2021年2月28日
接收论文IDs:
🎆🎆🎆更新提示:3月4日新增 33 篇(2目标检测+3点云+1半监督+1医学+5分割+1域泛化+1人脸+1视图合成+16D位姿+2分类+1跟踪+1图像增强+1GAN+1GNN+1图像字幕+1三维+1相机定位+8未分)
- [Vab-AL: Incorporating Class Imbalance and Difficulty with Variational Bayes for Active Learning]
- Learning the Superpixel in a Non-iterative and Lifelong Manner
- Rainbow Memory: Continual Learning with a Memory of Diverse Samples
- Coarse-Fine Networks for Temporal Activity Detection in Videos
- 3D CNNs with Adaptive Temporal Feature Resolutions
- Improving Unsupervised Image Clustering With Robust Learning
⭐code
利用鲁棒学习改进无监督图像聚类技术
- PML: Progressive Margin Loss for Long-tailed Age Classification
- Re-labeling ImageNet: from Single to Multi-Labels, from Global to Localized Labels
⭐code - Fine-grained Angular Contrastive Learning with Coarse Labels
😮oral
使用自监督进行 Coarse Labels(粗标签)的细粒度分类方面的工作。粗标签与细粒度标签相比,更容易和更便宜,因为细粒度标签通常需要域专家。
- Counterfactual Zero-Shot and Open-Set Visual Recognition
⭐code - Few-shot Open-set Recognition by Transformation Consistency
- PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers
📺video
通过消除 location-dependent 透视效果来改进3D人体姿势估计技术工作。 - CanonPose: Self-supervised Monocular 3D Human Pose Estimation in the Wild
- Densely connected multidilated convolutional networks for dense prediction tasks
提出的D3Net在语义分割&音乐源分离任务上的表现优于SOTA网络
- A Deep Emulator for Secondary Motion of 3D Characters
- Neural Deformation Graphs for Globally-consistent Non-rigid Reconstruction
😮oral🏠project📺video
- Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
⭐github
ECCV 2020 Facebook Mapillary Visual Place Recognition Challenge 冠军方案
- 3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management
用纯多模态 CT 影像可替代目前 JHMI 的需要做肿瘤化学检测和 DNA 测序+医学影像的综合多模态诊断流程,从诊断准确度上有可比较性,定量诊断精度更优 - Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies
肿瘤影像里面智能 PACS 辅助医生读片的重要功能 - Automatic Vertebra Localization and Identification in CT by Spine Rectification and Anatomically-constrained Optimization
基于CT 影像的骨折/骨质疏松系统 - Multi-institutional Collaborations for Improving Deep Learning-based Magnetic Resonance Image Reconstruction Using Federated Learning
⭐code
多机构合作,利用联合学习改进基于深度学习的磁共振图像重建技术
- Transformer Interpretability Beyond Attention Visualization
⭐code - UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
- Pre-Trained Image Processing Transformer
- 3D Vision Transformers for Action Recognition
用于动作识别的3D视觉Transformer - MIST: Multiple Instance Spatial Transformer Network
试图从热图中进行可微的top-K选择(MIST)(目前在自然图像上也有了一些结果;) 用它可以在没有任何定位监督的情况下进行检测和分类(并不是它唯一能做的事情!)
- Learning Student Networks in the Wild
⭐code - Rethinking Channel Dimensions for Efficient Model Design
⭐code - Manifold Regularized Dynamic Network Pruning
- RepVGG: Making VGG-style ConvNets Great Again
⭐code
- Dogfight: Detecting Drones from Drone Videos
- PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation
- Data-Free Knowledge Distillation For Image Super-Resolution
- AdderSR: Towards Energy Efficient Image Super-Resolution
⭐code
- Weakly-supervised Grounded Visual Question Answering using Capsules
- Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing
- Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
⭐code🏠project - Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs
- Image-to-image Translation via Hierarchical Style Disentanglement
⭐code - Efficient Conditional GAN Transfer with Knowledge Propagation across Classes
⭐code
- Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning
- Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning
- FSDR: Frequency Space Domain Randomization for Domain Generalization
受 JPEG 将空间图像转换为多个频率分量(FCs)的启发,提出频率空间域随机化(FSDR),通过保留域变量FCs(DIFs)和只随机化域变量FCs(DVFs)来随机化频率空间的图像。 - Domain Generalization via Inference-time Label-Preserving Target Projections
😮 Oral
- Multi-Stage Progressive Image Restoration
⭐code - Auto-Exposure Fusion for Single-Image Shadow Removal
⭐code - DeFMO: Deblurring and Shape Recovery of Fast Moving Objects
⭐code📺video
- A 3D GAN for Improved Large-pose Facial Recognition
- When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework
⭐github - Multi-attentional Deepfake Detection
- AttentiveNAS: Improving Neural Architecture Search via Attentive
- HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens
- ReNAS: Relativistic Evaluation of Neural Architecture Search
- Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking
- Rotation Equivariant Siamese Networks for Tracking
- Track to Detect and Segment: An Online Multi-Object Tracker
🏠project📺video
- 4D Panoptic LiDAR Segmentation
- PLOP: Learning without Forgetting for Continual Semantic Segmentation
- Cross-View Regularization for Domain Adaptive Panoptic Segmentation
😮oral
用于域自适应全景分割的跨视图正则化方法 - Information-Theoretic Segmentation by Inpainting Error Maximization
- Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges
⭐dataset📺video - Exploring Data Efficient 3D Scene Understanding with Contrastive Scene Contexts
😮oral🏠project📺video - Real-Time High Resolution Background Matting
😮oral⭐code🏠project📺video
最新开源抠图技术,实时快速高分辨率,4k(30fps)、现代GPU(60fps)
解读:单块GPU实现4K分辨率每秒30帧,华盛顿大学实时视频抠图再升级,毛发细节到位
最新开源抠图技术,实时快速高分辨率,4k(30fps)、现代GPU(60fps) - Part-aware Panoptic Segmentation
- Multiple Instance Active Learning for Object Detection
⭐code - Positive-Unlabeled Data Purification in the Wild for Object Detection
- Depth from Camera Motion and Object Detection
⭐github📺video - There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
🏠project - Categorical Depth Distribution Network for Monocular 3D Object Detection
- Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection
首个研究少样本检测任务的语义关系推理,并证明它可提升强基线的潜力。 - Towards Open World Object Detection
😮oral⭐code - General Instance Distillation for Object Detection
- Distilling Object Detectors via Decoupled Features
- 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection
😮oral⭐code🏠project📺video
更多:CVPR 2021|利用IoU预测进行半监督式3D目标检测
- Weakly Supervised Learning of Rigid 3D Scene Flow
⭐code🏠project - Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
⭐code
- PREDATOR: Registration of 3D Point Clouds with Low Overlap
😮oral⭐code🏠project - Diffusion Probabilistic Models for 3D Point Cloud Generation
⭐code - Style-based Point Generator with Adversarial Rendering for Point Cloud Completion
- SpinNet: Learning a General Surface Descriptor for 3D Point Cloud Registration
⭐code - MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization
😮oral⭐code
- Sequential Graph Convolutional Network for Active Learning
- Quantifying Explainers of Graph Neural Networks in Computational Pathology
- Inverting the Inherence of Convolution for Visual Recognition
- Representative Batch Normalization with Feature Calibration
- UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pretraining
- Reconsidering Representation Alignment for Multi-view Clustering
- Self-supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map
- Instance Localization for Self-supervised Detection Pretraining
⭐code - Model-Contrastive Federated Learning
提出模型对比学习来解决联合学习中的非IID数据问题 - Neural Geometric Level of Detail:Real-time Rendering with Implicit 3D Surfaces
😮Oral⭐code🏠project - Data-Free Model Extraction
⭐code - Single-Stage Instance Shadow Detection with Bidirectional Relation Learning
😮oral - Continual Adaptation of Visual Representations via Domain Randomization and Meta-learning
😮oral - PatchmatchNet: Learned Multi-View Patchmatch Stereo
😮oral⭐code - [Online Bag-of-Visual-Words Generation for Unsupervised Representation Learning]
- [Semantic Palette: Guiding Scene Generation with Class Proportions]
-
Visual Perception for Navigation in Human Environments
第二届人类环境导航视觉感知征稿⚠️ 4月15截止 -
UG 2 + Challenge
旨在通过应用图像恢复和增强算法提高分析性能,推动对 "difficult"图像的分析。参与者任务是开发新的算法,以改进对在问题条件下拍摄的图像分析。
👑10K美元奖金- 低能见度环境下的目标检测
- 雾霾条件下的(半)监督目标检测
- (半)低光条件下的人脸检测
- 黑暗视频中的动作识别
- 黑暗中进行完全监督动作识别
- 黑暗中进行半监督动作识别
- 低能见度环境下的目标检测
-
Continual Learning in Computer Vision 征稿中
旨在聚集学术界和工业界的研究人员和工程师,讨论持续学习的最新进展。- Best paper award: 500 USD + 500 USD worth of Huawei cloud credits (HUAWEI)
- Overall Challenge winner: 1,000 USD + 500 USD worth of Huawei cloud credits (HUAWEI)
- Supervised-Learning track winner: 500 USD (HUAWEI)
- Reinforcement-Learning track winner: 500 USD (ServiceNow)
-
Responsible Computer Vision
⚠️ 3月25日截止
本次研讨会将广泛讨论计算机视觉背景下负责任的人工智能的三个主要方面:公平性;可解释性和透明度;以及隐私。