amusi / iccv2023-papers-with-code Goto Github PK

View Code? Open in Web Editor NEW

2.5K 39.0 247.0 104 KB

ICCV 2023 论文和开源项目合集

iccv iccv2021 object-detection computer-vision artificial-intelligence semantic-segmentation transformer iccv2023

iccv2023-papers-with-code's Introduction

ICCV2023-Papers-with-Code

ICCV 2023 论文和开源项目合集(papers with code)！

2160 papers accepted！

ICCV 2023 收录论文IDs：https://t.co/A0mCH8gbOi

注1：欢迎各位大佬提交issue，分享ICCV 2023论文和开源项目！

注2：关于往年CV顶会论文以及其他优质CV论文和大盘点，详见： https://github.com/amusi/daily-paper-computer-vision

ICCV 2021

如果你想了解最新最优质的的CV论文、开源项目和学习资料，欢迎扫码加入【CVer学术交流群】！互相学习，一起进步~

【ICCV 2023 论文开源目录】

Backbone
CLIP
MAE
GAN
GNN
MLP
NAS
OCR
NeRF
DETR
Prompt
Diffusion Models(扩散模型)
Prompt
Avatars
ReID(重识别)
长尾分布(Long-Tail)
Vision Transformer
视觉和语言(Vision-Language)
自监督学习(Self-supervised Learning)
数据增强(Data Augmentation)
目标检测(Object Detection)
目标跟踪(Visual Tracking)
语义分割(Semantic Segmentation)
实例分割(Instance Segmentation)
全景分割(Panoptic Segmentation)
医学图像分类(Medical Image Classfication)
医学图像分割(Medical Image Segmentation)
视频目标分割(Video Object Segmentation)
视频实例分割(Video Instance Segmentation)
参考图像分割(Referring Image Segmentation)
图像抠图(Image Matting)
Low-level Vision
超分辨率(Super-Resolution)
去噪(Denoising)
去模糊(Deblur)
3D点云(3D Point Cloud)
3D目标检测(3D Object Detection)
3D语义分割(3D Semantic Segmentation)
3D目标跟踪(3D Object Tracking)
3D语义场景补全(3D Semantic Scene Completion)
3D配准(3D Registration)
3D人体姿态估计(3D Human Pose Estimation)
3D人体Mesh估计(3D Human Mesh Estimation)
医学图像(Medical Image)
图像生成(Image Generation)
视频生成(Video Generation)
图像编辑(Image Editing)
视频编辑(Video Editing)
视频理解(Video Understanding)
人体运动生成(Human Motion Generation)
低光照图像增强(Low-light Image Enhancement)
场景文本识别(Scene Text Recognition)
图像检索(Image Retrieval)
图像融合(Image Fusion)
轨迹预测(Trajectory Prediction)
人群计数(Crowd Counting)
Video Quality Assessment(视频质量评价)
其它(Others)

Avatars

Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control

Paper: https://arxiv.org/abs/2303.17606

Code: https://github.com/songrise/AvatarCraft

Backbone

Rethinking Mobile Block for Efficient Attention-based Models

Paper: https://arxiv.org/abs/2301.01146
Code: https://github.com/zhangzjn/EMO

CLIP

PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization

Paper: https://arxiv.org/abs/2307.15199
Code: https://PromptStyler.github.io/

CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation

Paper: https://arxiv.org/abs/2308.15226
Code: http://www.github.com/devaansh100/CLIPTrans

NeRF

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

Homepage: https://zju3dv.github.io/intrinsic_nerf/
Paper: https://arxiv.org/abs/2210.00647
Code: https://github.com/zju3dv/IntrinsicNeRF

Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control

Paper: https://arxiv.org/abs/2303.17606
Code: https://github.com/songrise/AvatarCraft

FlipNeRF: Flipped Reflection Rays for Few-shot Novel View Synthesis

Homepage: https://shawn615.github.io/flipnerf/
Code: https://github.com/shawn615/FlipNeRF
Paper: https://arxiv.org/abs/2306.17723

Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields

Homepage: https://wbhu.github.io/projects/Tri-MipRF
Paper: https://arxiv.org/abs/2307.11335
Code: https://github.com/wbhu/Tri-MipRF

Diffusion Models(扩散模型)

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

Paper: https://arxiv.org/abs/2306.15667
Code: https://github.com/facebookresearch/PoseDiffusion

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model

Paper: https://arxiv.org/abs/2303.09833
Code: https://github.com/vvictoryuki/FreeDoM

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Paper: https://arxiv.org/abs/2307.10816
Code: https://github.com/Sierkinhane/BoxDiff

BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction

Paper: https://arxiv.org/abs/2211.14304
Code: https://github.com/BarqueroGerman/BeLFusion

DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion

Paper: https://arxiv.org/abs/2303.06840
Code: https://github.com/Zhaozixiang1228/MMIF-DDFM

DIRE for Diffusion-Generated Image Detection

Paper: https://arxiv.org/abs/2303.09295
Code: https://github.com/ZhendongWang6/DIRE

Prompt

Read-only Prompt Optimization for Vision-Language Few-shot Learning

Paper: https://arxiv.org/abs/2308.14960
Code: https://github.com/mlvlab/RPO

Introducing Language Guidance in Prompt-based Continual Learning

Paper: https://arxiv.org/abs/2308.15827
Code: None

视觉和语言(Vision-Language)

Read-only Prompt Optimization for Vision-Language Few-shot Learning

Paper: https://arxiv.org/abs/2308.14960
Code: https://github.com/mlvlab/RPO

目标检测(Object Detection)

Femtodet: an object detection baseline for energy versus performance tradeoffs

Paper: https://arxiv.org/abs/2301.06719
Code: https://github.com/yh-pengtu/FemtoDet

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

Paper: https://arxiv.org/abs/2207.13085
Code: https://github.com/Atten4Vis/GroupDETR

Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection

Paper: https://arxiv.org/abs/2205.09613
Code: https://github.com/LiewFeng/imTED

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation

Paper: https://arxiv.org/abs/2308.09242
Code: https://github.com/iSEE-Laboratory/ASAG

目标跟踪(Visual Tracking)

Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers

语义分割(Semantic Segmentation)

Segment Anything

Homepage: https://segment-anything.com/
Paper: https://arxiv.org/abs/2304.02643
Code: https://github.com/facebookresearch/segment-anything

MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation

Paper: https://arxiv.org/abs/2304.09913
Code: https://github.com/shjo-april/MARS

FreeCOS: Self-Supervised Learning from Fractals and Unlabeled Images for Curvilinear Object Segmentation

Paper: https://arxiv.org/abs/2307.07245
Code: https://github.com/TY-Shi/FreeCOS

Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation

Paper: https://arxiv.org/abs/2211.14512
Code: https://github.com/yyliu01

Disentangle then Parse:Night-time Semantic Segmentation with Illumination Disentanglement

Paper: https://arxiv.org/abs/2307.09362
Code: https://github.com/w1oves/DTP

视频目标分割(Video Object Segmentation)

Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus

Paper: https://arxiv.org/abs/2207.01203
Code: https://github.com/lxa9867/R2VOS

视频实例分割(Video Instance Segmentation)

DVIS: Decoupled Video Instance Segmentation Framework

Paper: https://arxiv.org/abs/2306.03413
Code: https://github.com/zhang-tao-whu/DVIS

医学图像分类

BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification

Paper: https://arxiv.org/abs/2203.01937
Code: https://github.com/cyh-0/BoMD

医学图像分割

CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection

Paper: https://arxiv.org/abs/2301.00785
Code: https://github.com/ljwztc/CLIP-Driven-Universal-Model

Low-level Vision

Self-supervised Learning to Bring Dual Reversed Rolling Shutter Images Alive

Paper: https://arxiv.org/abs/2305.19862
Code: https://github.com/shangwei5/SelfDRSC

超分辨率(Super-Resolution)

Spherical Space Feature Decomposition for Guided Depth Map Super-Resolution.

Paper: https://arxiv.org/abs/2303.08942
Code: https://github.com/Zhaozixiang1228/GDSR-SSDNet

3D点云(3D Point Cloud)

Robo3D: Towards Robust and Reliable 3D Perception against Corruptions

Homepage: https://ldkong.com/Robo3D
Paper: https://arxiv.org/abs/2303.17597
Code: https://github.com/ldkong1205/Robo3D

Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models

Paper: https://arxiv.org/abs/2304.07221
Code: https://github.com/zyh16143998882/ICCV23-IDPT

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos

Paper: https://arxiv.org/abs/2308.09247
Code: None

3D目标检测(3D Object Detection)

PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Paper: https://arxiv.org/abs/2206.01256
Code: https://github.com/megvii-research/PETR

DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection

Paper: https://arxiv.org/abs/2304.13031
Code: https://github.com/AIR-DISCOVER/DQS3D

SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection

Paper: https://arxiv.org/abs/2304.14340
Code: https://github.com/yichen928/SparseFusion

StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Paper: https://arxiv.org/abs/2303.11926
Code: https://github.com/exiawsh/StreamPETR.git

Cross Modal Transformer: Towards Fast and Robust 3D Object Detection

Paper: https://arxiv.org/abs/2301.01283
Code: https://github.com/junjie18/CMT.git

MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation

Paper: https://arxiv.org/abs/2304.09801
Project: https://chongjiange.github.io/metabev.html
Code: https://github.com/ChongjianGE/MetaBEV

Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling

Paper: https://arxiv.org/abs/2307.07944
Code: https://github.com/zhuoxiao-chen/ReDB-DA-3Ddet

SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection

Paper: https://arxiv.org/abs/2307.11477
Code: https://github.com/mengtan00/SA-BEV

3D语义分割(3D Semantic Segmentation)

Rethinking Range View Representation for LiDAR Segmentation

Homepage: https://ldkong.com/RangeFormer
Paper: https://arxiv.org/abs/2303.05367
Code: None

3D目标跟踪(3D Object Tracking)

MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors

Paper: https://arxiv.org/abs/2303.05071
Code : https://github.com/slothfulxtx/MBPTrack3D

视频理解(Video Understanding)

Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Paper: https://arxiv.org/abs/2303.16058
Code: https://github.com/OpenGVLab/unmasked_teacher

图像生成(Image Generation)

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model

Paper: https://arxiv.org/abs/2303.09833
Code: https://github.com/vvictoryuki/FreeDoM

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Paper: https://arxiv.org/abs/2307.10816
Code: https://github.com/Sierkinhane/BoxDiff

视频生成(Video Generation)

Simulating Fluids in Real-World Still Images

Homepage: https://slr-sfs.github.io/
Paper: https://arxiv.org/abs/2204.11335
Code: https://github.com/simon3dv/SLR-SFS

图像编辑(Image Editing)

Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing

视频编辑(Video Editing)

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

Project: https://fate-zero-edit.github.io/
Paper: https://arxiv.org/abs/2303.09535
Code: https://github.com/ChenyangQiQi/FateZero

人体运动生成(Human Motion Generation)

BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction

Paper: https://arxiv.org/abs/2211.14304
Code: https://github.com/BarqueroGerman/BeLFusion

低光照图像增强(Low-light Image Enhancement)

Implicit Neural Representation for Cooperative Low-light Image Enhancement

Paper: https://arxiv.org/abs/2303.11722
Code: https://github.com/Ysz2022/NeRCo

场景文本检测(Scene Text Detection)

场景文本识别(Scene Text Recognition)

Self-supervised Character-to-Character Distillation for Text Recognition

Paper: https://arxiv.org/abs/2211.00288
Code: https://github.com/TongkunGuan/CCD

MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition

Paper: https://arxiv.org/abs/2305.14758
Code: https://github.com/simplify23/MRN
中文解读：https://zhuanlan.zhihu.com/p/643948935

图像检索(Image Retrieval)

Zero-Shot Composed Image Retrieval with Textual Inversion

Paper: https://arxiv.org/abs/2303.15247
Code: https://github.com/miccunifi/SEARLE

图像融合(Image Fusion)

DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion

Paper: https://arxiv.org/abs/2303.06840
Code: https://github.com/Zhaozixiang1228/MMIF-DDFM

轨迹预测(Trajectory Prediction)

EigenTrajectory: Low-Rank Descriptors for Multi-Modal Trajectory Forecasting

Homepage: https://inhwanbae.github.io/publication/eigentrajectory/
Paper: https://arxiv.org/abs/2307.09306
Code: https://github.com/InhwanBae/EigenTrajectory

人群计数(Crowd Counting)

Point-Query Quadtree for Crowd Counting, Localization, and More

Paper: https://arxiv.org/abs/2308.13814
Code: https://github.com/cxliu0/PET

Video Quality Assessment(视频质量评价)

Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives

Paper: https://arxiv.org/abs/2211.04894
Code: https://github.com/VQAssessment/DOVER

其它(Others)

MotionBERT: A Unified Perspective on Learning Human Motion Representations

Homepage: https://motionbert.github.io/
Paper: https://arxiv.org/abs/2210.06551
Code: https://github.com/Walter0807/MotionBERT

Graph Matching with Bi-level Noisy Correspondence

LDL: Line Distance Functions for Panoramic Localization

Paper: https://arxiv.org/abs/2308.13989
Code: https://github.com/82magnolia/panoramic-localization

Active Neural Mapping

Homepage: https://zikeyan.github.io/active-INR/index.html
Paper: https://arxiv.org/abs/2308.16246
Code: https://zikeyan.github.io/active-INR/index.html#

Reconstructing Groups of People with Hypergraph Relational Reasoning

Paper: https://arxiv.org/abs/2308.15844
Code: https://github.com/boycehbz/GroupRec

iccv2023-papers-with-code's People

Contributors

Stargazers

Watchers

Forkers

qgh1223 ammieqi vincentseven1 suiuko panghongwei17 ykeivn yinhefeng proy10 piaofu110 wddwzc rocklijun sttomato msathishkumar1990 masterbin-iiau 0000duck emiyaning zfxu daydayupdyp lelegan ma-chenbin reena-hr johnbhlm grid-gudx wujinlonglovezhangmiao1314 xudongwang0828 jmu201521121021 xuanxu92 big-data-ai leegerpeng andytianph wangguangyuan jackhu-bme overbestfitting ruyuan2512 maphysart wang-wenqing piglogic-cyber yudadabing sui6662012 shujunyy123 qiaoptdun xzjzsa zongzi13545329 wewan note-liu ajayarunachalam yueyang07 yuni1314 allisonshen yonghoonkwon wyfei1999 rtmdfg zgsxwsdxg jianshijim rainymoo zzhc3321 zeroonegame trevor-philips-cbd filterbank yanggui19891007 gaotrees yangfukui xiangjun0103 weixuanli-1024 hzj1558718 feixuedudiao bruinxiong coderxuxiang ranqing jinghao99 mrtalhjl walterhu1015 wyx19980727 simon-pu frezaeix lotayou peisun1115 jawaechan xxccb zikang12138 catfootprint yayo13 ahwhbc tcwltcwl octavianchen zhangyuancv albertotono github-sci wxdgithub-bp yifeiwang97 121644048 18724799167 wangxihao zb12138 qitaozhao bywjge yanxioa sanghun3819 jiang-chd-yunnan raj-gupta1

iccv2023-papers-with-code's Issues

ICCV 2021 paper: Dynamic Attentive Graph Learning for Image Restoration

Title - Dynamic Attentive Graph Learning for Image Restoration
Arxiv - https://arxiv.org/abs/2109.06620
Code - https://github.com/jianzhangcs/DAGL

ICCV2021- FREE: Feature Refinement for Generalized Zero-Shot Learning

ICCV2021 paper:

Title: FREE: Feature Refinement for Generalized Zero-Shot Learning
arXiv: https://arxiv.org/pdf/2107.13807.pdf
code: https://github.com/shiming-chen/FREE
topic: Zero-shot Learning | Knowledge transfer

ICCV 2021 paper: Relational Embedding for Few-Shot Classification

Hello,

Please consider including our paper "Relational Embedding for Few-Shot Classification" (https://arxiv.org/abs/2108.09666) as well as its code (https://github.com/dahyun-kang/renet) in the "Few-shot learning" category.

Have a great day! 😃

Best,
Dahyun

ICCV 21 Paper: Social NCE

Title: Social NCE: Contrastive Learning of Socially-aware Motion Representations
Arxiv: https://arxiv.org/abs/2012.11717
Code: https://github.com/vita-epfl/social-nce
Topic: Trajectory Prediction | Contrastive Learning

ICCV2021 Oral paper - Equivariant Imaging: Learning Beyond the Range Space

ICCV2021 Oral paper:

Title: Equivariant Imaging: Learning Beyond the Range Space
arXiv: https://arxiv.org/abs/2103.14756
code: https://github.com/edongdongchen/EI
topic: Self-supervised learning, Low-level vision, image reconstruction, inverse problem

ICCV 21 Paper: Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?

Title: Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?
Arxiv: https://arxiv.org/abs/2105.02498
Link: https://github.com/KingJamesSong/DifferentiableSVD
Topic: Backbone | Others

欢迎分享ICCV 2023 论文和代码 / Welcome to share the paper and code of ICCV 2023

[The format of the issue]
Paper name/title:
Paper link:
Code link:
keywords:

ICCV-2021 Paper: Generating Attribution Maps with Disentangled Masked Backpropagation

Title - Generating Attribution Maps with Disentangled Masked Backpropagation
Arxiv - https://arxiv.org/pdf/2101.06773.pdf
Code - https://gitlab.com/adriaruizo/dmbp_iccv21
Topic - Feature Attribution Visualization

ICCV 2023 paper: "MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation"

Paper name/title: "MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation"
Paper link: https://arxiv.org/abs/2304.09913
Code link: https://github.com/shjo-april/MARS

Thank you for organizing ICCV papers!

ICCV 2021 paper - Generalize then Adapt: Source-Free Domain Adaptive Semantic Segmentation

Title - Generalize then Adapt: Source-Free Domain Adaptive Semantic Segmentation
Arxiv - https://arxiv.org/abs/2108.11249
Project page - https://sites.google.com/view/sfdaseg (code link will be updated here)
Topic - Unsupervised Domain Adaptation (for Semantic Segmentation)

ICCV-2021 Paper: PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering

分类：图像合成
代码仓库：https://github.com/RenYurui/PIRender

ICCV 2021 paper: Geometry-based Distance Decomposition for Monocular 3D Object Detection

Title: Geometry-based Distance Decomposition for Monocular 3D Object Detection
Arxiv: https://arxiv.org/abs/2104.03775
Code: https://github.com/Rock-100/MonoDet
Topic: Monocular 3D Object Detection

ICCV 21 paper: Visual Alignment Constraint for Continuous Sign Language Recognition

Title: Visual Alignment Constraint for Continuous Sign Language Recognition
Arxiv: https://arxiv.org/abs/2104.02330
Code: https://github.com/ycmin95/VAC_CSLR
Topic: Sign Language Recognition | Sequence Learning | Action Recognition

ICCV21 paper: PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop

Hi, thanks for gathering papers with code! Please add our paper:

Title: PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop
Paper (Oral): https://arxiv.org/abs/2103.16507
Code: https://github.com/HongwenZhang/PyMAF
Homepage: https://hongwenzhang.github.io/pymaf

Thanks.

ICCV 21 paper: VLGrammar: Grounded Grammar Induction of Vision and Language

Can you kindly add my paper?

Yining Hong; Qing Li; Song-Chun Zhu; Siyuan Huang, "VLGrammar: Grounded Grammar Induction of Vision and Language", ICCV2021

pdf: https://arxiv.org/abs/2103.12975
code: https://github.com/evelinehong/VLGrammar

ICCV-2021 Paper: Image Classification and Depth Estimation in extreme low-light

ICCV 2021: Photon-Starved Scene Inference using Single Photon Cameras

Paper: https://openaccess.thecvf.com/content/ICCV2021/papers/Goyal_Photon-Starved_Scene_Inference_Using_Single_Photon_Cameras_ICCV_2021_paper.pdf

Code: https://github.com/bhavyagoyal/spclowlight

ICCV-2021 Paper: Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation

Category: 3D Semantic Segmentation(3D语义分割)

Paper: https://openaccess.thecvf.com/content/ICCV2021/papers/Zhuang_Perception-Aware_Multi-Sensor_Fusion_for_3D_LiDAR_Semantic_Segmentation_ICCV_2021_paper.pdf

Code: https://github.com/ICEORY/PMF

Thank you~

ICCV 2021 paper: Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation

Title: Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation
ICCV 2021, Oral

Topic: Depth Estimation (monocular, self-supervised)
Code: https://github.com/hyBlue/FSRE-Depth
arXiv: https://arxiv.org/abs/2108.08829

Thanks!

ICCV-2021 Paper: Zen-NAS: A Zero-Shot NAS for High-Performance Deep Image Recognition

Paper and Code: https://github.com/idstcv/ZenNAS

Thank you very much!

ICCV-2021 Paper: Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks

Category: 3D Instance Segmentation(3D实例分割)

Paper: https://openaccess.thecvf.com/content/ICCV2021/html/Liang_Instance_Segmentation_in_3D_Scenes_Using_Semantic_Superpoint_Tree_Networks_ICCV_2021_paper.html

Code: https://github.com/Gorilla-Lab-SCUT/SSTNet

Thank you~

ICCV 2021 Paper

Title

Self-Supervised Representation Learning from Flow Equivariance
https://arxiv.org/abs/2101.06553

Code

Not available

ICCV Paper: IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID

Hi, thanks for your collections! Please add our paper:

Paper (ICCV 2021 Oral): https://arxiv.org/abs/2108.02413
Code: https://github.com/SikaStar/IDM

Thanks!

ICCV 2021 paper: LSG-CPD: Coherent Point Drift with Local Surface Geometry for Point Cloud Registration

paper:
https://openaccess.thecvf.com/content/ICCV2021/papers/Liu_LSG-CPD_Coherent_Point_Drift_With_Local_Surface_Geometry_for_Point_ICCV_2021_paper.pdf

codes:
https://github.com/ChirikjianLab/LSG-CPD.git

topic:
Point Cloud Registration

Thx for your work!

ICCV2021 paper: Neural Articulated Radiance Field

Title - Neural Articulated Radiance Field
Arxiv - https://arxiv.org/abs/2104.03110
Project page (with code) - https://github.com/nogu-atsu/NARF
Topic - NeRF

ICCV-paper:Enhanced Boundary Learning for Glass-like Object Segmentation

Paper: https://arxiv.org/abs/2103.15734
Code: https://github.com/hehao13/EBLNet.

Code will be opensourced soon.

ICCV-2021 Paper: Few Shot Visual Relationship Co-Localization

Title - Few Shot Visual Relationship Co-Localization
Arxiv - https://arxiv.org/abs/2108.11618
Project page (with code) -https://vl2g.github.io/projects/vrc/
Topic - Few-Shot Learning

ICCV 2021 paper: RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth

Title: RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth
Arxiv: https://arxiv.org/abs/2108.00616
Dataset and Code: https://github.com/MengyangPu/RINDNet
Topic: Edge Detection

ICCV 2021 paper: Conditional DETR for Fast Training Convergence

Hi,
Official code of our paper "Conditional DETR for Fast Training Convergence" (https://arxiv.org/abs/2108.06152) has been released, please see https://github.com/Atten4Vis/ConditionalDETR

Hope you can add this link to your README file,
thanks.

Topic:
Transformer, Object Detection

ICCV2021 paper: Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition

Title: Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
arxiv: https://arxiv.org/abs/2107.12213
code: https://github.com/Uason-Chen/CTR-GCN
topic: action recognition

ICCV2021 Oral paper: GNeRF: GAN-based Neural Radiance Field without Posed Camera

ICCV2021 Oral paper:

Title: GNeRF: GAN-based Neural Radiance Field without Posed Camera
arXiv: https://arxiv.org/abs/2103.15606
code: https://github.com/MQ66/gnerf
topic: NeRF

ICCV 21 paper: Residual Attention: A Simple but Effective Method for Multi-Label Recognition

Hi,
Official code of has been released, see https://github.com/Kevinz-code/CSRA.git

Hope you can add this link to your README file ,
thanks.

ICCV-2021 Paper: Where are you heading? Dynamic Trajectory Prediction with Expert Goal Examples

Paper and Code: Link

Thanks

ICCV 2021 paper: Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection

Please add ICCV 2021 paper: Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection,
paper: https://openaccess.thecvf.com/content/ICCV2021/papers/Cui_Multitask_AET_With_Orthogonal_Tangent_Regularity_for_Dark_Object_Detection_ICCV_2021_paper.pdf
code:
https://github.com/cuiziteng/MAET

Another work on Self-Supervised Learning

Hi @amusi, thanks for creating this list!

We have a paper on self-supervised pre-training on point cloud accepted by ICCV this year.
It would be so nice of you to include our work :)

Project Page: https://hansen7.github.io/OcCo/
Code: https://github.com/hansen7/OcCo
Paper: https://arxiv.org/abs/2010.01089

ICCV 2021 paper: Influence Selection for Active Learning

paper:
https://arxiv.org/abs/2108.09331
https://openaccess.thecvf.com/content/ICCV2021/papers/Liu_Influence_Selection_for_Active_Learning_ICCV_2021_paper.pdf

codes:
https://github.com/dragonlzm/ISAL

topic:
2D目标检测(Object Detection)

Thx for your work!

Thanks for maintaining such a repository.

ICCV 2021 paper: Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

Hi,
Official code of our paper "Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain" (https://arxiv.org/abs/2108.08487) has been released, please see https://github.com/iCGY96/APR

Hope you can add this link to your README file,
thanks.

Topic:
Others

ICCV 21 paper: Fast Convergence of DETR with Spatially Modulated Co-Attention

Fast Convergence of DETR with Spatially Modulated Co-Attention

https://arxiv.org/abs/2101.07448

https://github.com/abc403/SMCA-replication

ICCV 2021 paper: TRAR: Routing the Attention Spans in Transformer for Visual Question Answering

Thanks for adding our ICCV 2021 paper~

Paper: https://openaccess.thecvf.com/content/ICCV2021/papers/Zhou_TRAR_Routing_the_Attention_Spans_in_Transformer_for_Visual_Question_ICCV_2021_paper.pdf
Code: https://github.com/rentainhe/TRAR-VQA
Topic: Visual Question Answering

ICCV2021 Paper: RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection

Paper
Code
Topic: 3D point cloud detection

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.