DAIR-V2X and OpenDAIRV2X: Towards General and Real-World Cooperative Autonomous Driving



Table of Contents:

  1. Highlights
  2. News
  3. Dataset Download
  4. Getting Started
  5. Major Features
  6. Benchmark
  7. Citation
  8. Contact

Highlights

  • DAIR-V2X: The first real-world dataset for research on vehicle-to-everything autonomous driving. It comprises a total of 71,254 frames of image data and 71,254 frames of point cloud data.
  • OpenDAIR-V2X: An open-source framework supporting research on vehicle-to-everything autonomous driving.

News

  • [2023.03] 🔥 Our new dataset "V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting" has been accepted by CVPR2023. Congratulations! We will release the dataset soon. Please follow DAIR-V2X-Seq for the latest news.
  • [2023.03] 🔥 We have released the training code for our FFNET, and OpenDAIRV2X now supports evaluating FFNET.
  • [2022.11] We have held the first VIC3D Object Detection challenge.
  • [2022.07] We have released the OpenDAIRV2X codebase v1.0.0. The current version helps researchers use the DAIR-V2X dataset and reproduce the benchmarks.
  • [2022.03] Our paper "DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection" has been accepted by CVPR2022. The arXiv version is available here.
  • [2022.02] The DAIR-V2X dataset is available here. It can be downloaded without restriction within mainland China.

Dataset Download

Getting Started

Please refer to getting_started.md for installation, evaluation, benchmarks, training, and more for VIC3D.

Major Features

  • Support Train/Evaluation for VIC3D

    It will directly support model training and evaluation for VIC3D. Currently, model inference and training are mainly based on MMDetection3D, which is not very convenient for VICAD research.

  • Evaluation (Model inference is based on MMDetection3D)

  • Training based on MMDetection3D

  • Direct Evaluation with DAIR-V2X Framework

  • Direct Training with DAIR-V2X Framework

  • Support different fusion methods for VIC3D

    It will directly support different fusion methods including early fusion/feature fusion/late fusion. Now it supports early fusion and late fusion.

  • Early Fusion

  • Feature Fusion

  • Late Fusion
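
To make the difference between these fusion stages concrete, here is a minimal, standard-library-only sketch. It is not the OpenDAIRV2X implementation: the function names, the yaw-plus-translation calibration, the `(x, y, score)` detection format, and the 2-metre matching threshold are all assumptions chosen for illustration. Early fusion merges raw infrastructure points into the vehicle frame before detection, while late fusion merges the detections that each view produces on its own.

```python
import math

def to_vehicle_frame(points, yaw, translation):
    """Rotate 3D points about the z-axis by `yaw` (radians), then shift by `translation`."""
    c, s = math.cos(yaw), math.sin(yaw)
    tx, ty, tz = translation
    return [(c * x - s * y + tx, s * x + c * y + ty, z + tz) for x, y, z in points]

def early_fusion(veh_points, inf_points, yaw, translation):
    """Merge raw point clouds in the vehicle frame; a single detector would run afterwards."""
    return list(veh_points) + to_vehicle_frame(inf_points, yaw, translation)

def late_fusion(veh_dets, inf_dets, max_dist=2.0):
    """Merge per-view detections given as (x, y, score), already in the vehicle frame.

    An infrastructure detection within `max_dist` metres of a vehicle-side entry is
    treated as the same object and the higher-scoring of the two is kept.
    Unmatched infrastructure detections are appended (hypothetical matching rule).
    """
    fused = list(veh_dets)
    n_vehicle = len(veh_dets)
    for ix, iy, iscore in inf_dets:
        match = None
        for k in range(n_vehicle):
            vx, vy, _ = fused[k]
            if math.hypot(ix - vx, iy - vy) <= max_dist:
                match = k
                break
        if match is None:
            fused.append((ix, iy, iscore))
        elif iscore > fused[match][2]:
            fused[match] = (ix, iy, iscore)
    return fused

if __name__ == "__main__":
    cloud = early_fusion([(0.0, 0.0, 0.0)], [(1.0, 0.0, 0.0)], 0.0, (10.0, 0.0, 0.0))
    print(len(cloud), "points after early fusion")
    boxes = late_fusion([(5.0, 0.0, 0.6)], [(5.5, 0.0, 0.9), (40.0, 2.0, 0.7)])
    print(len(boxes), "detections after late fusion")
```

The sketch also hints at the bandwidth trade-off: early fusion has to transmit the infrastructure point cloud itself, whereas late fusion only transmits a short list of detections.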

  • Support multi-modality/single-modality detectors for VIC3D

    It will directly support detectors of different modalities, including image-modality, pointcloud-modality, and image-pointcloud fusion detectors. Currently it supports the image-modality detector ImvoxelNet and the pointcloud-modality detector PointPillars.

  • Image-modality

  • Pointcloud-modality

  • Multi-modality

  • Support Cooperation-view/single-view detectors for VIC3D

    It directly supports detectors for different views in VIC3D, including infrastructure-view, vehicle-view, and vehicle-infrastructure cooperation-view detectors.

  • Infrastructure-view

  • Vehicle-view

  • Cooperation-view

Benchmark

More benchmarks are available in SV3D-Veh, SV3D-Inf, and VIC3D. Part of the VIC3D benchmark is shown in the following table.

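AP-3D and AP-BEV are reported at IoU=0.5.

| Modality | Fusion | Model | Dataset | AP-3D Overall | AP-3D 0-30m | AP-3D 30-50m | AP-3D 50-100m | AP-BEV Overall | AP-BEV 0-30m | AP-BEV 30-50m | AP-BEV 50-100m | AB |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Image | VehOnly | ImvoxelNet | VIC-Sync | 9.13 | 19.06 | 5.23 | 0.41 | 10.96 | 21.93 | 7.28 | 0.78 | 0 |
| Image | Late-Fusion | ImvoxelNet | VIC-Sync | 18.77 | 33.47 | 9.43 | 8.62 | 24.85 | 39.49 | 14.68 | 14.96 | 309.38 |
| Pointcloud | VehOnly | PointPillars | VIC-Sync | 48.06 | 47.62 | 63.51 | 44.37 | 52.24 | 30.55 | 66.03 | 48.36 | 0 |
| Pointcloud | Early Fusion | PointPillars | VIC-Sync | 62.61 | 64.82 | 68.68 | 56.57 | 68.91 | 68.92 | 73.64 | 65.66 | 1382275.75 |
| Pointcloud | Late-Fusion | PointPillars | VIC-Sync | 56.06 | 55.69 | 68.44 | 53.60 | 62.06 | 61.52 | 72.53 | 60.57 | 478.61 |
| Pointcloud | Late-Fusion | PointPillars | VIC-Async-2 | 52.43 | 51.13 | 67.09 | 49.86 | 58.10 | 57.23 | 70.86 | 55.78 | 478.01 |
| Pointcloud | TCLF | PointPillars | VIC-Async-2 | 53.37 | 52.41 | 67.33 | 50.87 | 59.17 | 58.25 | 71.20 | 57.43 | 897.91 |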
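
As a worked reading of the table above, the short script below compares the accuracy gain of each PointPillars fusion method on VIC-Sync against its AB value. The numbers are copied from the table; the variable and function names are hypothetical, and the interpretation of AB as a transmission-cost measure (lower is cheaper) is an assumption, since the column is not defined in this section.

```python
# (fusion, AP-3D overall, AB) copied from the PointPillars VIC-Sync rows above.
ROWS = [
    ("VehOnly", 48.06, 0.0),
    ("Early Fusion", 62.61, 1382275.75),
    ("Late-Fusion", 56.06, 478.61),
]

def gains_over_baseline(rows, baseline="VehOnly"):
    """Return {fusion: AP-3D gain over the baseline row}, rounded to 2 decimals."""
    base_ap = next(ap for name, ap, _ in rows if name == baseline)
    return {name: round(ap - base_ap, 2) for name, ap, _ in rows if name != baseline}

gains = gains_over_baseline(ROWS)
ab = {name: cost for name, _, cost in ROWS}
ab_ratio = ab["Early Fusion"] / ab["Late-Fusion"]

for name, gain in gains.items():
    print(f"{name}: +{gain:.2f} AP-3D at AB={ab[name]}")
print(f"Early fusion AB is about {ab_ratio:.0f}x the late fusion AB")
```

On these rows, early fusion adds 14.55 AP-3D points over the vehicle-only baseline and late fusion adds 8.00, while the early fusion AB value is nearly 2,900 times the late fusion one.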

Citation

If you find this project useful in your research, please consider citing:

@inproceedings{dair-v2x,
  title={Dair-v2x: A large-scale dataset for vehicle-infrastructure cooperative 3d object detection},
  author={Yu, Haibao and Luo, Yizhen and Shu, Mao and Huo, Yiyi and Yang, Zebang and Shi, Yifeng and Guo, Zhenglong and Li, Hanyu and Hu, Xing and Yuan, Jirui and Nie, Zaiqing},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={21361--21370},
  year={2022}
}

Contact

If you have any questions or suggestions, please email [email protected].

Related Resources

Awesome
