[CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies

Home Page: https://ldkong.com/OpenESS

autonomous-driving event-camera open-vocabulary semantic-segmentation

OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies

Lingdong Kong¹,²    Youquan Liu³    Lai Xing Ng⁴    Benoit R. Cottereau⁵,⁶    Wei Tsang Ooi¹
¹National University of Singapore    ²CNRS@CREATE    ³Hochschule Bremerhaven    ⁴Institute for Infocomm Research, A*STAR    ⁵IPAL, CNRS IRL 2955, Singapore    ⁶CerCo, CNRS UMR 5549, Universite Toulouse III

About

OpenESS is an open-vocabulary event-based semantic segmentation (ESS) framework that synergizes information from image, text, and event-data domains to enable scalable ESS in an open-world, annotation-efficient manner.

[Figure: an input event stream and its zero-shot ESS result, with open-vocabulary query panels “Driveable”, “Car”, “Manmade”, “Walkable”, “Barrier”, and “Flat”.]
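
As a rough illustration of what open-vocabulary labeling with text queries such as “Driveable” or “Car” can look like in general, the sketch below assigns each pixel the class prompt whose embedding is most similar to that pixel's feature. This is not the OpenESS implementation: the function `zero_shot_segment`, the toy 3-D embeddings, and the tiny 2×3 feature map are all hypothetical and made up for illustration. In practice the features would come from an event encoder and the prompt embeddings from a text encoder.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def zero_shot_segment(pixel_feats, text_embeds):
    """Label each pixel with the index of its most similar text embedding.

    pixel_feats: H x W grid of feature vectors (hypothetical encoder output).
    text_embeds: one embedding per class prompt (hypothetical encoder output).
    """
    num_classes = len(text_embeds)
    return [
        [max(range(num_classes), key=lambda k: cosine(feat, text_embeds[k]))
         for feat in row]
        for row in pixel_feats
    ]

# Class prompts taken from the figure above; embeddings are toy values.
prompts = ["driveable", "car", "manmade"]
text_embeds = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

# Toy 2 x 3 feature map, one 3-D feature vector per pixel.
pixel_feats = [
    [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1], [0.1, 0.9, 0.2]],
    [[0.0, 0.2, 0.7], [0.1, 0.1, 0.9], [0.2, 0.7, 0.1]],
]

label_map = zero_shot_segment(pixel_feats, text_embeds)
names = [[prompts[k] for k in row] for row in label_map]
print(names)
```

Because the class set is just a list of prompts, changing the vocabulary in this sketch only means changing `prompts` and their embeddings; no per-class training labels are involved.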

Updates

  • [2024.05] - Our paper is available on arXiv. The code will be released later.
  • [2024.04] - OpenESS was selected as a ✨ highlight ✨ at CVPR 2024 (2.8% = 324/11532).
  • [2024.02] - OpenESS was accepted to CVPR 2024! 🎉

🎥 Demo

  • Demo #1: YouTube ⤴️
  • Demo #2: YouTube ⤴️
  • Demo #3: YouTube ⤴️

⚙️ Installation

Kindly refer to INSTALL.md for the installation details.

♨️ Data Preparation

Kindly refer to DATA_PREPARE.md for the details to prepare the DDD17-Seg and DSEC-Semantic datasets.

🚀 Getting Started

Please refer to GET_STARTED.md to learn more about how to use this codebase.

📊 Benchmark

OpenESS Framework

Annotation-Free ESS

To be updated.

Fully-Supervised ESS

To be updated.

Open-Vocabulary ESS

To be updated.

Qualitative Assessment

📝 TODO List

To be updated.

Citation

If you find this work helpful, please consider citing our paper:

@inproceedings{kong2024openess,
  title = {OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies},
  author = {Kong, Lingdong and Liu, Youquan and Ng, Lai Xing and Cottereau, Benoit R. and Ooi, Wei Tsang},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2024},
}

License

This work is released under the Apache License, Version 2.0, although some specific implementations in this codebase may be covered by other licenses. Please refer to LICENSE.md for details if you intend to use our code commercially.

Acknowledgements

To be updated.
