This project forked from joslefaure/hit_ava

This repository is a fork of https://github.com/joslefaure/HIT customized for the AVA dataset

HIT

This project is the official implementation of our paper Holistic Interaction Transformer Network for Action Detection (WACV 2023), authored by Gueter Josmy Faure, Min-Hung Chen and Shang-Hong Lai.

What makes this different from the original Repo?

  • The code is simplified and customized for the AVA dataset
  • This implementation outperforms the original (on JHMDB and UCF) using only person features (no hand, pose, or object features)
  • (Caution) I verified that the code runs on AVA, but I did not train or test on the whole AVA dataset, so I don't know how it compares with the original implementation.

Installation

First, install this project; see INSTALL.md for instructions.

Data Preparation

To train or run inference on AVA, see DATA.md for data preparation instructions. Instructions for other datasets are coming soon.

Model Zoo

Please see MODEL_ZOO.md to download models.

Training and Inference

To train or run inference with HIT, please refer to GETTING_STARTED.md.

Citation

If this project helps you in your research or project, please cite this paper:

@InProceedings{Faure_2023_WACV,
    author    = {Faure, Gueter Josmy and Chen, Min-Hung and Lai, Shang-Hong},
    title     = {Holistic Interaction Transformer Network for Action Detection},
    booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
    month     = {January},
    year      = {2023},
    pages     = {3340-3350}
}

Acknowledgement

We are very grateful to the authors of AlphAction for open-sourcing their code, on which this repository is heavily based. If you find this research useful, please consider citing their paper as well.

@inproceedings{tang2020asynchronous,
  title={Asynchronous Interaction Aggregation for Action Detection},
  author={Tang, Jiajun and Xia, Jin and Mu, Xinzhi and Pang, Bo and Lu, Cewu},
  booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
  year={2020}
}
