Light

wjtan99 / bert_anomaly_video_classification Goto Github PK

View Code? Open in Web Editor NEW

15.0 1.0 2.0 106 KB

Overlooked Video Classification in Video Anomaly Detection

Home Page: https://arxiv.org/abs/2210.06688

Python 49.03% Jupyter Notebook 50.97%

anomaly bert detection

bert_anomaly_video_classification's Introduction

Overlooked Video Classification in Video Anomaly Detection

The power of video-level classification in almost all previous video anomaly detection is either overlooked or not studied well explicitly. With addition of a BERT or LSTM video classification, we achieve new SOTA results on UCF-Crime, ShanghaiTech, and XD-Violence datasts.

MIL-BERT

Standard MIL with BERT video classification

RTFM-BERT

Modificed RTFM with BERT video classification

bert_anomaly_video_classification's People

Contributors

Stargazers

Watchers

Forkers

rachleb hsouporto

bert_anomaly_video_classification's Issues

Regarding 'dataset.py'

When referring to the UCF-crime dataset within 'dataset.py', two txt files are called:

train_file = 'train_normal.txt' test_file = 'test_normalv3.txt'

I cannot find these txt files in any of the UCF-dataset versions or txt file splits.

Where can I find these files?

Thanks!

some questions about extracting I3D features

Will you extract I3D features of 10crop? If you have code, can you provide me with a copy? Thank you very much!

Code to generate the XDViolence_gt.npy needed for test_10crop.py

Could you please elaborate on how to create the XDViolence_gt.npy file to replicate your results using the XD-Violence dataset?

About the video_list generate

Thanks for sharing the code of this excellent work. But I have some questions about the code. It will be much appreciated if you could resolve my puzzles.

the result of your process of the ucf dataset and generating the video_list.txt in "/BERT_Anomaly_Video_Classification/tree/main/MIL-BERT/fix_ucf_crime_test_list.ipynb" is below:

['Abuse/Abuse028_x264.mp4|1412|[165, 240, -1, -1]', 'Abuse/Abuse030_x264.mp4|1544|[1275, 1360, -1, -1]', 'Arrest/Arrest001_x264.mp4|2374|[1185, 1485, -1, -1]', 'Arrest/Arrest007_x264.mp4|3144|[1530, 2160, -1, -1]', 'Arrest/Arrest024_x264.mp4|3629|[1005, 3105, -1, -1]']

but the result was processed by me is below：
Abuse/Abuse028_x264.mp4|1413|[165, 240, -1, -1]
Abuse/Abuse030_x264.mp4|1545|[1275, 1360, -1, -1]
Arrest/Arrest001_x264.mp4|2375|[1185, 1485, -1, -1]
Arrest/Arrest007_x264.mp4|3145|[1530, 2160, -1, -1]

there are different in num_frames, I use the "frames.pkl" released by https://github.com/junha-kim/Learning-to-Adapt-to-Unseen-Abnormal-Activities

Thanks very much

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.