Coder Social home page Coder Social logo

cadgmm's Introduction

·

Implementation code and data of the algorithm described in paper "CADGMM: Correlation-aware Deep Generative Model for Unsupervised Anomaly Detection".

Abstract

Unsupervised anomaly detection aims to identify anomalous samples from highly complex and unstructured data, which is pervasive in both fundamental research and industrial applications. However, most existing methods neglect the complex correlation among data samples, which is important for capturing normal patterns from which the abnormal ones deviate. In this paper, we propose a method of Correlation aware unsupervised Anomaly detection via Deep Gaussian Mixture Model (CADGMM), which captures the complex correlation among data points for high-quality low-dimensional representation learning. More specifically, the relations among data samples are correlated firstly in forms of a graph structure, in which, the node denotes the sample and the edge denotes the correlation between two samples from the feature space. Then, a dual-encoder that consists of a graph encoder and a feature encoder, is employed to encode both the feature and correlation information of samples into the low-dimensional latent space jointly, followed by a decoder for data reconstruction. Finally, a separate estimation network as a Gaussian Mixture Model is utilized to estimate the density of the learned latent vector, and the anomalies can be detected by measuring the energy of the samples. Extensive experiments on real-world datasets demonstrate the effectiveness of the proposed method.

Experimental results

Prerequisites

Tested on

  • python 3.6.5
  • pytorch 0.4.0
  • h5py 2.10.0
  • joblib 0.13.2
  • keras 2.2.0
  • matplotlib 3.1.1
  • numpy 1.17.1
  • pillow 6.2.1
  • scipy 1.3.2
  • seaborn 0.9.0
  • tensorboard 1.10.0
  • tensorboardx 1.9
  • tensorflow-gpu 1.14.0

Run

python main.py --dataset=kdd --v=0 --K=4 --KNN=15
#--dataset:/kdd/arrhythmia/satellite
#--v: The noise ratio of train data
#--K: GMM membership number
#--KNN: k-value of KNN 

Cite:

@inproceedings{fan2020cadgmm,
  title        = {Correlation-aware Deep Generative Model for Unsupervised Anomaly Detection},
  author       = {Haoyi Fan, Fengbin Zhang, Ruidong Wang, Liang Xi, and Zuoyong Li},
  booktitle    = {The 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD)},
  year         = {2020},
  organization={Springer}
}

cadgmm's People

Contributors

haoyfan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.