Coder Social home page Coder Social logo

dmc's Introduction

DMC: Deep Multidimensional Clustering

This repository aims to provides items about deep multimensional clustering(DMC), such as datasets, typical training processes, evaluation protocol etc.

What is DMC?

Deep multimensional clustering targets at multiple non-redundant partitions of unlabeled images with power of deep architectures that model non-linear similarity between images along different axes. For instance, we can group the figures based on the object meaning, environment, shape, color, material etc.

Why DMC?

DMC can extract more visual information than a single clustering that cares only about semantic meaning. With the use of DMC, many applications are benefited, such as image search, video search, video recommendation, etc.

How DMC?

19NeurIPSWorkshop Disentangling to Cluster Gaussian Mixture Variational Ladder Autoencoders

19 ICLR: LTVAE Learning Latent Superstructures in Variational Autoencoders for Deep Multidimensional Clustering

21 NeurIPS: MFCVAE Multi-Facet Clustering Variational Autoencoders

Related Multidimension clustering Literature

18 KDD: Discovering Non-Redundant K-means Clusterings in Optimal

19 AAAI: Multiple Independent Subspace Clusterings

19 IEEE: TRANSACTIONS ON CYBERNETICS: Discovering_Multiple_Co-Clusterings_With_Matrix_Factorization

20 AAAI: Multi-view multiple clusterings using deep matrix factorization

20 AAAI: Deep Embedded Non-Redundant Clustering

20 ICDM: Deep Incomplete Multi-view Multiple Clusterings

20 TKDD: Non-Redundant Subspace Clusterings with Nr-Kmeans

21 IJIS: Multipartition clustering of mixed data with Bayesian networks

21 MachineLearning: Multiple Clusterings Of Heterogenous information Netiworks

DMC Datasets

There are some existing datasets for DMC, such as

  • 3DShapes : 480000 Images with 6 labels for each single image.
  • Microsoft COCO : 330K images (>200K labeled), 5 captions per image

Here, we propose a new approach of image stitching to produce the datasets for multidimension clustering. CIFAR-100 is chosen as a base dataset. To forge a new figure, we randomly select four pictures in the base dataset and combine them as a 2x2 large picture. These four pictures are randomly choosen from 4 random categories. This approach can be repeated to get pictures with multiple captions such that be suitable for DMC. The corresponding code is on generate_grid_img.py

Through the approach above, we create the datasets at baidu cloud link. Following are some examples in the datasets

image

label: [19, 39, 67, 81]

image

label: [29, 41, 30, 34]

image

label: [0, 71, 76, 20]

image

label: [11, 87, 83, 81]

DMC Evaluation Protocols

NMI: normalized mutual information.

ACC: Accuracy of clustering results after optimal matching through NMI.

dmc's People

Contributors

xingzhizhou avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.