Coder Social home page Coder Social logo

msakai / plcm Goto Github PK

View Code? Open in Web Editor NEW

This project forked from bnegreve/plcm

0.0 2.0 0.0 58 KB

Parallel algorithm for closed itemsetmining

Home Page: http://melindaplcm.ligforge.imag.fr/

License: GNU Lesser General Public License v3.0

Shell 0.03% HTML 3.05% C++ 95.07% Makefile 0.66% M4 1.18%

plcm's Introduction

PLCM is a frequent itemset mining algorithm based on the LCM algorithm. 

##################
#### Download ####
##################

Latest version at: http://people.cs.kuleuven.be/~benjamin.negrevergne/plcm/plcm_latest.tar.gz

##################################
##### Reference publications #####
##################################

Discovering Closed Frequent Itemsets on Multicore:
Parallelizing Computations and Optimizing Memory Accesses. 

Benjamin Negrevergne Alexandre Termier Jean-François Méhaut And
Takeaki Uno.

LCM ver. 2: Efficient Mining Algorithms for
Frequent/Closed/Maximal Itemsets

Takeaki Uno1, Masashi Kiyomi, Hiroki Arimura2

###################
##### Compile #####
###################

./configure 
make
make install 

### debug mode ###

./configure --enable-debug=yes
make
make install 

#################
###### Run ######
#################

plcm <dataset file> <absolute threshold> <output file> [ -t number of threads ]

(If output is '-' then plcm dumps itemsets on the standard output.)

#######################
##### File format #####
#######################

### Input file ###
Dataset must be a single ASCII file. 

Each line is a transaction. 
Each transaction contains distinct items. 
Transaction must be ordered. 
Last line must be empty.

eg. 
--- file test.dat ---
1 2 4 6
1 2 3 5 7
2 3

is a valid dataset. 

### Output file ###

plcm output files have the following format: 
* each line is a frequent closed itemset
* frequency is stored at the end of the line into brackets.

eg.
running plcm on test.dat dataset, with the following command 
./plcm test.dat 2 out.dat -t 1

will generate the following file 

--- file out.dat ---
2 (3)
1 2 (2)
3 2 (2)

You can change the output format by modifying the dumpItemset()
function in plcm.hpp and plcm.cpp

################
##### BUGs #####
################

Report bugs and/or comments at: 
[email protected]

My FirstName is Benjamin
My LastName is Negrevergne

If you observe a bug, please compile in debug mode and report the error message if any. 
(To compile in debug mode use ./configure --enable-debug=yes.)


######################
##### Developers #####
######################

To checkout the source tree:
git clone <plcm_git_repos> plcm
cd plcm
git submodule init
git submodule update

Compile with the standard procedure (see Compile section)

./configure 
make 
make install

plcm's People

Contributors

msakai avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.