vsingh58 / hadoop-page-parser

This project forked from rahul27/hadoop-page-parser

A Java parser program to generate statistics about pages in memory

Java 11.94% Prolog 81.59% Perl 6.42% Shell 0.05%

hadoop-page-parser's Introduction

README for the Hadoop Page Statistics Module
Created by: Rahul Rudradevan and Praveen Ammanji of the University of California, Irvine.

Run finshell.sh to sample the application. The application to be profiled is fed into "finshell.sh", a shell script that iteratively calls the sampling program.

	$ sh finshell.sh > outfile

This writes the sampled output to "outfile" using the fincore module developed by David Plonka of the University of Wisconsin - Madison (http://net.doit.wisc.edu/~plonka/fincore).
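
The iterative sampling described above can be sketched roughly as follows. This is only an illustration of the general pattern, not the contents of finshell.sh: the function name sample_loop, its arguments (sample count, interval in seconds), and the sampler command passed to it are all hypothetical.

```shell
#!/bin/sh
# Hypothetical sketch of an iterative sampling loop; NOT the actual finshell.sh.
# Usage: sample_loop COUNT INTERVAL_SECONDS COMMAND [ARGS...]
sample_loop() {
    count=$1
    interval=$2
    shift 2
    i=0
    while [ "$i" -lt "$count" ]; do
        # Run the sampler once; its output goes to stdout so the caller can redirect it.
        "$@"
        i=$((i + 1))
        # Pause between samples, except after the last one.
        if [ "$i" -lt "$count" ]; then
            sleep "$interval"
        fi
    done
}
```

Under these assumptions, calling `sample_loop 10 1 <sampler command> > outfile` would run the sampler ten times, one second apart, and collect every sample in a single file, which mirrors the redirect shown above.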

Now feed this output to the Hadoop program to generate the page statistics. The program code is in the folder "Hadoop Program". The dist folder contains the jar that is run to generate the output. A typical run is shown below:

	$ sudo hadoop jar Pageset.jar <input location> <output location>

Logs of sample inputs and outputs, and of a standalone run of Hadoop on a pseudo-cluster, are included in the folder "Sample Input Output".


