Coder Social home page Coder Social logo

iit-cs553-cloudcomputing's Introduction

# readme.txt for hw6

## Requirement
- gensort1.5
- Hadoop
- Spark
- Python3
- python/pandas
- python/matplotlib

## DIRECTORY STRUCTURE
.
├── mySort                      # source code and makefile for our mySort program, implemented in C++
│   ├── Makefile
│   ├── monitor                 # monitor scripts and logs for mySort and LinuxSort
│   │   ├── ...
│   │   ├── plot.py             # plotting scripts in python
│   │   └── monitor.sh
│   └── src
├── hadoop_src                  # source code for our hadoop sort program 
│   ├── ...
│   ├── hadoop_sort.sh          # scripts for Hadoop Sort
│   └── logs                    # logs for Hadoop Sort
├── spark_src                   # source code and makefile for our Spark Sort program,
|   ├── Makefile
|   ├── SparkSort.java
|   ├── SparkSort.sh            # scripts for Spark Sort
|   └── ...                     # logs for Spark Sort
└─── readme.txt

## Usage

### mySort & linuxSort
all experiments of mySort & linuxSort are wrapped in the mySort/monitor/monitor.sh

To run the experiments:
1. cd mySort/
2. make
3. monitor/monitor.sh [0 mySort | 1 Linux Sort] [dataset size in GB] [memory size for sorting in GB]


### hadoop sort
# preparations
# create a project directory and set the PRJ_DIR to it
# Make sure gensort binary is downloaded into that directory
# and HADOOP_PATH to your hadoop installation
# To run the hadoop sort experiments, navigate to the hw6 directory, and run
~/.../team-25/hw6 $./hadoop_src/hadoop_sort.sh [expsize]
# where [expsize] specifies the workload size created by gensort, in GBs.
# it recompiles the hadoop sort program from source, and submits the job to yarn.
# The log files from for the experiment will be saved in the ./hw6/hadoop_src/logs, which includes the timed results of the experiment
ls ./hadoop_src/logs

### spark sort
all experiments of Spark Sort are wrapped in the spark_src/SparkSort.sh

To run the experiments:
1. cd spark_src/
2. make 
3. SparkSort.sh [dataset size in GB]

iit-cs553-cloudcomputing's People

Contributors

alexandru-orhean avatar lancewl avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.