Coder Social home page Coder Social logo

tweet-hotspots's Introduction

Tweet Hotspots

This application will search a large Geo-Coded Twitter dataset to identify tweet hotspots around Melbourne. The key purpose is to experiment and exercise the parallel programming on HPC environment and GeoProcessing big Twitter data.

It is using Python and mpi4py as a key module.

Running Local

mpiexec -n 8 python app.py

Running on SPARTAN HPC

sbatch job_1n1c.sh
sbatch job_1n8c.sh
sbatch job_2n8c.sh
sbatch job_2n8c-sym.sh

NOTE: the different between job_2n8c.sh and job_2n8c-sym.sh is that, the latter -sym ensure 4 cores per node by using --ntasks-per-node=4, therefore symmetrical.

Utility scripts such as env.sh prepare input data and environmental setup on cluster and clean.sh clear outputs and log files for consecutive runs if desire.

Slurm useful commands

squeue -u [ur_username]
scontrol show jobid -dd [ur_job_id]
scancel [ur_job_id]

This assignment work is done for COMP90024 Cluster and Cloud Computing assignment 1 assessment of 2017 SM1, The University of Melbourne. You can read the report for background context, though it discusses more on the data that I have worked with. You may also want to read the related tutorials mpi4py-tute and mpjexpress-tute. The implementation still has room for improvement. You may wish to cite this work as follow.

LaTeX/BibTeX:

@misc{sanl1,
    author    = {Lin, San Kho},
    title     = {Tweet Hotspots - HPC Twitter GeoProcessing},
    year      = {2017},
    url       = {https://github.com/victorskl/tweet-hotspots},
    urldate   = {yyyy-mm-dd}
}

tweet-hotspots's People

Contributors

victorskl avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.