BigDataDevKit

Big Data Development Kit (Hadoop / Spark / Zeppelin / IntelliJ) in Amazon AWS

Usage:

Launch an instance on Amazon Web Services.

Specify the following script in the initialization of the VM:

#!/bin/bash
wget -O- http://bit.ly/1PjnNB3 |\
 PASSWORD="changeme"\
 TOKEN="duck-dns-token"\
 HOST="duckdnshost"\
 EMAIL="your@email"\
 bash

NOTE!

  • You need to select the 64-bit Amazon Linux image.
  • You need at least a small instance (2 GB of memory); 4 GB is better for running all the services.
  • You need at least 20 GB of disk space.
  • Create a secondary block device on sdb; the /app folder with all your data will be mounted there, so it is preserved when the instance is terminated.
  • Change the password: replace the string within the quotes with the password you want.
  • Register a hostname at www.duckdns.org, get the token, and put them in the HOST and TOKEN variables.
  • Specify your email (used for the Let's Encrypt service).
  • If you have a backup of your Let's Encrypt data (for example in Dropbox), set LETGZ="...." to the URL of your backup.
  • Before you launch the instance, add a rule to open the HTTPS ports to the world.
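
As a rough illustration of the notes above, the variables can be checked before they are handed to the installer. This is only a sketch: `require_var` and `check_devkit_vars` are hypothetical helpers, not part of BigDataDevKit, and the only rules they apply are the ones listed in the notes (all four variables set, password changed from the placeholder).

```shell
#!/bin/bash
# Hypothetical pre-flight check (not part of BigDataDevKit).

require_var() {
  # $1 = variable name (used in the message), $2 = its current value
  if [ -z "$2" ]; then
    echo "missing: $1" >&2
    return 1
  fi
}

check_devkit_vars() {
  # All four variables from the launch script must be non-empty
  require_var PASSWORD "${PASSWORD:-}" || return 1
  require_var TOKEN "${TOKEN:-}" || return 1
  require_var HOST "${HOST:-}" || return 1
  require_var EMAIL "${EMAIL:-}" || return 1
  # Reject the placeholder password from the example script
  if [ "$PASSWORD" = "changeme" ]; then
    echo "PASSWORD is still the placeholder value" >&2
    return 1
  fi
}
```

Calling `check_devkit_vars` after setting the variables returns a non-zero status and prints the reason on standard error if something is missing.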

Spark / Hadoop / Zeppelin devkit

Docker kit for Hadoop, Spark and Zeppelin

Devenv with IntelliJ, SBT and Ammonite accessible via web

Usage:

First, get a Docker machine and configure your Docker client to access it. Refer to the Docker documentation to learn how to do it.

The script sh build.sh <password> builds the environment.

Start it with docker-compose up -d.

That is all.
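
The two steps above can be combined in a small wrapper. This is a sketch under assumptions: it must be run from the directory that contains build.sh, and the `run` and `start_devkit` functions and the `DRY_RUN` switch are hypothetical additions for illustration (with `DRY_RUN=1` the commands are printed instead of executed).

```shell
#!/bin/bash
# Sketch of the build-and-start sequence; not part of the kit itself.

run() {
  # Print the command when DRY_RUN=1, otherwise execute it
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

start_devkit() {
  # $1 = password passed to build.sh
  if [ -z "${1:-}" ]; then
    echo "usage: start_devkit <password>" >&2
    return 1
  fi
  # Build the environment, then start the containers in the background
  run sh build.sh "$1" || return 1
  run docker-compose up -d
}
```

For example, `DRY_RUN=1 start_devkit mypassword` shows the commands that would run without touching Docker.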

What is in the kit

Access the shell at http://yourserver:3000 and the desktop at http://yourserver:6080

The kit includes the free edition of IntelliJ and a terminal with sbt and Ammonite

Inside the kit you also have Zeppelin, internally accessible at http://zeppelin.loc:8000, Hadoop at hdfs://hadoop.loc:8020 and Spark at http://spark.loc

(to fix) You can also ssh (without a password) into hadoop.loc, spark.loc and zeppelin.loc
