hh-viewership

Steps:

For data-pipeline:

  1. Build/package for aws data pipeline:
    • ./build-data-pipeline.sh
    • it will push the artifacts to s3://.../data-pipeline/hh-viewership bucket
  2. Run ./loop.sh AWS_ACCESS_KEY_ID AWS_SECRET_ACCESS_KEY within the data pipeline:
    • It first runs ./precondition to identify the available data and the last run dates
    • It then loops over those dates with the ./run script, passing the date and the AWS key/secret
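
The loop step above can be pictured with a small sketch. This is not the actual contents of loop.sh; `date_range`, `run_range`, and `RUN_CMD` are hypothetical names introduced only for illustration, and the sketch assumes GNU date (as found on Amazon Linux) and dates in the YYYYMMDD form used elsewhere in this README.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print every date from START to END (inclusive) as YYYYMMDD.
# Assumes GNU date; BSD/macOS date uses different flags.
date_range() {
  local current="$1" end="$2"
  while [ "$current" -le "$end" ]; do
    echo "$current"
    # Advance by one calendar day; -u avoids daylight-saving surprises.
    current=$(date -u -d "$current +1 day" +%Y%m%d)
  done
}

# Hypothetical driver: invoke a per-day command once for each date in the range.
# RUN_CMD defaults to echo so the sketch is safe to execute as-is;
# a real loop would call the ./run script with the date and credentials instead.
run_range() {
  local start="$1" end="$2" day
  for day in $(date_range "$start" "$end"); do
    "${RUN_CMD:-echo}" "$day"
  done
}
```

For example, `run_range 20160630 20160702` prints the three dates 20160630, 20160701, and 20160702, one per line. Comparing the dates as integers works because YYYYMMDD values sort in calendar order.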

For EC2:

  1. Build/package for EC2 Linux:
    • Prep:
      • cp loop-secure.sh loop.sh
      • inside loop.sh, replace the following with your values:
        • AWS_ACCESS_KEY_ID
        • AWS_SECRET_ACCESS_KEY
    • Build:
      • ./build-ec2.sh
  2. Launch EC2 instance:
    • c4.2xlarge
    • 500GB EBS
    • daap-s3-role
    • default security group (default VPC)
    • Setup:
      • ./configure.sh
        • /data
        • chown ec2-user:ec2-user /data
        • yum install -y tree
  3. Deploy:
    • copy archive.zip (produced by build-ec2.sh) to /data
    • create a working dir
    • unzip archive.zip under the working dir
  4. Run:
    • A. Single day:
      • nohup ./run-hh-viewership.sh <AWS_ACCESS_KEY_ID> <AWS_SECRET_ACCESS_KEY> rovi-cdw data_downloader_tracker.txt cdw_downloads_logs input_compressed_cdw_data cdw-data-reports tv_viewership.cod event/tv_viewership mso-list.csv <20160702> &
    • B. Range of days since last successful run:
      • nohup ./loop.sh AWS_ACCESS_KEY_ID AWS_SECRET_ACCESS_KEY &
  5. Monitor:
    • tail nohup.out
    • tree events
    • tree cdw-data-reports
    • df -h
    • free -m
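
The single-day run in step 4A takes the day as its last argument in YYYYMMDD form (for example 20160702). A minimal sketch of a pre-flight check follows; `is_valid_day` is a hypothetical helper that is not part of the repository, and it assumes bash and GNU date.

```shell
#!/usr/bin/env bash
# Hypothetical check: succeed only if $1 is a real calendar date in YYYYMMDD form.
# Assumes GNU date, which rejects impossible dates such as 20160231.
is_valid_day() {
  local day="$1"
  # Must be exactly eight digits.
  [[ "$day" =~ ^[0-9]{8}$ ]] || return 1
  # Round-trip through date: a valid day formats back to itself.
  [ "$(date -u -d "$day" +%Y%m%d 2>/dev/null)" = "$day" ]
}
```

A wrapper could call `is_valid_day "$DAY" || exit 1` before starting the nohup job, so that a mistyped date fails immediately rather than partway through a long run.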
