Topic: aws-emr
Something interesting about aws-emr
aws-emr,An AWS-based solution that uses AWS CloudWatch and a Python-based AWS Lambda function to automatically terminate AWS EMR clusters that have been idle for a specified period of time.
User: abdullahkhawer
aws-emr,A Lambda function that starts EMR and runs a MapReduce job
User: abhibalani
aws-emr,Spark 2.0 Python Machine Learning examples
User: adornes
aws-emr,Spark 2.0 R/SparkR Machine Learning examples
User: adornes
aws-emr,Spark 2.0 Scala Machine Learning examples
User: adornes
aws-emr,Cloud-based AI / ML workflow and data application development framework
Organization: amzn
aws-emr,Build modern workflows with AWS MWAA, AWS Step Functions, AWS Glue, and AWS EMR
User: aufeld
aws-emr,Use aws-emr and aws-redshift to analyse the US adult census dataset
Organization: aws-big-data-projects
aws-emr,Analyzing Big Data with Amazon EMR
Organization: aws-big-data-projects
aws-emr,Run a Spark job within Amazon EMR
Organization: aws-big-data-projects
aws-emr,Use-Case: Airline on-time performance
User: bajaj-varun
aws-emr,Bits of code I use during live demos
User: dacort
aws-emr,A cookiecutter template for working with PySpark on AWS EMR
User: daniel-cortez-stevenson
Home Page: https://daniel-cortez-stevenson.github.io/cookiecutter-pyspark-cloud/
aws-emr,This project analyzes the correlation between COVID-19 and the US aviation industry. By studying data on passenger/freight traffic and delays alongside COVID-19 trends, it provides insights into airline and passenger responses. The findings help airlines adapt to the pandemic's impact.
User: dhruv007patel
aws-emr,Data Engineering Projects including Data Modeling, Data Warehouse, Data Lake Development
User: dvu4
aws-emr,EMR + Hadoop to Redshift ELT workflow using the Spark steps API and orchestrated by Apache Airflow, which ingests disparate datasets centered on about 7 GB of I94 arrivals information to produce a simple star schema in Redshift
User: felipeazucares
aws-emr,Generic Python library for provisioning EMR clusters from YAML config files (Configuration as Code)
User: harshadranganathan
aws-emr,Assignments belonging to the course Supercomputing for Big Data (ET4310) at TU Delft
User: huizerd
aws-emr,A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):
User: ismaildawoodjee
Home Page: http://54.169.163.221:8080
aws-emr,MapReduce Analysis on Amazon Food Review Dataset (Big-Data)
User: jaintanisha
aws-emr,Create a Data Lake on AWS S3 to store dimensional tables after processing data with Spark on an AWS EMR cluster
User: jkoth
aws-emr,ETL pipeline with PySpark on EMR orchestrated with Airflow
User: jomavera
aws-emr,Distributed data processing project using Python, MRJob, and AWS EMR
User: jonathanamanciosales
aws-emr,⭐ CLI tool to launch Spark jobs on AWS EMR
Organization: jwplayer
Home Page: http://spark-steps.readthedocs.io/en/latest/
aws-emr,Daily incremental-load ETL pipeline for an e-commerce company using AWS Lambda and an AWS EMR cluster, deployed using Apache Airflow in a Docker container.
User: khushal2405
aws-emr,An ETL pipeline built with Airflow that downloads data from an AWS S3 bucket, runs a Spark/Spark SQL job on the downloaded data to produce a cleaned-up dataset of orders that missed their delivery deadline, and then uploads the cleaned-up dataset back to the same S3 bucket in a folder primed for higher-level analytics
User: khushal2405
aws-emr,Illustrates access to an S3 bucket owned by a different account from instances in an EMR cluster
User: krishnan-mani
aws-emr,A Spark application, written in Python, that finds strongly connected components using the Bi-directional Label Propagation algorithm. The project was run on a 1.3 GB Twitter network dataset on an AWS EMR cluster.
User: linghaol
aws-emr,AWS EMR Docker integration
User: mauropelucchi
aws-emr,A Hadoop Map-Reduce job to process the DBLP dataset to produce a graph depicting which professors at the CS department of UIC have co-authored publications.
User: mayankrastogi
aws-emr,A Spark application that processes the DBLP dataset to compute the PageRank of faculty at the UIC CS department based on their co-authorships on publications.
User: mayankrastogi
aws-emr,Understanding AWS and building a Data Lake with AWS
User: micopes
aws-emr,Data Science and Engineering project - Programming for Big Data @ Simon Fraser University (SFU)
User: ninjeanne
aws-emr,An ETL pipeline that extracts data from S3, processes it using Spark, and loads the data back into S3 as a set of dimensional tables
User: nitinspatil15
aws-emr,Data Analysis Exercise over Walmart Stock
User: pratikbarjatya
aws-emr,CMPT 732 Project - Joined 3 large-scale databases to analyze the economic impact of Covid-19 on the airline industry. Fetched data through an API and stored it in AWS S3, from which an AWS EMR cluster retrieves it for computation. Queried the results in AWS Athena and visualized them in Tableau with static and dynamic dashboards.
User: rahilbalar98
aws-emr,This repository will be used to understand data science and data engineering concepts
User: ricardo-farias
aws-emr,Load data from the Million Song Dataset into a final dimensional model stored in S3.
User: rigganni
aws-emr,Analysis and monitoring system using AWS... Also the comp4442 project
User: samchenghowing
Home Page: http://comp4442frontend.s3-website-us-east-1.amazonaws.com/
aws-emr,Analysed New York City's Yellow taxi data set with Big Data tools such as Hadoop, HBase, Sqoop, MapReduce and AWS Cloud Infrastructure.
User: shinde-chandrakant
aws-emr,A large-scale data framework that will enable us to store and analyze financial market data and drive future predictions for investment.
User: sjmiller8182
Home Page: https://sjmiller8182.github.io/Warehousing-Stock-Tweet-Data/
aws-emr,Terraform module to create AWS EMR resources 🇺🇦
Organization: terraform-aws-modules
Home Page: https://registry.terraform.io/modules/terraform-aws-modules/emr/aws
aws-emr,My AWS Playground
User: wingkwong
aws-emr,The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
User: wittline
Home Page: https://wittline.github.io/pyspark-on-aws-emr/
aws-emr,A Grafana-based application to assist Big Data infrastructure optimization initiatives where Spark applications are a dominant cost driver
Organization: xonai-computing
Home Page: https://xonai.io
aws-emr,A collection of sample Airflow workflows for data processing on AWS
User: ychantit