anthonywong611 / batch-etl-with-aws-emr-and-mwaa Goto Github PK
View Code? Open in Web Editor NEWCreate a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extracts data from S3, transform data using spark, load transformed data back to S3.