gveerashekar / data-lake-using-aws-emr-pyspark-and-s3 Goto Github PK
View Code? Open in Web Editor NEWThis project forked from pandilwar605/data-lake-using-aws-emr-pyspark-and-s3
Building an ETL pipeline that extracts data from S3, processes it using Spark, and loads the data back into S3 as a set of dimensional tables.