danielbrock4 / bigdata_amazon_vine_analysis
Using PySpark, I performed the ETL process on a large dataset (170,000 rows) of video game reviews. Next, I created an AWS relational database instance and transformed the data so it could be loaded into PostgreSQL. Once the data was in pgAdmin, I exported the video game review table as a CSV file. Afterward, I loaded the CSV into Python and built DataFrames with Pandas. Finally, I analyzed the data to determine whether paid reviews showed bias compared with unpaid reviews.
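The final comparison step can be illustrated with a minimal Pandas sketch. It is not the repository's actual code: the column names `vine` (`"Y"` for paid, `"N"` for unpaid) and `star_rating` are assumptions about the exported table, the helper `five_star_share` is hypothetical, and the rows are toy values made up for illustration, not results from the dataset.

```python
import pandas as pd


def five_star_share(reviews: pd.DataFrame) -> pd.DataFrame:
    """Return review counts and the 5-star percentage per `vine` group.

    Assumes (hypothetically) that `vine` is "Y" for paid reviews and "N"
    for unpaid reviews, and that `star_rating` is an integer from 1 to 5.
    """
    # Flag 5-star reviews, then count rows and flagged rows per group.
    flagged = reviews.assign(is_five=reviews["star_rating"].eq(5))
    summary = flagged.groupby("vine")["is_five"].agg(
        total_reviews="size",
        five_star_reviews="sum",
    )
    summary["five_star_pct"] = (
        100 * summary["five_star_reviews"] / summary["total_reviews"]
    )
    return summary


# Toy rows for illustration only; in practice the exported CSV would be
# loaded with pd.read_csv instead.
sample = pd.DataFrame(
    {
        "vine": ["Y", "Y", "N", "N", "N", "N"],
        "star_rating": [5, 4, 5, 1, 3, 2],
    }
)

summary = five_star_share(sample)
print(summary)
```

A noticeably higher `five_star_pct` for the paid group than for the unpaid group would be one rough signal of bias. With a real dataset, the paid group may be small, so the gap is worth checking against the group sizes before drawing conclusions.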