Coder Social home page Coder Social logo

wall-e28 / amazon_vine_analysis Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 153 KB

The purpose of the project is to analyze Amazon reviews written by members of the paid Amazon Vine program using AWS, PySpark, and SQL.

Jupyter Notebook 100.00%
aws google-collab amazon vine-reviews postgresql pyspark

amazon_vine_analysis's Introduction

Amazon Vine Analysis

Overview

The purpose of the project is to analyze Amazon reviews written by members of the paid Amazon Vine program. In order to complete this analysis, I used a subset of data that pertained to the video game subcategory of Amazon. I then performed an ETL on this data by using AWS, Google Colaboratory, PostgreSQL, and PySpark. After that, I took a closer look at one of the tables I made during the ETL phase regarding the vine reviews to determine if there was any positivity bias for reviews in the Vine Program.

Results

During this analysis I wanted to answer a few key questions.

  1. How many Vine reviews and non-Vine reviews were there?
  2. How many Vine reviews were 5 stars? How many non-Vine reviews were 5 stars?
  3. What percentage of Vine reviews were 5 stars? What percentage of non-Vine reviews were 5 stars?

Here are my answers to these questions.

  1. There are 94 vine reviews and 40,471 non-vine reviews for a total of 40,565 reviews.

total_reviews

  1. There were 48 5-star vine reviews and 15663 5-star non-vine reviews for a total of 15,711 5-star reviews.

5_star_reviews

  1. 51.06% of paid reviews are 5-stars and 38.7% of unpaid reveiws are 5-stars.

percentage_5_stars

Summary

After looking that the results, I would conclude that there is a positivity bias for reviews in the Vine program. It is important to note that while the non-vine sample size was very large, the vine sample size was less than 100 entries. While 94 entires is still a decent number to sample with, it could lead a less signifcant result. In addition to the current analysis, I could take it a step further and see the percentage of those who purchased the product by filtering through the verified_purchase column to either confirm or fail to confirm if there is a positivity bias for reviews in the Vine program.

amazon_vine_analysis's People

Contributors

wall-e28 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.