
amazonkindlereviews's Introduction

coding duck MX

Data Scientist // Mathematician

Twitter | Blog | LinkedIn

Predicting star ratings from Kindle book reviews

For this project, I set out to predict how many stars a customer is most likely to give, based on the text of their written review.

Procedure

I started from a data set of about 2 million Kindle reviews, but due to computational limitations I was forced to work with only a random 10% sample of it.
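A minimal sketch of how such a 10% sample could be drawn without loading the whole file into memory. It assumes the reviews are stored as a gzipped file with one JSON object per line; the function name `sample_reviews`, the file path, and the seed are illustrative and not taken from the notebooks.

```python
import gzip
import json
import random

import pandas as pd


def sample_reviews(path, fraction=0.10, seed=42):
    """Stream a gzipped JSON-lines file and keep each record with
    probability `fraction` (hypothetical helper, not the original code)."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    rows = []
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if rng.random() < fraction:
                rows.append(json.loads(line))
    return pd.DataFrame(rows)
```

Because each line is kept independently, the result holds roughly, not exactly, 10% of the records. A call such as `sample_reviews("reviews.json.gz")` (hypothetical path) returns a DataFrame with one column per JSON field.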

Even though this project was challenging for me, it pushed me into topics I had not been taught, such as Natural Language Processing (NLP). As a result, the work focuses more on statistical tools than on NLP tools.
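To illustrate what a statistics-first approach to review text might look like, here is a small sketch that turns a review into simple numeric features using only the standard `string` module. The feature set and the name `basic_text_features` are assumptions made for illustration; the notebooks may use different features.

```python
import string

PUNCTUATION = set(string.punctuation)


def basic_text_features(text):
    """Return simple count-based features for one review (illustrative only)."""
    words = text.split()
    # Strip surrounding punctuation before measuring word length.
    word_lengths = [len(w.strip(string.punctuation)) for w in words]
    return {
        "n_chars": len(text),
        "n_words": len(words),
        "n_punct": sum(1 for ch in text if ch in PUNCTUATION),
        "n_exclaim": text.count("!"),
        "mean_word_len": sum(word_lengths) / len(words) if words else 0.0,
    }
```

Features like these can be compared across star ratings with ordinary descriptive statistics before any heavier NLP tooling is introduced.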

Deployment

As a stretch goal, I deployed an app on Heroku where you can try out how this works. You can see it here.

Result

Some highlights of the data analysis can be read in my blog post.

In this repository:

Here are the final notebooks and some pickles I had to make due to the size of the data set.
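For readers unfamiliar with pickles, the sketch below shows one way an intermediate object such as a sampled DataFrame could be saved and reloaded, so the large raw file does not have to be reprocessed. The helper names and file name are hypothetical.

```python
import pickle

import pandas as pd


def save_pickle(obj, path):
    """Serialize any picklable object to disk."""
    with open(path, "wb") as handle:
        pickle.dump(obj, handle, protocol=pickle.HIGHEST_PROTOCOL)


def load_pickle(path):
    """Load an object previously written by save_pickle."""
    with open(path, "rb") as handle:
        return pickle.load(handle)
```

For example, `save_pickle(df, "sample.pkl")` followed later by `df = load_pickle("sample.pkl")`. Only load pickle files from sources you trust, since unpickling can execute arbitrary code.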

Built With

  • gzip
  • json
  • pandas as pd
  • numpy as np
  • matplotlib.pyplot as plt
  • urllib.request.urlopen
  • string
  • seaborn
  • pickle
  • Heroku

Version

This is the very first version. Someday I'd like to use more powerful NLP tools and compare the new results.

Sources
