Text Analytics

Tableau Dashboard Link

Project Overview

This text mining project analyzes messages sent through Between, a chat app for couples.

Totals

  • 1,113 days
  • 144,011 messages
  • 694,967 words

Sender                 BF        GF
Number of Messages     80,007    64,004
Word Count             384,341   310,626
Avg Message Length     24.3      24.6
Avg Messages Per Day   89        72

Part 1: Data Cleaning and Preprocessing

  • Export the data from the app as a text file and convert it to an .xlsx file
  • Import the .xlsx file as a CSV and wrangle it into a pandas DataFrame
  • Extract the following features: date sent, time sent, sender name, message body
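The feature extraction step can be sketched roughly as follows. The source does not show the Between export format, so the line layout matched here (`YYYY-MM-DD HH:MM, Sender : message`) is a hypothetical stand-in, and the function name `parse_export` is invented for illustration.

```python
import re
from datetime import datetime

# Hypothetical line layout; the real Between export may differ.
LINE_RE = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}), "
    r"(?P<sender>[^:]+?) : (?P<body>.*)$"
)

def parse_export(lines):
    """Turn raw export lines into records with date, time, sender and body."""
    records = []
    for raw in lines:
        text = raw.strip()
        match = LINE_RE.match(text)
        if match is None:
            # Treat unmatched, non-empty lines as a continuation of the
            # previous message (an assumption about multi-line messages).
            if records and text:
                records[-1]["body"] += "\n" + text
            continue
        sent_at = datetime.strptime(
            f"{match['date']} {match['time']}", "%Y-%m-%d %H:%M"
        )
        records.append({
            "date": sent_at.date(),
            "time": sent_at.time(),
            "sender": match["sender"].strip(),
            "body": match["body"],
        })
    return records

# Made-up sample lines, not taken from the project data.
sample = [
    "2019-03-01 08:15, BF : good morning",
    "2019-03-01 08:17, GF : morning! coffee later?",
    "or maybe lunch",
]
records = parse_export(sample)
```

A list of dicts like this can be passed to `pandas.DataFrame(records)` to get one column per feature.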

Part 2: Basic EDA

  • Frequency of total messages sent
  • Frequency of messages sent throughout days of the week
  • Compare message frequency between BF and GF
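As a minimal sketch of the weekday comparison, the counts per sender and day of the week can be tallied with the standard library alone. The sample data and the helper name `weekday_counts` are made up for illustration; the project itself may have used pandas grouping instead.

```python
from collections import Counter
from datetime import date

WEEKDAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]

def weekday_counts(messages):
    """Count messages per (sender, weekday) from (date, sender) pairs."""
    counts = Counter()
    for sent_on, sender in messages:
        counts[(sender, WEEKDAYS[sent_on.weekday()])] += 1
    return counts

# Made-up sample data, not taken from the project.
messages = [
    (date(2019, 3, 1), "BF"),  # a Friday
    (date(2019, 3, 1), "GF"),
    (date(2019, 3, 1), "BF"),
    (date(2019, 3, 2), "GF"),  # a Saturday
]
counts = weekday_counts(messages)
for sender in ("BF", "GF"):
    # One count per weekday, Monday first; missing keys count as 0.
    print(sender, [counts[(sender, day)] for day in WEEKDAYS])
```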

Part 3: WordClouds

  • Overall WordCloud from all message history
  • WordCloud from BF messages vs. WordCloud from GF messages
  • Look for common words, themes, patterns
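A word cloud is driven by word frequencies, so the per-sender and overall comparison can be sketched as a counting step. The stop-word list and sample messages below are illustrative assumptions, not the project's data.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list; a real analysis would use a fuller one.
STOP_WORDS = {"a", "an", "and", "i", "is", "it", "the", "to", "you"}

def word_frequencies(messages):
    """Count lowercase word occurrences, skipping stop words."""
    counts = Counter()
    for body in messages:
        for word in re.findall(r"[a-z']+", body.lower()):
            if word not in STOP_WORDS:
                counts[word] += 1
    return counts

# Made-up messages for illustration only.
by_sender = {
    "BF": ["Dinner tonight?", "I miss you", "dinner was great"],
    "GF": ["miss you too", "Movie tonight"],
}
freqs = {sender: word_frequencies(msgs) for sender, msgs in by_sender.items()}
# Adding Counters merges the per-sender counts into an overall tally.
overall = sum(freqs.values(), Counter())
print(overall.most_common(3))
```

If the `wordcloud` package is used for rendering, a mapping like `overall` can be handed to `WordCloud().generate_from_frequencies(...)`; whether the project did it this way is not stated in the source.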

Part 4: Sentiment Analysis

  • Evaluate sentiment scores from VADER
  • Evaluate polarity scores from TextBlob
  • Compare VADER and TextBlob results
  • See changes in sentiment over time
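To follow sentiment over time, one option is to average both scores per calendar month. The sketch below assumes each message already carries a VADER compound score and a TextBlob polarity, both of which range from -1 to 1. The scores shown are placeholders rather than real model output, and `monthly_means` is a hypothetical helper name.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

def monthly_means(scored):
    """Average VADER and TextBlob scores per calendar month.

    `scored` holds (date, vader_compound, textblob_polarity) tuples.
    """
    buckets = defaultdict(list)
    for sent_on, vader, blob in scored:
        buckets[(sent_on.year, sent_on.month)].append((vader, blob))
    # Sort by (year, month) so the result reads chronologically.
    return {
        month: (mean(v for v, _ in pairs), mean(b for _, b in pairs))
        for month, pairs in sorted(buckets.items())
    }

# Placeholder scores for illustration only, not real model output.
scored = [
    (date(2019, 3, 1), 0.5, 0.25),
    (date(2019, 3, 9), -0.5, -0.25),
    (date(2019, 4, 2), 0.75, 0.5),
]
trend = monthly_means(scored)
```

With the libraries installed, the two inputs would typically come from `SentimentIntensityAnalyzer().polarity_scores(text)["compound"]` (vaderSentiment) and `TextBlob(text).sentiment.polarity`.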

Part 5: Text Classification

  • Find words that best define the overall message history using spaCy
  • KMeans clustering to look for common themes/patterns
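The clustering idea can be illustrated with a small plain-Python k-means (Lloyd's algorithm) over bag-of-words vectors. This is a teaching sketch under stated assumptions, not the project's code: the source does not say which features were clustered, the messages are made up, and the farthest-point initialisation is a simplification (scikit-learn's `KMeans` defaults to k-means++ and would be the usual choice in practice).

```python
from collections import Counter

def vectorize(messages):
    """Build bag-of-words count vectors over a shared, sorted vocabulary."""
    tokenized = [msg.lower().split() for msg in messages]
    vocab = sorted({word for words in tokenized for word in words})
    vectors = []
    for words in tokenized:
        counts = Counter(words)
        vectors.append([float(counts[word]) for word in vocab])
    return vectors

def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, iterations=20):
    """Assign points to the nearest centroid, then re-average, until stable."""
    # Farthest-point initialisation: start from the first vector, then keep
    # adding the vector farthest from the centroids chosen so far.
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        farthest = max(
            vectors,
            key=lambda v: min(squared_distance(v, c) for c in centroids),
        )
        centroids.append(list(farthest))
    labels = None
    for _ in range(iterations):
        new_labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [v for v, label in zip(vectors, labels) if label == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Made-up messages for illustration only.
messages = [
    "dinner pizza tonight",
    "dinner pasta tonight",
    "movie cinema tickets",
    "movie theater tickets",
]
labels = kmeans(vectorize(messages), k=2)
print(labels)
```

Messages that share vocabulary end up with the same label, which is the kind of grouping used to look for common themes.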

Data Visualizations

Contributors

chelynl