Coder Social home page Coder Social logo

just-count's Introduction

Counting the prevelance of "just" in emails from the Enron email dataset

This project is inspired by a recent article by Ellen Petry Leanse where she suggests that gender disparities often exist in the way the word "just" is used in emails. Here we wish to quantify the prevelance of the word just by measuring its frequency in emails. We choose the Enron email data set as our collection of emails to measure the frequency of "just" since:

  • the data is publicly available and free to use
  • the data reflects emails sent in a corporate setting. That is the specific environment Ellen Petry Leanse suggests that use of the word "just" is especially harmful.

Results

To focus on the people who were actively engaged in Enron, we select email senders who have sent 5 or more email messages in the data set. We call these the "active" participants in Enron and its company affairs. This leaves us with the top 25% of email senders.

For each email message, we measure two quantities:

  • the number of times the word "just" is used in each email message
  • the number of character used in each email message

We then compute the median number of times each sender uses the word "just" in all of her/his email messages and also the median number of characters each sender uses in all of her/his email messages. Here we show the visualization of what the relationship between these statistics and the number of emails sent:

"just" rate each sender uses in her/his emails from the Enron email dataset <script data-plotly="crude2refined:1245" src="https://plot.ly/embed.js" async></script>

Click the figure above to interact with the data and results directly.

Credits

Many thanks to Arne Hendrik Schulz for processing the Enron email dataset.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.