Coder Social home page Coder Social logo

mijikm / exploratory-data-analysis Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 14.44 MB

Exploring the Australian Energy generation data set and the twitter data set using Python

exploratory-data-analysis energy-generation twitter-dataset python jupyter-notebook

exploratory-data-analysis's Introduction

Exploratory data analysis and visualisation

Purpose

The purpose of this project is to investigate and visualise data using several data science tools. The statistics related to all electricity generation in Australia is explored primarily through visualisation with tools such as motion chart and linear regression. The pre-processed tweets about bushfires in Australia is also investigated and explored through the process of exploratory data analysis (EDA).

Version

1 May 2020

User Instructions

  • The Python code written to analyse and plot the data is a Jupyter notebook file. To run the Jupyter notebook file, energy_data.xlsx and twitter_data.csv should be placed in the same location as Exploratory-Data-Analysis.ipynb.
  • Alternatively, it can be viewed in Exploratory-Data-Analysis.html or Exploratory-Data-Analysis.pdf.

Interesting Findings

  • Energy dataset: As you can see from the motion chart below, the reliance on Wind increased significantly from 2009 to 2018 in South Australia, while that on Natural gas had been stagnant. On the other hand, the reliance on Natural gas continuously increased over time in Western Australia, while that on Wind fluctuated during the same time.

   

  • Twitter dataset: Seen from the violin plot below, the interquartile range (IQR) which is marked as red dots does not vary much among twitter age groups. Regarding the median which is represented as the white dot, it is noted that the median of the group "0-2" and the group "1-2" is lower than any other group. This means that the authors who had been using Twitter less than 2 years had less tweet length, on average, compared to the authors using Twitter more than 2 years.

   

Data Sources

exploratory-data-analysis's People

Contributors

mijikm avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.