Coder Social home page Coder Social logo

airqualityindex's Introduction

AirQualityIndex

Delhi Air Quality Index calculation, EDA and Regression models

Dataset Link: https://www.kaggle.com/datasets/deepaksirohiwal/delhi-air-quality

• Developed an algorithm to calculate the Air Quality Index (AQI) for hourly pollutant values and assigned corresponding labels based on established guidelines. • Constructed a predictive model using the Random Forest Regressor algorithm to accurately forecast the AQI, leveraging insights gained from exploratory data analysis (EDA) and comprehensive dataset preprocessing techniques. • Evaluated the performance of the predictive model on previously unseen data, achieving an impressive accuracy rate of 82% and demonstrating its robustness and reliability.

Addition of calculated AQI to dataset

image

Addition of AQI labels based on AQI values

image

AQI distribution

image
The majority of the data points have AQI's between 0 and 150 which means that the livability condition is between Good and Unhealthy for sensitive groups.

EDA

image
The concentration of CO in the atmosphere is greater during the weekdays of winter and autumn as compared to the weekends in those seasons, because a lot less number of trains and cars are running on weekends to commute people to and from their work.
Summer and Monsoon have similar concentrations of CO during the weekdays and the weekends.
As compared to other seasons, Monsoons are rightly the least polluted ones which makes sense because a lot of pollution matter are washed away from the atmosphere, flowing away the rain into the ground.

image
The air quality index reaches a peak during the 3:00 PM hour of the day. One of the primary reasons is the increased vehicular traffic till 3:00 PM and from then the Air quality index keeps reducing as there traffic go down.

image
NO2 and SO2 are not directly related to CO. However, the presence of NO2 and SO2 in the air can indirectly affect the levels of CO in the atmosphere.

Feature Importance

image

Accuracy of RandomForrestRegressor

There were 2907 number of labels correctly predicted in the test dataset out of 3643 instances, with an error scope of 16.
Accuracy Percentage : 79.79687071095252

airqualityindex's People

Contributors

anagha0202 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.