phuochaihuynh / project-arrhythmia

This project forked from shsarv/project-arrhythmia

A machine learning project to distinguish between the presence and absence of cardiac arrhythmia and to classify it into one of 16 groups.

License: Apache License 2.0

Jupyter Notebook 100.00%

Project-Arrhythmia

Introduction

The dataset used in this project is available from the UCI Machine Learning Repository at https://archive.ics.uci.edu/ml/datasets/Arrhythmia.

  • It consists of 452 examples spread over 16 classes. Of the 452 examples, 245 are of "normal" people.
  • The remaining examples cover 12 different types of arrhythmia. Among these, the most represented are "coronary artery disease" and "right bundle branch block".
  • There are 279 features, which include the age, sex, weight, and height of the patients along with other related information.
  • The number of features is high relative to the number of examples available.
  • Our goal was to predict whether a person has arrhythmia and, if so, to classify it into one of the 12 available groups.
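
The two-stage goal above (arrhythmia or not, then which group) can be sketched as follows. This is a minimal illustration, not the notebook's actual code. It assumes the UCI file layout: comma-separated rows with no header, the class label in the last column, class 1 meaning "normal", and "?" marking a missing value. The function names and the shortened sample rows are hypothetical; real rows carry 279 features.

```python
import csv
import io
import math

NORMAL_CLASS = 1  # assumption: class 1 is "normal" in the UCI labelling


def parse_rows(text):
    """Split raw CSV text into feature rows and integer class labels."""
    features, labels = [], []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue
        *values, label = row
        # "?" marks a missing measurement; keep it as NaN for later imputation.
        features.append(
            [math.nan if v.strip() == "?" else float(v) for v in values]
        )
        labels.append(int(label))
    return features, labels


def to_binary(labels):
    """Map the multi-class label to 0 (normal) or 1 (any arrhythmia)."""
    return [0 if c == NORMAL_CLASS else 1 for c in labels]


# Made-up, shortened rows for illustration only.
sample = "75,0,190,80,?,8\n56,1,165,64,81,1\n54,0,172,95,?,10\n"
X, y = parse_rows(sample)
y_binary = to_binary(y)
print(y, y_binary)
```

Keeping the original multi-class labels alongside the binary ones lets the same rows serve both the detection task and the group classification task.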

Directory Structure

Algorithms Used

  1. KNN Classifier
  2. Logistic Regression
  3. Decision Tree Classifier
  4. Linear SVC
  5. Kernelized SVC
  6. Random Forest Classifier
  7. Principal Component Analysis (PCA)

Result

Conclusion

The models performed better after we applied PCA to the resampled data, because PCA reduces the complexity of the data. It builds components that prioritize the directions of largest variance, and those components are uncorrelated with one another, which removes collinearity among the original variables. With fewer input dimensions, PCA also shortens execution time and can improve model quality, which is especially useful when working with a large number of variables.
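
The claim that PCA yields uncorrelated components can be checked with a small sketch. The code below is an illustrative NumPy implementation via singular value decomposition, run on synthetic data with deliberately collinear columns. It is not the project's code, and the function name `pca_fit_transform` and the data are assumptions made for this example.

```python
import numpy as np


def pca_fit_transform(X, n_components):
    """Project X onto its top principal components using SVD."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # center each feature
    # Rows of Vt are orthonormal directions, ordered by decreasing variance.
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]
    explained_variance = (S ** 2) / (X.shape[0] - 1)
    ratio = explained_variance[:n_components] / explained_variance.sum()
    return Xc @ components.T, components, ratio


rng = np.random.default_rng(0)
# Synthetic stand-in: 50 samples, 6 features, where the last three
# columns are near-copies of the first three (strong collinearity).
base = rng.normal(size=(50, 3))
X = np.hstack([base, base + 0.01 * rng.normal(size=(50, 3))])

Z, components, ratio = pca_fit_transform(X, n_components=3)
corr = np.corrcoef(Z, rowvar=False)
print(Z.shape)           # reduced from 6 columns to 3
print(ratio.sum())       # share of total variance kept
print(np.round(corr, 6)) # off-diagonal entries are numerically zero
```

Because PCA favors high-variance directions, features on very different scales are usually standardized first; that step is left out here to keep the sketch short.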

The best model in terms of recall score is the kernelized SVM with PCA, which has an accuracy of 80.21%.
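
Recall and accuracy measure different things, so a model ranked by one can score quite differently on the other. The sketch below computes both for a binary case using made-up labels. It is only an illustration of the two definitions; the helper name and the numbers are hypothetical and are not the project's results.

```python
def accuracy_and_recall(y_true, y_pred, positive=1):
    """Return (accuracy, recall) for the given positive class."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    correct = sum(t == p for t, p in pairs)
    true_pos = sum(t == positive and p == positive for t, p in pairs)
    false_neg = sum(t == positive and p != positive for t, p in pairs)
    accuracy = correct / len(pairs)
    # Recall: the share of actual positives that the model found.
    actual_pos = true_pos + false_neg
    recall = true_pos / actual_pos if actual_pos else 0.0
    return accuracy, recall


# Made-up labels: 1 = arrhythmia, 0 = normal.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 0, 0, 1]
accuracy, recall = accuracy_and_recall(y_true, y_pred)
print(accuracy, recall)
```

In a screening setting, a missed arrhythmia (false negative) is typically the costlier error, which is a common reason to compare models on recall rather than accuracy alone.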

Contributors

shsarv
