Coder Social home page Coder Social logo

zeba-khadhijah / diagnosing_pcos_decision_trees Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 630 KB

R project to diagnose PCOS using a decision tree algorithm. Use of machine learning models has scope to reduce healthcare costs, increase attention towards PCOS diagnosis and ultimately improve healthcare and quality of life for women.

R 100.00%
decision-trees pcos-detection

diagnosing_pcos_decision_trees's Introduction

Diagnosing_PCOS

Polycystic ovary syndrome (PCOS) is a common condition that affect women and are characterized by having two or more of the following features: irregular periods, excess male hormones that may lead to excess facial and body hair growth, polycystic ovaries (enlarged ovaries containing fluid filled sacks called follicles).

The dataset used in this project contains numerous physical and clinical parameters to determine PCOS and infertility related issues. The data has been collected from 10 different hospital across Kerala,India and is available to access freely from https://www.kaggle.com/datasets/prasoonkottarathil/polycystic-ovary-syndrome-pcos

Motivation for choice of project:

I was diagnosed with PCOS in 2022. However, I have been experiencing symptoms for many years before and have visited other doctors before. In India, there is a large tendency to ignore PCOS in unmarried women, even by experienced gynecologists. The fact that it drastically reduces quality of life for many women is not taken into account and doctors tend to do no more than advise to exercise and eat healthy. Therfore, I became interested in exploring ways that I can use my data science skills to improve healthcare for women with PCOS.

The aim of this project is to use an appropriate classification model to diagnose PCOS. The use of machine learning in situations like these can help process large amounts of data to gain accurate diagnosis and thus possibly help reduce healthcare costs.

This project uses several R packages to perform data analysis and modeling. Below is a brief description of each package and its purpose.

  1. readxl The readxl package provides functions for reading data from Excel files into R.

  2. tidyverse The tidyverse package is a collection of packages that provide tools for data manipulation, visualization, and modeling. It includes popular packages like dplyr, ggplot2, and tidyr.

  3. plyr The plyr package provides functions for splitting, applying, and combining data in R.

  4. dplyr The dplyr package provides functions for data manipulation, including filtering, sorting, grouping, and summarizing data.

  5. ggplot2 The ggplot2 package provides a powerful system for creating graphics in R, with an emphasis on creating aesthetically pleasing and informative visualizations.

  6. Hmisc The Hmisc package provides functions for data analysis and modeling, including descriptive statistics, regression modeling, and survival analysis.

  7. stats The stats package is a core R package that provides functions for statistical analysis and modeling, including hypothesis testing, regression modeling, and time series analysis.

  8. corrplot The corrplot package provides functions for creating correlation matrix plots in R.

  9. psych The psych package provides functions for psychometrics and personality research, including factor analysis and correlations.

  10. DescTools The DescTools package provides functions for descriptive statistics and data visualization, including various summary statistics, contingency tables, and graphical displays.

  11. caret The caret package provides functions for machine learning and predictive modeling, including feature selection, model training, and model evaluation.

  12. tree The tree package provides functions for creating classification and regression trees in R.

  13. rpart The rpart package provides functions for creating decision trees in R.

  14. rattle The rattle package provides a graphical user interface (GUI) for data mining and machine learning tasks in R. It includes tools for data preprocessing, feature selection, and model evaluation.

diagnosing_pcos_decision_trees's People

Contributors

zeba-khadhijah avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.