Coder Social home page Coder Social logo

luca-caruccio / dsc-phase-3-project Goto Github PK

View Code? Open in Web Editor NEW

This project forked from learn-co-curriculum/dsc-phase-3-project

1.0 0.0 0.0 12.58 MB

This project attempts to analyze water well efficacy in Tanzania, in an effort to predict (with a reasonable level of certainty) where government resources should be allocated.

License: Other

dsc-phase-3-project's Introduction

Pump it up data competition, Analyzing water well functionality in Tanzania

Overview

This project is an attempt at an open sourced competition to best analyze and predict water well functionality in Tanzania. This could also include suggestions at infrastructure changes, geographical decisions, and population focuses. Ideally we would like to identify (with a reasonable degree of certainty) areas or well types that are especially functional, or unusually non-functional.

Business Problem(s)

The problem is, of course, access to safe and reliable drinking water. Quite literally a matter of life and death, the correct analyzation of which would be invaluable. Population growth, coupled with less and less wells being built while more and more break could spell disaster for Tanzania if not properly addressed and responded to.

Data and Methods

The data used for this project was generously provided by the Tanzanian Ministry of Water, as well as Taarifa, an open sourced infrastructure assisting to bring water as well as awareness to the nation of Tanzania. The data covers just under 60,000 water wells across the country, their status (functional versus non-functional) as well as other data such as source types, age, and those responsible for it's installation. This is a ternary problem, and so there are multiple data sets to cross examine.

![]https://imgur.com/su2iWJW

Results and Conclusions

Our model scored in at just over 81% accuracy, allowing us to identify well with a very reasonable degree of certainty. We also tested for feature importance which found that geographical location played a very signifcant role in well functionality. The further north and west a well is, the more likely it is to be functional. This is most likely due to the Northwestern border of Tanzania's proximity to Lake Victoria, Africa's largest body of water.

Future work

As far as future work goes, there is still much to be done. We could cross-examine different well failures, what factors they share, and possibly use that to prevent and predict such events. Additionally, the average lifespan of a well or pump before going dry, which can help planning and creating infrastructure for future projects, and inevitable breaks or malfunctions. Lastly, what the ideal ratio is person:well, as this would save materials from unnecessary overlap, which would be all the more arduous considering how much of the country is currently facing a drought.

For more information

Email: [email protected] Github: luca-caruccio Linkedin: Luca Caruccio

Repository Structure

-Introduction and Overview

-Business Problems

-Data Methods

-Results

-Conclusions

-Future Work and Next Steps

dsc-phase-3-project's People

Contributors

davidbraslow avatar loredirick avatar luca-caruccio avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.