Coder Social home page Coder Social logo

machine-learning-with-business-metrics's Introduction

Machine-Learning-with-Business-Metrics

QUEST FOR UPSTREAMS!

In this project business metrics are the decision tools.

An oil company, OilyGiant is in a quest of finding a new location for its new oil mining location.

A dataset of volume oil reserves, as well as three independent variables for each oil well as observed in the past are given.

The project is to find one location with to develop with prospect to meet company's business metrics:

  1. Contribute to the company's positive profitability, with budgeted cost of USD 100,000,000

  2. Has a potential of risk not more than 2.5%

Steps in finding that are:

  1. Gather the parameters to select wells in each location: oil quality and volume of reserves

  2. Build and select a model that is able to predict the volume of oil reserve in a new well

  3. Select wells with highest estimated volume

  4. Select location with highest total profit for the selected wells

The datasets of three locations observed:

  1. id — unique ID of wells

  2. f0, f1, f2 — three features that are significant

  3. product — volume of oil reserves in a well (thousands barrel)

The parameters of each well are known already. The target of this machine learning is to find prediction of volume, that is numerical, continuous. Hence the model needed is the regression one. One location with the highest margin will be found. From those three location, wells are to be sampled using bootstrapping method. And an analysis of profitability and potential risk is made.

Three models with three different algorithms: XGBoostRegressor, LinearRegression, and LightGBMRegressor, are built. All three algorithms perform well for dataset of Location 2. Yet none work satisfactorily for datasets of Location 1 and Location 3. This is probably related to the three given predictors being weekly relevant to the volume of oil reserved. Without bootstrapping, Location 1 appears as the most promising contributor to profitability. However with bootstrapping technique, it is discovered that the extremely high figure does not fall in the confidence interval of 95% of the distribution of mean of the samples. Finally, it is found that Location 2 is solid and more likely to contribute the highest profit, with mean distribution of confidence interval of 95% : USD 1,050,097 - USD 9,635,649, and loss potential as low as 0.5%.

machine-learning-with-business-metrics's People

Contributors

dwiputris avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.