Coder Social home page Coder Social logo

mechacar_statistical-analysis's Introduction

MechaCar_Statistical-Analysis

Project Overview

The purpose of this project is to use statistics and hypothesis testing to review the production data in the automotive industry to give insights that can help AutosRUs with the manufacturing team with its production troubles with their newest protoype, the MechaCar.

Linear Regression to Predict MPG

  1. Based on the Pr, the vehicle length and vehicle ground clerance are statistically likely to provide non-random amounts of variance to the model.

  2. The p-Value for this model, p-Value: 5.35e-11, is much smaller than the assumed significance level of 0.05%. This indicates there is sufficient evidence to reject our null hypothesis, which further indcates that the slope of this linear model is not zero.

  3. This linear model has an r-squared value of 0.7149, which means that approximately 71% of all mpg predictions will be determined by this model. Relatively speaking, this multiple regression model does predict mpg of MechaCar prototypes effectively.

Summary Statistics on suspension Coils

image

image

  • When looking at the entire population of the production lot, the variance of the coils is 62.29 PSI, which is well within the 100 PSI variant requirement.

  • Lots 1 and 2 are significantly more consistent and well within the 100 PSI variance requirement with variances of .98 and 7.47 respectively

  • Lot 3 shows a much larger variance in performance and consistency with a variance of 170.29 PSI which is well outside the requirement of 100 PSI

t-Tests on Suspension Coils

Summary of t-Tests Across All Manufacturing Lots

image

  • true mean of the sample is 1498.78 and p-Value is 0.06, which is higher than the common significance level of 0.05 so there is not enough evidence to support rejecting the null hypothesis.

Summary of Lot 1

image

  • the Lot 1 sample has the true sample mean of 1500 with a p-Value of 6.529e-07 so we cannot reject the null hypotehsis that there is no statistical difference between the observed sample mean and the presumed population mean.

Summary of Lot 2

image

  • the Lot 2 sample has the true sample mean of 1499.52 with a p-Value of 0.117 so we cannot reject the null hypotehsis that there is no statistical difference between the observed sample mean and the presumed population mean.

Summary of Lot 3

image

  • the Lot 3 sample has the true sample mean of 1496.92 with a p-Value of 0.5859 so we cannot reject the null hypotehsis that there is no statistical difference between the observed sample mean and the presumed population mean.

Study Design: MechaCar vs Competition

A statistical study that compares the performance of the MechaCar vehicles versus the performance of vehicles from other manufacturers would involve collecting data on MechaCar and its comparable models across several different manufacturers over the last 3 years. Some of this Data would include:

  • What are the competitions' comparable models,
  • Which cars will MechaCar be competing with head-to-head? which cars will be included in the study?
  • Which factors will look at the study to determine the relevant to selling price?

Metrics

Collecting data for comparable models across all major manufacturers for past 3 years for the following metrics:

  • Safety Feature Rating: Independent Variable
  • Current Price (Selling): Dependent Variable
  • Drive Package : Independent Variable
  • Engine (Electric, Hybrid, Gasoline / Conventional): Independent Variable
  • Resale Value: Independent Variable
  • Average Annual Cost of ownership (Maintenance): Independent Variable
  • MPG (Gasoline Efficiency): Independent Variable

Hypothesis: Null and Alternative

After determining which factors are key for the MechaCar's genre:

  • Null Hypothesis (Ho): MechaCar is priced correctly based on its performance of key factors for its genre.
  • Alternative Hypothesis (Ha): MechaCar is NOT priced correctly based on performance of key factors for its genre.

Statistical Tests

A multiple linear regression would be used to determine the factors that have the highest correlation/predictability with the list selling price (dependent variable); which combination has the greatest impact on price.

mechacar_statistical-analysis's People

Contributors

stevenyouv avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.