Coder Social home page Coder Social logo

manuelruizf / mechacar_statistical_analysis Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 130 KB

Performed multiple linear regression analysis to identify which variables in the dataset predict the mpg of MechaCar prototypes. Ran t-tests to determine if the manufacturing lots are statistically different from the mean population

R 100.00%
data-analysis r r-language statistics

mechacar_statistical_analysis's Introduction

MechaCar_Statistical_Analysis

Project Overview

Jeremy´s upper management gave him a special project. AutosRUs’ newest prototype, the MechaCar, is suffering from production troubles that are blocking the manufacturing team’s progress. AutosRUs’ upper management has called on Jeremy and the data analytics team to review the production data for insights that may help the manufacturing team.
For this project we made the following insights/ data summary:

  • Perform multiple linear regression analysis to identify which variables in the dataset predict the mpg of MechaCar prototypes
  • Collect summary statistics on the pounds per square inch (PSI) of the suspension coils from the manufacturing lots
  • Run t-tests to determine if the manufacturing lots are statistically different from the mean population
  • Design a statistical study to compare vehicle performance of the MechaCar vehicles against vehicles from other manufacturers. For each statistical analysis, you’ll write a summary interpretation of the findings.

DELIVERABLE 1

Linear Regression to Predict MPG

Results

DEL1.1

Del1.2


Summary

The results showed us some important insights to understand the data. There were 2 variables that provided non-random amounts of variance to the model. According to the best p-values these are the 3 variables that provide non-random amounts to the model: The vehicle weight, spoiler angle, and All Wheel Drive (AWD).
There is sufficient evidence to reject our null hypothesis, which further indcates that the slope of this linear model is not zero, since the p-Value: 5.35e-11, is much smaller than the assumed significance level of 0.05%.
This linear model does predict mpg of MechaCar prototypes effectively, since it has an r-squared value of 0.7149.

DELIVERABLE 2

Summary Statistics on Suspension Coils

Del2


Del2.2


Summary

The entire population of the production lot, the variance of the coils is 62.29 PSI, so the answer is that the current manufacturing data meet this design specification for all manufacturing lots in total and each lot individually. The only one that doesn´t meet the standarts is the lot 3 with a PSI of 170.

DELIVERABLE 3

Summary

The mean of the sample is 1498.78, and a p-Value of 0.06, which is higher than the common significance level of 0.05. This means that there isn´t enough evidence to support rejecting the null hypothesis.

Individual lots:

Lot 1 & 2 sample actually has the true sample mean of 1500, again as we saw in the summary statistics above. With a p-Value of 1, clearly we cannot reject (i.e. accept) the null hypothesis that there is no statistical difference between the observed sample mean and the presumed population mean (1500). The Lot 3 has a sample mean of 1496.14 and the p-Value is 0.04, which is lower than the common significance level of 0.05. All indicating to reject the null hypothesis that this sample mean and the presumed population mean are not statistically different.

T-Tests on Suspension Coils

Del3

Del3.1

DELIVERABLE 4

Study Design: MechaCar vs Competition

You will earn a perfect score for Deliverable 4 by completing all requirements below:

The statistical study design has the following: A metric to be tested is mentioned (5 pt) A null hypothesis or an alternative hypothesis is described (5 pt) A statistical test is described to test the hypothesis (5 pt) The data for the statistical test is described (5 pt)

Metric

Current Price (Selling): Dependent Variable

Tools

  • R language
  • R Studio

Hypothesis

Null Hypothesis: MechaCar is priced correctly based on its performance of key factors for its genre per year according to data.

Test and description

We applied the multiple linear regression to describe how realted are the prices according to the impact it has historically made.

mechacar_statistical_analysis's People

Contributors

manuelruizf avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.