This project attempts to analyze and predict students' test scores from various social and economic factors.
Note: The easiest way to see the results of the project is to navigate here.
Our dataset is the CollegeDistance dataset, which can be found here.
This dataset has a total of 4739 observations of 15 variables (14 predictors and 1 response). The variables are:
- gender: factor indicating gender
- ethnicity: factor indicating ethnicity (African-American, Hispanic or other)
- score: base year composite test score
- fcollege: factor. Is the father a college graduate?
- mcollege: factor. Is the mother a college graduate?
- home: factor. Does the family own their home?
- urban: factor. Is the school in an urban area?
- unemp: county unemployment rate in 1980
- wage: state hourly wage in manufacturing in 1980
- distance: distance from 4-year college (in 10 miles)
- tuition: average state 4-year college tuition (in 1000 USD)
- education: number of years of education
- income: factor. Is the family income above 25,000 USD per year?
- region: factor indicating region (West or other)
Our project attempts to predict score from a combination of the other variables.
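As a rough illustration of what predicting score from the other variables can look like, the sketch below fits a plain additive linear model in base R. It is only a baseline under stated assumptions, not the model selected in the project: `fit_score_model` and `predictors` are hypothetical names introduced here, the column names are taken from the variable list above, and the CSV is assumed to sit in the working directory.

```r
# Predictor names taken from the variable list above (everything except score).
predictors <- c("gender", "ethnicity", "fcollege", "mcollege", "home", "urban",
                "unemp", "wage", "distance", "tuition", "education", "income",
                "region")

# Hypothetical helper: fit an additive linear model of score on the given
# predictors. This is an illustrative baseline, not the project's final model.
fit_score_model <- function(data, predictors) {
  stopifnot(all(c("score", predictors) %in% names(data)))
  lm(reformulate(predictors, response = "score"), data = data)
}

# Assumes the working directory is the cloned repository.
csv_path <- "CollegeDistance.csv"
if (file.exists(csv_path)) {
  college <- read.csv(csv_path, stringsAsFactors = TRUE)
  baseline <- fit_score_model(college, predictors)
  print(summary(baseline)$adj.r.squared)
}
```

Naming the predictors explicitly, rather than writing `score ~ .`, keeps any extra column in the file (such as a row index) out of the model.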
This project was created in R with a variety of libraries:
readr
knitr
faraway
lmtest
zoo
ggplot2
reshape2
rsq
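Before running the analysis, it can help to check which of these libraries are already installed. The snippet below is a minimal sketch; `missing_packages` is a hypothetical helper name, and the install call is left commented out so nothing is downloaded unless you choose to.

```r
# Libraries listed above.
project_pkgs <- c("readr", "knitr", "faraway", "lmtest", "zoo",
                  "ggplot2", "reshape2", "rsq")

# Hypothetical helper: return the names of packages that are not installed.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

to_install <- missing_packages(project_pkgs)
if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # install.packages(to_install)  # uncomment to install from CRAN
}
```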
To get this project running on your machine, follow the steps below.
The simplest way to view the project is to go to this link, where you can see the results of our project.
Additionally, you can download the final-data-project.html file and open it in your preferred browser.
This project was created in R, so we recommend RStudio as the IDE to run the application.
After installing RStudio, open your preferred terminal and run the following command:
git clone https://github.com/nbalepur/College-Distance-Modeling.git
You will then see the following files:
- CollegeDistance.csv: The dataset used in this project
- final-data-project.Rmd: Contains the R Code for computations and calculations
- final-data-project.html: The HTML output of the project
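If you want to regenerate the HTML output from the Rmd file yourself, one possible approach is to knit it from the R console, as sketched below. This assumes the rmarkdown package and pandoc are available (neither appears in the library list above; RStudio bundles pandoc), and `render_report` is a hypothetical helper name. Pressing the Knit button in RStudio should achieve the same result.

```r
# Hypothetical helper: knit an R Markdown file to HTML.
# Assumes the rmarkdown package is installed.
render_report <- function(rmd_path) {
  if (!file.exists(rmd_path)) {
    stop("File not found: ", rmd_path)
  }
  rmarkdown::render(rmd_path, output_format = "html_document")
}

# Assumes the working directory is the cloned repository.
rmd_path <- "final-data-project.Rmd"
if (file.exists(rmd_path)) {
  render_report(rmd_path)
}
```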
- Nishant Balepur
- Allison Zhang
- Kobe Dela Cruz
This project was created as a final project for STAT 420