This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.
• The highest duration is between 5 to 10 minutes. • The highest range of users are in age of between 25 to 35. • The highest range of users are subscribers. • Male members are almost triple the total count of female. • The users between 25 to 35 years old, take the longest trip duration. • In weekends users are usually make the longer trip duration. • In early morning 8 - 9 am and in late afternoon 4 - 6 pm useres usually take more bikes.
I mainly focused on ford GoBike system users and the influence of the trip duration, user age and type, and the daily usage of the system, starting by introducing the trip duration variable. Afterward, followed by age, type and gender distribution, then plot the transformed histogram plots. Afterwards, I introduce each of the categorical variables one by one. To start, I use the box plots of age to calculate the outliers, then, I introduced the effect of user type and age on long trip duration. to finish, I figured out the top hours of day of users using the bikes with the catplot.