We've covered a lot in this course, starting from the ground up: what Python is, how to import packages, the basics of machine learning, and how to work with enterprise database systems. Below is a general outline. As we wrap up, take some time to go through it and review the many new skills you are becoming familiar with. This is a great time to ask final questions, refresh your memory on earlier concepts, and start organizing resources to help you as a budding data scientist going forward!

Python basics
- Data types
    - strings
        - slicing
    - ints
    - floats
    - lists
        - append
        - pop
        - remove
    - dictionaries
        - dict.get(key, default) #useful for updating dictionary values
    - tuples
    - booleans
- Iteration: for, while, break
- Conditionals: if, elif, else
- Try / Except Clauses
- Functions
    - def func_name(inputs):
          """docstring"""
          #stuff to do
          return blah
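
As a quick refresher, the sketch below ties several of the Python basics together: list methods, slicing, `dict.get` with a default, a try/except clause, and functions with docstrings. The function names `count_words` and `safe_divide` and the sample data are made up for illustration; they are not from the course materials.

```python
def count_words(words):
    """Return a dict mapping each word to how many times it appears."""
    counts = {}
    for word in words:
        # dict.get returns the default (0) when the key is not present yet
        counts[word] = counts.get(word, 0) + 1
    return counts


def safe_divide(numerator, denominator):
    """Return numerator / denominator, or None if the denominator is zero."""
    try:
        return numerator / denominator
    except ZeroDivisionError:
        return None


fruits = ["apple", "banana", "apple"]
fruits.append("cherry")      # add an item to the end of the list
last = fruits.pop()          # remove and return the last item ("cherry")
fruits.remove("banana")      # remove the first item equal to "banana"

print(count_words(fruits))   # {'apple': 2}
print("banana"[1:4])         # slicing a string: 'ana'
print(safe_divide(1, 0))     # None
```

Using `counts.get(word, 0)` avoids a `KeyError` the first time a word is seen, which is why it is handy for updating dictionary values.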

Python Packages
- pandas
    - DataFrames
        - df.head()
        - df.set_index()
        - df[df["col"]=="value"] #filtering with a boolean condition
        - df.sort_values(by="col_name", ascending=False) #sort by a column
    - Series
        - df["col"] #selecting a column
        - df.col #alternative format
        - df.col.value_counts()
        - df.col.quantile(q=) #percentiles of a series
- Matplotlib
    - import matplotlib.pyplot as plt
    - %matplotlib inline
    - plt.plot()
    - plt.title()
    - plt.xlabel() #and plt.ylabel()
    - df.plot(kind='barh') #alternatively, kind='bar' or kind='scatter' etc.
    - plt.figure(figsize=(10,10)) #change figure size
    - plt.subplots()
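
The pandas calls listed above can be tried on a small table. This is a minimal sketch, assuming pandas is installed; the column names and values are hypothetical and exist only for illustration.

```python
import pandas as pd

# Hypothetical data, made up for this example
df = pd.DataFrame({
    "name": ["Ann", "Bob", "Cat", "Dan"],
    "team": ["red", "blue", "red", "blue"],
    "score": [90, 72, 85, 60],
})

red_team = df[df["team"] == "red"]                     # keep rows where the condition is True
ranked = df.sort_values(by="score", ascending=False)   # highest score first
team_counts = df["team"].value_counts()                # number of rows per team
median_score = df["score"].quantile(q=0.5)             # 50th percentile of a Series
by_name = df.set_index("name")                         # use the "name" column as the index

print(ranked.head(2))
print(team_counts)
print(median_score)   # 78.5
```

In a notebook with Matplotlib available, something like `by_name["score"].plot(kind="barh")` followed by `plt.title(...)` should draw a horizontal bar chart of the same data.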

Machine Learning
- Supervised Learning
    - Train-test splits
    - Regression
        - OLS
        - Greedy Selection
            - Observed how coefficients changed when adding features to our model
        - Lasso/Ridge
            - Regularizing against overfitting
        - Adding Polynomial Features
    - Measuring performance
        - R^2
        - train vs test error
        - MSE
        - MSLE
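
To make the performance measures above concrete, here is a plain-Python sketch of a train-test split and of MSE, MSLE, and R^2. In practice scikit-learn provides equivalents (`train_test_split`, `mean_squared_error`, `mean_squared_log_error`, `r2_score`); the hand-written versions below are a simplified stand-in so the arithmetic is visible, and the sample numbers are invented.

```python
import math
import random


def train_test_split(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and return (train, test) lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def mse(y_true, y_pred):
    """Mean squared error: average of squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def msle(y_true, y_pred):
    """Mean squared log error; math.log1p(x) is log(1 + x), so values must be >= 0."""
    return sum(
        (math.log1p(t) - math.log1p(p)) ** 2 for t, p in zip(y_true, y_pred)
    ) / len(y_true)


def r_squared(y_true, y_pred):
    """R^2 = 1 - (residual sum of squares / total sum of squares)."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.0, 5.0, 8.0, 9.0]
print(mse(y_true, y_pred))         # 0.5
print(r_squared(y_true, y_pred))   # 0.9

train, test = train_test_split(range(8))
print(len(train), len(test))       # 6 2
```

Computing these metrics on the training rows and again on the held-out test rows is what makes the "train vs test error" comparison possible: a much larger test error is a common sign of overfitting.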

SQL
- Select statements
    - Order by
    - Limit
    - Where
    - Group by
    - Having
- Joins
    - Inner join
    - Outer join
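
The SQL clauses above can be reviewed together in one small, self-contained example. This sketch uses Python's built-in `sqlite3` module with an in-memory database rather than an enterprise system; the `customers` and `orders` tables and their rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")   # temporary database that lives only in memory
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'Ann'), (2, 'Bob'), (3, 'Cat');
    INSERT INTO orders VALUES (1, 1, 20.0), (2, 1, 35.0), (3, 2, 5.0);
""")

# WHERE filters rows, GROUP BY aggregates them, HAVING filters the groups,
# ORDER BY sorts the result, and LIMIT caps how many rows come back.
big_spenders = conn.execute("""
    SELECT c.name, SUM(o.amount) AS total
    FROM customers AS c
    INNER JOIN orders AS o ON o.customer_id = c.id
    WHERE o.amount > 1
    GROUP BY c.name
    HAVING SUM(o.amount) > 10
    ORDER BY total DESC
    LIMIT 5
""").fetchall()

# A LEFT OUTER JOIN keeps customers even when they have no matching orders.
order_counts = conn.execute("""
    SELECT c.name, COUNT(o.id) AS n_orders
    FROM customers AS c
    LEFT OUTER JOIN orders AS o ON o.customer_id = c.id
    GROUP BY c.name
    ORDER BY c.name
""").fetchall()

print(big_spenders)   # [('Ann', 55.0)]
print(order_counts)   # [('Ann', 2), ('Bob', 1), ('Cat', 0)]
conn.close()
```

Note the difference between the two joins: the inner join drops Cat entirely because she has no orders, while the outer join keeps her with a count of 0.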

APIs
- HTTP Requests
    - Headers
- OAuth
    - Authentication Tokens
- JSON
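
The API topics above fit together roughly as follows: a request carries headers, one of which often holds an authentication token, and the response body is commonly JSON. The sketch below uses only the standard library (`urllib.request` and `json`) so it runs without extra installs; a third-party HTTP client such as `requests` would look similar. The URL, token, and response body are all placeholders, and the request is built but deliberately not sent.

```python
import json
import urllib.request

# Hypothetical endpoint and token, for illustration only
API_URL = "https://api.example.com/v1/items"
ACCESS_TOKEN = "YOUR_ACCESS_TOKEN"

# OAuth 2.0 bearer tokens are typically sent in the Authorization header
request = urllib.request.Request(
    API_URL,
    headers={
        "Authorization": f"Bearer {ACCESS_TOKEN}",
        "Accept": "application/json",
    },
    method="GET",
)
# urllib.request.urlopen(request) would send it; omitted so the sketch runs offline.

# A made-up JSON response body, parsed into Python dicts and lists
sample_body = '{"items": [{"id": 1, "name": "widget"}], "total": 1}'
payload = json.loads(sample_body)
first_item = payload["items"][0]

print(request.get_method(), request.full_url)
print(first_item["name"])   # widget
```

Once parsed, the JSON payload behaves like the dictionaries and lists from the Python basics section, so the same indexing and iteration tools apply.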