learningmachineslab / tutorials

introduction to machine learning notebooks for physics education researchers

License: MIT License

Jupyter Notebook 100.00% Python 0.01%

python-notebook tutorials physics-education-research

tutorials's People

zimmermant walshkc huaguiyuan johnspat catlucht jillianmellen bigdatasciencegroup ziaridoy20 rifatulhimel

tutorials's Issues

Regression tutorial might need more scaffolding

The tasks in the tutorial are appropriate for linear regression, but they seem to assume a lot on the part of the participants. For example, even the first task just asks participants to develop 3 visualizations of the data to explain how things co-vary. Participants might need more scaffolding throughout to achieve the goals.

data creation code should be in its own folder instead of spread out across all folders

the data creation code currently resides in the data folder for each tutorial. this is sort of unwieldy. the data creation could instead live in its own folder, in a single script that creates the data for all of the different tutorials. this cuts down on repeated code and also on mistakes in data creation.
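One way the single-script idea could look is sketched below. Everything in it is an assumption made for illustration: the function names, the tutorial folder names (`regression`, `classification`), and the placeholder generators, which use Python's standard `random` module rather than whatever the notebooks currently call. The only piece taken from the repo layout is the `<topic>/data/data.csv` output path.

```python
"""Hypothetical central script: create the data for every tutorial in one place."""
import csv
import random
from pathlib import Path


def make_regression_rows(n_rows, rng):
    """Placeholder generator: one feature and a noisy linear response."""
    rows = []
    for _ in range(n_rows):
        x = rng.uniform(0.0, 10.0)
        y = 2.0 * x + 1.0 + rng.gauss(0.0, 1.0)
        rows.append({"x": x, "y": y})
    return rows


def make_classification_rows(n_rows, rng):
    """Placeholder generator: two Gaussian clusters with a binary label."""
    rows = []
    for _ in range(n_rows):
        label = rng.randint(0, 1)
        center = 3.0 * label
        rows.append(
            {"x1": rng.gauss(center, 1.0), "x2": rng.gauss(center, 1.0), "label": label}
        )
    return rows


# Tutorial folder name -> generator function (folder names are assumptions).
GENERATORS = {
    "regression": make_regression_rows,
    "classification": make_classification_rows,
}


def create_all(root, n_rows=100, seed=0):
    """Write <root>/<tutorial>/data/data.csv for every registered tutorial."""
    written = []
    for offset, (tutorial, generator) in enumerate(GENERATORS.items()):
        # A separate, seeded generator per tutorial keeps each file reproducible.
        rng = random.Random(seed + offset)
        rows = generator(n_rows, rng)
        out_path = Path(root) / tutorial / "data" / "data.csv"
        out_path.parent.mkdir(parents=True, exist_ok=True)
        with out_path.open("w", newline="") as handle:
            writer = csv.DictWriter(handle, fieldnames=list(rows[0].keys()))
            writer.writeheader()
            writer.writerows(rows)
        written.append(out_path)
    return written
```

Calling `create_all(".")` from the repository root would regenerate every file. Adding a tutorial then means writing one generator and registering it in `GENERATORS`, so the CSV-writing logic exists in only one place.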

Regression tutorial seems to be missing interpretation answers

Went through the regression tutorial and it appears to be focused on calculating quantities and making plots, but there are some important interpretation questions that we'd want participants to reflect on, and the solutions should include answers to them.

I'll start with the first task, in which the solutions show 3 different visualizations that are not interpreted (i.e., they don't answer the posed question). The other tasks are similarly missing interpretations from the solution.

lack of education related data

all of the data thus far is just fake data generated with the data generation functions from sklearn. but i think it's much more relevant to have data sets that relate to PER topics. this means a column shouldn't be feature3 but perhaps HSGPA or fci_prescore. then there can be discussions of the analysis of this data. for example, i have no idea what it means that feature3 correlates with feature4, but if HSGPA correlates with fci_prescore we can think of some hypotheses as to why this is true.
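As a rough sketch of what PER-flavored columns might look like, the snippet below generates synthetic rows with the two column names mentioned above. The data are still fake: the 0 to 4 GPA scale, the 0 to 30 FCI score range, and every number in the linear relationship are assumptions chosen only so that the two columns correlate. The function names are hypothetical too.

```python
import random


def make_per_rows(n_students=200, seed=0):
    """Synthetic students with PER-style column names (all values are made up)."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n_students):
        # Assumed 0-4 GPA scale, clipped to stay in range.
        hsgpa = min(4.0, max(0.0, rng.gauss(3.3, 0.4)))
        # Assumed relationship: higher HSGPA tends to mean a higher pre-score.
        raw_score = 5.0 * hsgpa - 4.0 + rng.gauss(0.0, 4.0)
        # Assumed score range of 0-30, stored as a whole number.
        fci_prescore = int(min(30, max(0, round(raw_score))))
        rows.append({"HSGPA": round(hsgpa, 2), "fci_prescore": fci_prescore})
    return rows


def pearson_r(xs, ys):
    """Pearson correlation coefficient computed from its definition."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


rows = make_per_rows()
r = pearson_r(
    [row["HSGPA"] for row in rows],
    [row["fci_prescore"] for row in rows],
)
print(f"correlation between HSGPA and fci_prescore: {r:.2f}")
```

With named columns like these, a tutorial can ask participants why the correlation might exist instead of only asking them to compute it.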

each sub folder should have a readme to help users navigate

subfolders can have readmes. this will be helpful to explain the function of the notebooks in each folder, especially for users who are not in a workshop, course, etc. where they can immediately ask someone the purpose of a notebook. a readme should exist for every subfolder.

topic/
├── data/
│   ├── data.csv
├── notebook1.ipynb