MSc Thesis

My MSc thesis on "Classifying brain activity using electroencephalography and automated time tracking of computer use".

Progress was tracked using GitHub issues and the GitHub Projects board.

Abstract

We investigate the ability of EEG to distinguish between different activities users engage in on their devices, building on previous research which showed a considerable difference in brain activity between code- and prose-comprehension, as well as differences during code- and prose-synthesis. We perform a replication study and improve upon past results using state-of-the-art machine learning classifiers based on Riemannian geometry.

Furthermore, we extend the scope of previous work by introducing the automated time tracking application ActivityWatch, to track the device activities that the user is engaging in. This lets us label EEG data with naturalistic device activity, which we then use to train classifiers to discern activities such as code writing vs prose writing, or work vs media consumption. Our results indicate that a consumer-grade EEG device can discern between different activities that a user performs at the computer. Among other results, we show that not only can code and prose comprehension be distinguished, but also code and prose writing.

Writing

The latest version of the writing can be downloaded at:

Thesis report: erik.bjareholt.com/thesis/thesis.pdf
Goal document: erik.bjareholt.com/thesis/goaldoc.pdf
Presentation: erik.bjareholt.com/thesis/presentation.pdf
Popular scientific article (Swedish): erik.bjareholt.com/thesis/popsci.pdf

Usage

Setting it up:

Ensure you have Python 3.7+ and poetry installed
Install dependencies with poetry install

Collecting data:

Run eegwatch --help for instructions on how to collect EEG data
Run ActivityWatch to collect device activity data
Run the codeprose task in eeg-notebooks to collect data for the code vs prose task
- Install eeg-notebooks with pip install git+
- Run the codeprose task with eegnb runexp -ex visual-codeprose -subject X

Running classifier:

Run ./scripts/query_aw.py to collect labels from the running ActivityWatch instance
- You probably want to adjust the categorization rules embedded in the file
(TODO) Run eegclassify --help for instructions on how to train and run the classifier

Devices

I've worked with multiple devices, but the experiments were performed using the Muse S, which is therefore the best-supported device.

Muse S
- PPG support (experimental)
Neurosity Notion 1 & 2
- Thanks to @andrewjaykeller at @neurosity for sending me a refurbished Notion 1 to test with!
Neurosity Crown
OpenBCI Cyton
In theory: any device supported by Brainflow or muse-lsl

Notebooks

Code notebooks are built in CI and available at:

Main - primary notebook for the thesis, where we train a classifier for the code vs prose comprehension task.
Signal - for signal filtering and quality checking.
Activity - for classification of device activities.
PPG - for a basic PPG analysis.

Acknowledgements

See the Acknowledgements section in the thesis.

Muse data is frequently -1000 for TP9 and TP10

Not sure what's up with that, or how to deal with it.

From looking at the raw data, it looks like it's -1000 exactly every 5th row. Sometimes there are 2 in a row, and then it repeats every 5th row again.

Edit: Maybe this is just powerline noise? At the sampling freq of 250Hz the powerline peak would happen roughly every 4-5th sample. Why are TP9 and TP10 so much more sensitive though?

Example:

1603711387.314,-1000.000,-44.434,-38.574,-1000.000,0.000
1603711387.318,-609.375,-29.297,-27.344,-574.707,0.000
1603711387.322,787.109,-19.531,-23.926,814.941,0.000
1603711387.325,-852.051,-27.832,-22.461,-858.887,0.000
1603711387.329,184.082,-37.598,-23.926,189.941,0.000
1603711387.333,-1000.000,-45.410,-39.062,-1000.000,0.000
1603711387.337,-836.914,-34.668,-30.762,-804.688,0.000
1603711387.341,519.043,-18.555,-11.719,561.523,0.000
1603711387.345,-801.758,-18.555,-20.508,-808.105,0.000
1603711387.349,150.391,-23.438,-27.832,155.762,0.000
1603711387.353,-1000.000,-29.785,-26.367,-1000.000,0.000
1603711387.357,-1000.000,-30.762,-27.832,-1000.000,0.000
1603711387.361,178.711,-21.484,-20.508,231.934,0.000
1603711387.365,-764.648,-22.949,-21.484,-768.555,0.000
1603711387.368,222.168,-33.203,-31.250,198.242,0.000
1603711387.372,-1000.000,-39.551,-32.715,-1000.000,0.000
1603711387.376,-1000.000,-33.203,-31.250,-1000.000,0.000
1603711387.380,-36.133,-21.484,-26.367,-6.836,0.000
1603711387.384,-789.062,-16.113,-27.832,-781.738,0.000
1603711387.388,409.668,-18.066,-33.691,378.906,0.000
1603711387.392,-909.180,-28.320,-34.180,-925.293,0.000
1603711387.396,-1000.000,-26.855,-33.203,-1000.000,0.000
1603711387.400,-213.867,-16.113,-29.785,-186.523,0.000
1603711387.404,-873.535,-15.137,-23.438,-854.492,0.000
1603711387.407,650.879,-21.484,-28.320,603.516,0.000
1603711387.411,-738.281,-66.406,-38.086,-779.785,0.000
1603711387.415,-1000.000,-78.613,-34.180,-1000.000,0.000
1603711387.419,-204.102,-25.879,-19.531,-210.449,0.000
1603711387.423,-943.848,-7.812,-18.555,-930.664,0.000
1603711387.427,878.418,-9.277,-25.879,835.938,0.000
1603711387.431,-439.453,-28.320,-37.109,-500.977,0.000
1603711387.435,-1000.000,-36.621,-27.344,-1000.000,0.000
1603711387.439,-177.734,-18.066,-20.996,-181.641,0.000
1603711387.443,-991.699,-17.578,-37.598,-982.910,0.000
1603711387.447,-974.121,-29.297,-38.574,-996.094,0.000
1603711387.450,-139.160,-31.738,-42.969,-194.824,0.000
1603711387.454,-1000.000,-18.066,-48.340,-1000.000,0.000
1603711387.458,-303.223,-12.695,-33.691,-268.066,0.000

erikbjare / thesis Goto Github PK

thesis's Introduction

MSc Thesis

Abstract

Writing

Usage

Devices

Notebooks

Acknowledgements

thesis's People

Stargazers

Watchers

Forkers

thesis's Issues

Classifying tasks

Synchronized Brainwave Dataset (2015)

Reading prose vs code (Fucci et al)

Classifying sleep stages

Classifying emotion

Classifying mental states (focus etc)

EEG data for Mental Attention State Detection (focused, unfocused, drowsy)

Tasks

GQM

Resources

Tasks

GQM

Metrics

Tasks

GQM

Recommend Projects

Recommend Topics

Recommend Org