stufield

followers: 15 following: 21 repos: 41 gists: 22

Name: Stu Field

Type: User

Bio: I am a data scientist and software engineer with 14+ years of experience implementing machine learning for biomarker discovery in proteomics

Twitter: stufield3

Location: Fort Collins, Colorado

Blog: https://www.linkedin.com/in/stu-field-133396a/

Hi, I'm Stu Field!

I'm a data scientist and software engineer from 🇨🇦 and I live in Fort Collins, CO ⛰️. I am currently exploring new employment opportunities, so if you think my skill set and experience are a match for your team, please reach out!

Professional Summary

With 14+ years in the Life Sciences proteomics (high-dimensional) space, I have created a comprehensive, R-based machine learning analysis ecosystem that enables both biomarker discovery and model development. Strong leadership and mentoring skills have led to 40+ production-level predictive models, resulting in significant revenue generation.

Skills

| Machine Learning | Statistics | Open-Source | Software Tools |
| --- | --- | --- | --- |
| Random Forest | Logistic regression | R | Linux, macOS |
| Naive Bayes | Linear regression | C++ | Git, GitHub |
| Lasso/ridge regression | GLMMs | Python | Bash, GNU |
| k-Nearest neighbour | Mixed-effects models | LaTeX | Bitbucket |
| PCA | Survival analysis | CI/CD | Slack |
| Ensemble methods | Multivariate statistics | | Atlassian suite |
| Maximum Likelihood | ANOVA | | |

Additional Skills

- Analysis of high-throughput, multiplex, high-dimensional proteomics assay data
- Accomplished leader driving small-group projects to completion
- Proven record of publication in peer-reviewed, international journals
- Project development and management, experimental design, and data analysis

About Me

😄 Pronouns: he/him
📫 How to reach me: or any of the links on the ⬅️ sidebar
🔭 I’m currently open to employment opportunities!
📚 I am currently learning ... actually, I am constantly learning 😄
🤔 I’m looking for help with ... finding my next role!
💬 Ask me about ... bikes and R ... I'll talk your 👂 off 😄
💬 Favorite food: 🐟 🌮
⚡ Fun fact ...
🚴 I'm an avid cyclist ... come say hi on

I maintain several R software libraries (📦) that implement statistical and machine learning techniques in biomarker discovery. Some of my popular published (CRAN) 📦 are:
These projects support analyses in the general health care (Life Sciences) space, generating proteomics-based clinical insights in areas such as:
- cardiovascular disease
- liver disease (NASH/NAFLD)
- alcohol effects
- biological aging
- exercise status
- metabolic disease
Favorite techniques:
- logistic regression (ol' faithful)
- random forest
- naive Bayes
- KKNN (nearest neighbor)
- survival analyses
- ensemble methods
I am a proponent of open-source software, conducting the majority of my research/analysis via Linux toolkits, R, and the RStudio IDE.
I promote adherence to so-called "tidy" data, a data science philosophy of shared underlying data structure, grammar, and format that facilitates reproducible analyses.
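
As a rough illustration of the "tidy" idea, the sketch below reshapes a small wide table (one column per analyte) into a long one (one row per observation). The column names, values, and the `to_long` helper are all hypothetical, and plain Python stands in for the R tooling this profile actually uses.

```python
# Wide format: one row per sample, one column per analyte.
# All names and values here are made up for illustration.
wide = [
    {"sample_id": "S1", "protein_a": 1.2, "protein_b": 3.4},
    {"sample_id": "S2", "protein_a": 0.9, "protein_b": 2.8},
]


def to_long(rows, id_col, names_to="feature", values_to="value"):
    """Reshape wide rows into long rows: one row per (id, feature) pair."""
    long_rows = []
    for row in rows:
        for col, val in row.items():
            if col != id_col:
                long_rows.append({id_col: row[id_col], names_to: col, values_to: val})
    return long_rows


if __name__ == "__main__":
    for record in to_long(wide, "sample_id"):
        print(record)
```

Once every observation sits in its own row with the same three columns, the same filtering, grouping, and plotting code can be reused across data sets, which is one way a shared structure supports reproducibility.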

Stu Field's Projects

code-works

This is simply a script repository, mostly a dumping ground for R scripts that don't have a proper home

covid-19

Quick and dirty visualization of COVID-19 case data from the Colorado Dept. of Public Health & Environment

csu-career-day-2018

Slides for a presentation at the CSU Careers in Life Sciences Day 2018

devel

This "package" is the development package. It contains a loose testing ground of functionality without a home

docs

Cubically's documentation

git-hooks

A testing ground for Git hooks ... what they do and when they're triggered

git-staa-577

Slides, code, cheat sheets, and RStudio lab notebooks for the "Applied Machine Learning" course, Spring 2019

gitr

A lightweight, dependency-free API to access system-level Git commands from within R

grapevine

The Grapes of Wrath. A simple set of useful, unique, and non-standard binary operators.

hex-images

A repository for typical PNG and/or SVG hex stickers primarily of the tidyverse

hex-stickers

Creating and manipulating hex stickers for R projects and packages. I stole this from somewhere but can't remember where

mehan-scripts

Crude greedy backup of original R scripts passed down from Mike Mehan

methyl-seq-analysis

A quick directory structure and setup for the Methyl Seq Analysis at ARBL

monty-hall-paradox

Attempt to explain the Monty Hall paradox with a simple simulation
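
A simulation of this kind might look like the sketch below. It is not the code in the repository (which is presumably written in R); the function names, the seed, and the number of rounds are assumptions chosen for illustration.

```python
import random


def play_round(switch: bool, rng: random.Random) -> bool:
    """Play one round; return True if the contestant ends up with the car."""
    doors = [0, 1, 2]
    car = rng.choice(doors)
    pick = rng.choice(doors)
    # The host opens a door that is neither the contestant's pick nor the car.
    opened = rng.choice([d for d in doors if d != pick and d != car])
    if switch:
        # Move to the one door that is still closed and not currently picked.
        pick = next(d for d in doors if d != pick and d != opened)
    return pick == car


def win_rate(switch: bool, n_rounds: int = 100_000, seed: int = 1) -> float:
    """Estimate the probability of winning under a fixed stay/switch strategy."""
    rng = random.Random(seed)
    wins = sum(play_round(switch, rng) for _ in range(n_rounds))
    return wins / n_rounds


if __name__ == "__main__":
    print(f"stay:   {win_rate(switch=False):.3f}")
    print(f"switch: {win_rate(switch=True):.3f}")
```

With enough rounds the estimates should settle near 1/3 for staying and 2/3 for switching, the result that makes the puzzle feel paradoxical.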

mosquitosnp

Summary scripts of the African mosquito SNP data evaluating population genetic resistance to pesticides

packages-report

A repo from the WTF ("What They Forgot to Teach You About R") workshop at RStudio Conf 2019 with Jenny Bryan and Jim Hester

power

A rudimentary power analysis tool geared towards 2-group comparisons
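
For a sense of what a two-group power calculation involves, here is a minimal sketch based on the normal approximation to a two-sided, two-sample test with equal group sizes. It is an assumption-laden stand-in, not the package's method: the function names and defaults are hypothetical, and a t-based calculation would give slightly larger sample sizes.

```python
from statistics import NormalDist


def power_two_group(effect_size: float, n_per_group: int, alpha: float = 0.05) -> float:
    """Approximate power of a two-sided, two-sample z-test.

    effect_size is the standardized mean difference (Cohen's d).
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic under the alternative hypothesis.
    shift = effect_size * (n_per_group / 2) ** 0.5
    # Probability of rejecting in either tail.
    return z.cdf(shift - z_crit) + z.cdf(-shift - z_crit)


def n_for_power(effect_size: float, target: float = 0.8, alpha: float = 0.05) -> int:
    """Smallest per-group sample size whose approximate power reaches the target."""
    if effect_size == 0:
        raise ValueError("effect_size must be non-zero")
    n = 2
    while power_two_group(effect_size, n, alpha) < target:
        n += 1
    return n


if __name__ == "__main__":
    print(n_for_power(0.5))
    print(round(power_two_group(0.5, 64), 3))
```

A useful sanity check on any such function is that power equals `alpha` when the effect size is zero, since every rejection is then a false positive.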

recipes

Pipeable steps for feature engineering and data preprocessing to prepare for modeling

remotes

Install R packages from GitHub, GitLab, Bitbucket, git, svn repositories, URLs

reprex-collection

A collection of *.R source code and *.html files of useful R tricks coded as reproducible examples via reprex

rstudio-conf-2018

Workshop materials from the rstudio::conf 2018 "Machine Learning" workshop with Max Kuhn

rstudio-conf-2019

Data, code, and workshop files used during the RStudio Conference 2019 "What They Forgot to Teach You About R" with Jenny Bryan

slide-template

GitHub repository for R Markdown xaringan slideshow presentations

somadataio

The SomaDataIO package loads and exports 'SomaScan' data via the 'SomaLogic Operating Co., Inc.' proprietary data file, called an ADAT ('*.adat'). The package also exports auxiliary functions for manipulating, wrangling, and extracting relevant information from an ADAT object once in memory.