Chip Huyen's Projects
LSTM and QRNN Language Model Toolkit for PyTorch
My implementation of useful data structures, algorithms, as well as my solutions to programming puzzles.
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
Library for fast text representation and classification.
Feathr – An Enterprise-Grade, High Performance Feature Store
Easy TOC creation for GitHub README.md
The flexibility of Python with the scale and performance of modern SQL.
An ongoing list of pandas quirks
Library to scrape and clean web pages to create massive datasets.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Code Repository for Machine Learning with PyTorch and Scikit-Learn
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
Metaflow tutorials for ODSC West 2021
Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.
https://huyenchip.com/ml-interviews-book/
Toolkit for efficient experimentation with various sequence-to-sequence models
MathJAX plugin for GitBook
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
A living collection of deep learning problems
Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
A library for generalized sequence to sequence models
Computation using data flow graphs for scalable machine learning
Code for O'Reilly's "A Short Course on TensorFlow"
Open deep learning compiler stack for cpu, gpu and specialized accelerators