Name: Holden Karau
Type: User
Company: Open Source Big Data Dev
Bio: Holden Karau is trans Canadian, and open source contributor. She is a Spark committer co-author of Learning Spark, High Performance Spark and Kubeflow for ML.
Twitter: holdenkarau
Location: San Francisco, CA, USA
Blog: http://www.holdenkarau.com/resume.pdf?q=github
Holden Karau's Projects
Oryx 2: Lambda architecture on Spark, Kafka for real-time large scale machine learning
python implementation of the parquet columnar file format.
Send Sir Perceval on a quest to retrieve and gather data from software repositories.
pfff is mainly an OCaml API to write static analysis, dynamic analysis, code visualizations, code navigations, or style-preserving source-to-source transformations such as refactorings on source code.
PHP library for the Bitpay.com API
The Interactive PHP Debugger
Stand-alone numeric code snippets
Machine Learning Pipelines for Kubeflow
Source code for the operator plugin SDK of the Alpine Chorus analytics platform
Community Powered Hotlines
A better notebook for Scala (and more)
👀 A Kubernetes cluster resource sanitizer
[WIP] Predict comments on PRs
I (attempt to) print everything* from places
Puppet module to install Spark (0.8.0)
A conda-smithy repository for pyspark.
Mirror of pytest-kind: Test your Python Kubernetes app/operator end-to-end with kind and pytest
Python implementation of TextRank for text document NLP parsing and summarization
OpenID library for Python
Ruby on Rails
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Command line tools for working with Review Board
Spark reference applications
Set up PDB on Spark
Online REPL for 15+ languages.
latex resume
A Proof-Of-Concept auto-tuner for Apache Spark