Name: Scott Haines
Type: User
Company: Nike
Bio: Distinguished Software Engineer @nike-inc. I specialize in writing massively distributed streaming systems on top of @ApacheSpark . Love Dogs
Twitter: newfront
Location: Mountain View, CA
Blog: https://medium.com/@newfrontcreative
Scott Haines's Projects
All Data, Relevant Information, Scripts, and Applications for the Open Data Science Conference (2018)
Parallel.js is a tiny library for multi-core processing in Javascript.
Source files from Programming Objective-C 2.0 (3rd Edition)
Mirror of Apache Spark
A Python Library to support running data quality rules while the spark job is running⚡
This project is available free of charge as a companion to my Data+AI Summit (2022) talk.
A Gentle introduction to Machine Learning with Apache Spark
The source code for the book Modern Data Engineering with Apache Spark
Spark Application : Spark Summit 2018 : Streaming Trend Discovery
This is the material for the 2019 Silicon Valley Code Camp Session "Realish Time Predictive Analytics with Spark Structured Streaming"
Source Code and Files for the Web Socket I/O presentation by Scott Haines
Javascript Webworkers Playground
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.