Emma Yang's Projects
This is an activator project for showcasing how to read & write data from Kafka-cluster using Scala Producer & Consumer API.
The Python Dict that's better than heroin.
A collection of airflow sample workflows for data processing on aws
算法/数据结构/Python/剑指offer/机器学习/leetcode
Source code for www.allaboutscala.com tutorials
Samples and documentation for using the Amazon Neptune graph database service
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment
Apache Spark Examples
Avro schema generation and serialization / deserialization for Scala
:whale: A curated list of Docker resources and projects
AWS Glue code samples
Mirror of Apache Beam
A repo with a few tiny Apache Beam utilities that I've coded.
An example Apache Beam project.
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
:books: Books worth reading
AWS SDK for Python
Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm
Cloud ML Engine is now a part of AI Platform
Examples of covariance and contravariance usage in Scala and Java.
Mirror of Apache Crunch (Incubating)
Java classes to convert a CSV file to an Avro file
Repository of code and data for my data analysis and machine learning projects.
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
【数据科学家系列课程】
数据挖掘18大算法实现以及其他相关经典DM算法