Christian Clauss's Projects
Differentiable architecture search for convolutional and recurrent networks
A python library for easy manipulation and forecasting of time series.
Interactive, reactive web apps in pure python :star:
Dash DAQ example apps for instrumentation and hardware
Distractor-aware Siamese Networks for Visual Object Tracking (ECCV2018)
Data and code behind the articles and graphics at FiveThirtyEight
Readiness for Data and AI and IoT
The Data Broker (DBR) is a distributed, in-memory container of key-value stores enabling applications in a workflow to exchange data through one or more shared namespaces. Thanks to a small set of primitives, applications in a workflow deployed in a (possibly) shared nothing distributed cluster, can easily share and exchange data and messages with a minimum effort. In- spired by the Linda coordination and communication model, the Data Broker provides a unified shared namespace to applications, which is independent from applications’ programming and communication model.
Data Engineering Practice Problems
Source code for the data pipeline that start by ingesting data from the embedded data collection devices (whale tags, moorings, etc), uploads it to the cloud and combines in a dataset consumable by the machine learning pipelines.
code for Data Science From Scratch book
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Toturials coming with the "data science roadmap" picture.
Open Source code related to dataCommons
Checks for the Datadog Agent that Stripe finds useful.
The Datadog Python library
Pydantic model generator for easy conversion of JSON, OpenAPI, JSON Schema, and YAML data sources.
This repository contains code and specifications to support harmonized data models
Put all the messages in the postgres
An open source multi-tool for exploring and publishing data
An #OSINT Framework to perform various recon techniques, aggregate all the raw data, and give data in multiple formats.
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Useful extensions to the standard Python datetime features
Fast, efficiently stored Trie for Python. Uses libdatrie.
An asynchronous PASE Db2 and IBM i integration library
DBNet: A Large-Scale Dataset for Driving Behavior Learning
dbt (data build tool) enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Database connections for multi-threaded environments