Coder Social home page Coder Social logo

Stefan Grafberger's Projects

csvmatch icon csvmatch

🔎 Finds fuzzy matches between CSV spreadsheets

datawig icon datawig

Imputation of missing values in tables.

dedupe icon dedupe

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

deequ icon deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

jenga icon jenga

Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.

learnedcardinalities icon learnedcardinalities

Code and workloads from the Learned Cardinalities paper (https://arxiv.org/abs/1809.00677)

mlinspect icon mlinspect

Inspect ML Pipelines in Python in the form of a DAG

mlinspect-cidr icon mlinspect-cidr

Inspect ML Pipelines in Python in the form of a DAG (CIDR Submission version)

mlwhatif icon mlwhatif

Data-Centric What-If Analysis for Native Machine Learning Pipelines

noworkflow icon noworkflow

Supporting infrastructure to run scientific experiments without a scientific workflow management system.

pgbm icon pgbm

Probabilistic Gradient Boosting Machines

streamdq icon streamdq

StreamDQ is a library built on top of Apache Flink for defining "unit tests for data", which measure data quality in large data streams.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.