Coder Social home page Coder Social logo

kdd2022-tutorial's Introduction

[KDD 2022 Tutorial]

Why Data Scientists Prefer Glassbox Machine Learning:
Algorithms, Differential Privacy, Editing and Bias Mitigation

Tutorial Abstract

Recent research has shown that interpretable machine learning models can be just as accurate as blackbox learning methods on tabular datasets. In this tutorial we will walk you through leading open source tools for glassbox learning, and show how intelligible machine learning helps practitioners uncover flaws in their datasets, discover new science, and build models that are more fair and robust. We’ll begin with an introduction to the science behind glassbox modeling, and walk through a series of case-studies that highlight the added value of interpretable methods in a variety of domains such as finance and healthcare without compromising accuracy. We’ll also show how glassbox models can be used for state of the art differentially private learning, bias detection/mitigation, and how these models can be edited to remove undesirable effects with GAMChanger. We’ll also discuss how to train interpretable models with deep neural nets.

Outline

  • Introduction to Glassbox Models (30 min)

    • Explainable Boosting Machines (EBMs)
    • Glassbox vs. Blackbox Explanations
  • Case Studies (60 min)

    • Healthcare
    • Industry
    • Finance
    • Missing Values Meets Glassbox Learning
    • Bias and Fairness
  • InterpretML Software: Intro and Installation (15 min, hands-on)

  • Differential Privacy (45 min, hands-on)

    • Introduction to Differential Privacy
    • DP Explainable Boosting Machines
    • Mitigating impacts of DP noise with interpretable, editable models
  • Model Editing (15 min, hands-on)

Tutors and Bios

Rich Caruana @ Microsoft Research

  • Rich Caruana is a senior principal researcher at Microsoft Research. Before joining Microsoft, Rich was on the faculty in the Computer Science Department at Cornell University, at UCLA’s Medical School, and at CMU’s Center for Learning and Discovery. Rich’s Ph.D. is from Carnegie Mellon University, where he worked with Tom Mitchell and Herb Simon. His thesis on Multi-Task Learning helped create interest in a new subfield of machine learning called Transfer Learning. Rich received an NSF CAREER Award in 2004, best paper awards in 2005, 2007, 2014, and 2021 co-chaired KDD in 2007, and serves as area chair for NIPS, ICML, and KDD. His current research focus is on learning for medical decision making, transparent modeling, and deep learning.

Harsha Nori @ Microsoft Research

  • Harsha Nori is a senior research engineering manager at Microsoft Research, working on Responsible AI tools. He co-founded the popular InterpretML toolkit, and has contributed to a number of popular open source data science tools. His current research focus is on explainability, fairness, and differential privacy for machine learning, and has published on these topics at conferences including NeurIPS, ICML, AAAI, CHI, and CLeaR, with a best paper at the 2021 NeurIPS Research2Clinics workshop.

Environment Setup

Note that M1/M2 based Mac devices require a slightly modified setup process. Please scroll down for detailed instructions on these devices.

  1. [Optional] For an isolated Python environment, setup the conda environment manager:
  1. Install required packages on the command line: pip install interpret gamchanger jupyter requests

  2. Clone this repository: git clone https://github.com/interpretml/kdd2022-tutorial.git

  3. Launch Jupyter: cd kdd2022-tutorial && jupyter notebook

Setup Instructions for M1/M2 based Macs

  1. Install Rosetta 2 and create a Rosetta-enabled terminal, following this guide: https://www.byran.tech/html/how-to-make-a-rosetta-2-emulated-x86-terminal-on-arm-apple-silicon-chips.html

  2. Launch your new Rosetta-enabled Terminal

  3. Install Miniconda: https://docs.conda.io/en/latest/miniconda.html#latest-miniconda-installer-links

  4. Create and activate a new x86-enabled conda environment:

CONDA_SUBDIR=osx-64 conda create -n interpret python=3.8
conda activate interpret
conda config --env --set subdir osx-64
  1. Follow steps 2-4 above :)








































black box == black hole

Let's redefine the event horizon to be the transparency horizon.

kdd2022-tutorial's People

Contributors

harsha-nori avatar richcaruana avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

Forkers

jeromeku

kdd2022-tutorial's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.