Coder Social home page Coder Social logo

sophieyliu / kdd19-tutorial Goto Github PK

View Code? Open in Web Editor NEW

This project forked from astonzhang/kdd19-tutorial

0.0 0.0 0.0 72.53 MB

KDD19 Tutorial: From Shallow to Deep Language Representations: Pre-training, Fine-tuning, and Beyond

Home Page: http://kdd19.mxnet.io/

License: Apache License 2.0

Python 41.03% Jupyter Notebook 58.97%

kdd19-tutorial's Introduction

KDD19 Tutorial: From Shallow to Deep Language Representations: Pre-training, Fine-tuning, and Beyond

Time: Thu, August 08, 2019 - 9:30am - 12:30 pm | 1:00 pm - 4:00 pm

Location: Dena’ina Center, Kahtnu 1 & 2-Level 2, 600 W. Seventh Avenue Anchorage, AK 99501

Presenters: Aston Zhang, Haibin Lin, Leonard Lausen, Sheng Zha, and Alex Smola

Abstract

Natural language processing (NLP) is at the core of the pursuit for artificial intelligence, with deep learning as the main powerhouse of recent advances. Most NLP problems remain unsolved. The compositional nature of language enables us to express complex ideas, but at the same time making it intractable to spoon-feed enough labels to the data-hungry algorithms for all situations. Recent progress on unsupervised language representation techniques brings new hope. In this hands-on tutorial, we walk through these techniques and see how NLP learning can be drastically improved based on pre-training and fine-tuning language representations on unlabelled text. Specifically, we consider shallow representations in word embeddings such as word2vec, fastText, and GloVe, and deep representations with attention mechanisms such as BERT. We demonstrate detailed procedures and best practices on how to pre-train such models and fine-tune them in downstream NLP tasks as diverse as finding synonyms and analogies, sentiment analysis, question answering, and machine translation. All the hands-on implementations are with Apache (incubating) MXNet and GluonNLP, and part of the implementations are available on Dive into Deep Learning.

Agenda

Time Tutor Title
9:30am-10:00am Alex Smola Part 1.1: Basics of hands-on deep learning
10:00am-11:00am Alex Smola Part 1.2: Neural Networks
11:00am-11:10am Coffee break
11:10am-11:30am Aston Zhang Part 2.1: Shallow language representations in word embedding
11:30am-12:30pm Aston Zhang Part 2.2: Word Embedding Application
12:30pm-1:00pm Lunch break
1:00pm-2:20pm Leonard Lausen Part 3: Transformer
2:20pm-2:30pm Coffee break
2:30pm-3:30pm Haibin Lin Part 4.1: Deep language representations with Transformer (BERT)
3:30pm-4:00pm Haibin Lin Part 4.2: BERT Application

Part 1.1: Basics of Hands-on Deep Learning

Slides: [pdf]

Notebooks:

  1. NDArray: [ipynb]
  2. Autograd: [ipynb]

Part 1.2: Neural Networks

Notebooks:

  1. Model: [ipynb]
  2. CNN/RNN: [ipynb]
  3. Sequence: [ipynb]
  4. RNN with Gluon: [ipynb]

Part 2.1: Shallow language representations in word embedding

Slides: [pdf]

Part 2.2: Word Embedding Application

Notebooks: [ipynb]

Part 3: Transformer

Slides: [pdf]

Notebooks: [ipynb]

Part 4.1: Deep language representations with Transformer (BERT)

Slides: [pdf]

Part 4.2: BERT Application

Notebooks: [ipynb]

FAQ

  • Q: How do I get access to the notebooks from the tutorial?
    • For setting it up on SageMaker Notebook instances, you can find the instructions here.
    • For setting up locally, check out the local installation guide.
    • The notebooks can be downloaded from this repo.

Have more questions? You may reach us at [email protected]

Links

kdd19-tutorial's People

Contributors

astonzhang avatar eric-haibin-lin avatar leezu avatar smolix avatar szha avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.