will-thompson-k / deeplearning-nlp-models

A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch. Colab notebooks to run with GPUs. Models: word2vec, CNNs, transformer, gpt.

License: MIT License

Topics: word2vec, nlp, deep-learning, embeddings, attention, transformer, pytorch, tutorials, nlp-papers, deeplearning-nlp-models

deeplearning-nlp-models's Introduction



A small, interpretable codebase containing the re-implementation of a few "deep" NLP models in PyTorch.

This is presented as an (incomplete) starting point for those interested in getting into the weeds of DL architectures in NLP. Annotated models are presented along with some notes.

There are links to run these models on Colab with GPUs via notebooks.

Current models: word2vec, CNNs, transformer, GPT. (Work in progress.)

Meta

BERT: Reading. Comprehending.

Note: These are toy versions of each model.

Contents

Models

These NLP models are presented chronologically and, as you might expect, build off each other.

Model Class  | Model                                                    | Year
Embeddings   | 1. Word2Vec Embeddings (Self-Supervised Learning)        | 2013
CNNs         | 2. CNN-based Text Classification (Binary Classification) | 2014
Transformers | 3. The O.G. Transformer (Machine Translation)            | 2017
Transformers | 4. OpenAI's GPT Model (Language Model)                   | 2018, 2019, 2020
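To make the self-supervised setup behind the first model concrete: Word2Vec's skip-gram objective trains each word to predict its neighbors within a context window. A minimal sketch of the pair generation (skipgram_pairs is a hypothetical illustrative helper, not a function from this repo):

```python
# Hypothetical helper (for illustration only): generate (center, context)
# skip-gram training pairs from a token sequence.
def skipgram_pairs(tokens, window=2):
    pairs = []
    for i, center in enumerate(tokens):
        # Look at neighbors up to `window` positions away, skipping the center itself.
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs

pairs = skipgram_pairs(["the", "cat", "sat"], window=1)
# Each adjacent pair appears in both directions, e.g. ("the", "cat") and ("cat", "the").
```

These (center, context) pairs are the "labels" the embedding model is trained on; no human annotation is needed, which is what makes the setup self-supervised.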

Features

This repository has the following features:

  • Model overviews: A brief overview of each model's motivation and design is provided in separate README.md files.
  • Jupyter notebooks (easy to run on Colab w/ GPUs): Notebooks showing how to run the models, along with some simple analyses of the results.
  • Self-contained: Tokenizers, dataset loaders, dictionaries, and all the custom utilities required for each problem.

Endgame

After reviewing these models, the world's your oyster in terms of other models to explore:

Char-RNN, BERT, ELMo, XLNet, all the other BERTs, BART, Performer, T5, etc.

Roadmap

Future models to implement:

  • Char-RNN (Karpathy)
  • BERT

Future repo features:

  • Tensorboard plots
  • Validation-set demonstrations
  • Saving/loading model checkpoints
  • BPE (from either openai/gpt-2 or Facebook's fairseq library)

Setup

You can install the repo using pip:

pip install git+https://github.com/will-thompson-k/deeplearning-nlp-models 

Structure

Here is a breakdown of the repository:

  • nlpmodels/models: The model code for each paper.
  • nlpmodels/utils: Contains all the auxiliary classes related to building a model, including datasets, vocabulary, tokenizers, samplers and trainer classes. (Note: Most of the non-model files are thrown into utils. I would advise against that in a larger repo.)
  • tests: Light (and by no means comprehensive) coverage.
  • notebooks: Contains the notebooks and write-ups for each model implementation.

A few useful commands:

  • make test: Run the full suite of tests (you can also use setup.py test and run_tests.sh).
  • make test_light: Run all tests except the regression tests.
  • make lint: If you really like linting code (also can run run_pylint.sh).

Requirements

Python 3.6+

Here are the package requirements (found in requirements.txt):

  • numpy==1.19.1
  • tqdm==4.50.2
  • torch==1.7.0
  • datasets==1.0.2
  • torchtext==0.8.0

Citation

@misc{deeplearning-nlp-models,
  author = {Thompson, Will},
  title = {deeplearning-nlp-models},
  url = {https://github.com/will-thompson-k/deeplearning-nlp-models},
  year = {2020}
}

License

MIT

deeplearning-nlp-models's People

Contributors

will-thompson-k

deeplearning-nlp-models's Issues

Subsampling of frequent words

I was looking through your implementation of subsampling of frequent words in https://github.com/will-thompson-k/deeplearning-nlp-models/blob/master/nlpmodels/utils/elt/skipgram_dataset.py#L68, specifically how you generate your sampling table in get_word_discard_probas(). It looks like your implementation differs slightly from the original paper's: https://github.com/tmikolov/word2vec/blob/20c129af10659f7c50e86e3be406df663beff438/word2vec.c#L407.

Something like this worked for me if I pass a collections.Counter or dict with the item counts.

import numpy as np

def sampling_probabilities(item_counts, sample=1e-5):
    # item_counts: a collections.Counter or dict mapping item -> count
    counts = np.array(list(item_counts.values()), dtype=np.float64)
    total_count = counts.sum()
    # Keep-probability from the original word2vec C code:
    # p(keep) = (sqrt(f / (sample * total)) + 1) * (sample * total) / f
    probabilities = (np.sqrt(counts / (sample * total_count)) + 1) * (sample * total_count) / counts
    # Only useful if you wish to plot the probability distribution:
    # probabilities = np.minimum(probabilities, 1.0)
    return {k: probabilities[i] for i, k in enumerate(item_counts.keys())}

Using sample=1e-5 on one of my smaller datasets, I get around a 17% chance of keeping the most frequent item. This will of course vary a lot from dataset to dataset. There is a StackOverflow thread discussing the sampling: https://stackoverflow.com/questions/58772768/word2vec-subsampling-implementation.
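To see the behavior of that keep-probability formula, here is a small self-contained demo on toy counts (the counts and the sample value are made up for illustration):

```python
import numpy as np
from collections import Counter

# Toy counts: "the" dominates the corpus, "sat" is rare.
item_counts = Counter({"the": 900, "cat": 90, "sat": 10})
sample = 1e-3

counts = np.array(list(item_counts.values()), dtype=np.float64)
t = sample * counts.sum()
# Same keep-probability expression as in sampling_probabilities:
# p(keep) = (sqrt(f / t) + 1) * t / f
probs = dict(zip(item_counts, (np.sqrt(counts / t) + 1) * t / counts))
# Frequent items get a low keep probability; rare items are nearly always kept.
```

The effect is exactly the subsampling described in the word2vec paper: the most frequent item here is kept only about 3% of the time, while the rare one survives over 40% of the draws.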
