
An Apache 2.0-licensed PyTorch implementation of several attention mechanisms for deep learning researchers.

Intro

attentions provides several attention mechanisms used in natural language processing, implemented in PyTorch.
These attention mechanisms can be used in neural machine translation, speech recognition, image captioning, and similar tasks.

Attention allows the model to attend to different parts of the source sentence at each step of output generation.
Instead of encoding the input sequence into a single fixed context vector, we let the model learn how to generate a context vector for each output time step.
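
To make the idea of a per-step context vector concrete, here is a minimal sketch of plain dot-product attention. The function name `dot_product_attention` and the tensor shapes are assumptions chosen for illustration; this is not the interface of the modules in this repository.

```python
import torch
import torch.nn.functional as F


def dot_product_attention(query, encoder_outputs):
    """Hypothetical helper: compute one context vector per decoder step.

    query:           (batch, tgt_len, dim) decoder states
    encoder_outputs: (batch, src_len, dim) encoder states
    """
    # Similarity between every decoder step and every source position.
    scores = torch.bmm(query, encoder_outputs.transpose(1, 2))  # (batch, tgt_len, src_len)
    # Normalize over source positions so each row of weights sums to 1.
    attn = F.softmax(scores, dim=-1)
    # Weighted sum of encoder states: a separate context vector for each step.
    context = torch.bmm(attn, encoder_outputs)  # (batch, tgt_len, dim)
    return context, attn


torch.manual_seed(0)
encoder_outputs = torch.randn(2, 5, 8)  # 2 sentences, 5 source tokens, 8 features
query = torch.randn(2, 3, 8)            # 3 output time steps
context, attn = dot_product_attention(query, encoder_outputs)
```

Each of the 3 output steps gets its own row of attention weights over the 5 source positions, and therefore its own context vector, rather than sharing one fixed summary of the input.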

Implementation list

Name	Citation
Additive Attention	Bahdanau et al., 2015
Dot-Product Attention	Luong et al., 2015
Location-Aware (Location Sensitive) Attention	Chorowski et al., 2015
Scaled Dot-Product Attention	Vaswani et al., 2017
Multi-Head Attention	Vaswani et al., 2017
Relative Multi-Head Self Attention	Dai et al., 2019
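
As a rough illustration of the first entry in the list, additive attention scores each source position with a small feed-forward network, score = v^T tanh(W_q q + W_k k). The class below is a sketch under that assumption; the name `AdditiveAttentionSketch`, its layer names, and its argument shapes are hypothetical and may differ from the implementation shipped in this repository.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AdditiveAttentionSketch(nn.Module):
    """Hypothetical additive (Bahdanau-style) attention, for illustration only."""

    def __init__(self, hidden_dim):
        super().__init__()
        self.query_proj = nn.Linear(hidden_dim, hidden_dim, bias=False)  # W_q
        self.key_proj = nn.Linear(hidden_dim, hidden_dim, bias=False)    # W_k
        self.score_proj = nn.Linear(hidden_dim, 1, bias=False)           # v

    def forward(self, query, keys):
        # query: (batch, 1, hidden), keys: (batch, src_len, hidden)
        # The query broadcasts across src_len when added to the projected keys.
        energy = torch.tanh(self.query_proj(query) + self.key_proj(keys))
        scores = self.score_proj(energy).squeeze(-1)   # (batch, src_len)
        attn = F.softmax(scores, dim=-1)               # weights over source positions
        context = torch.bmm(attn.unsqueeze(1), keys)   # (batch, 1, hidden)
        return context, attn


torch.manual_seed(0)
attention = AdditiveAttentionSketch(hidden_dim=8)
additive_context, additive_attn = attention(torch.randn(2, 1, 8), torch.randn(2, 5, 8))
```

Unlike the dot-product variants in the list, the scoring function here has its own learned parameters, at the cost of an extra projection per source position.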

Troubleshooting and Contributing

If you have any questions, bug reports, or feature requests, please open an issue on GitHub
or contact [email protected].

I appreciate any kind of feedback or contribution. Feel free to go ahead with small changes such as bug fixes and documentation improvements. For major contributions and new features, please discuss them with the collaborators in the corresponding issues first.

Code Style

I follow PEP 8 for code style. Docstring style is especially important because the documentation is generated from the docstrings.

Author

Soohwan Kim @sooftware
Contact: [email protected]
