lstm_pytorch's Introduction

LSTM_pytorch

The goal of this repository is to train an LSTM model for classification on simple datasets whose difficulty and size are scalable. The examples have variable sequence lengths, which makes pack_padded_sequence and pad_packed_sequence necessary. The code is built on the PyTorch Dataset and DataLoader classes, which let you load data with parallel workers.
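As a rough illustration of why packing is needed for variable-length input, here is a minimal sketch that is not taken from the repository's code. The layer sizes, the toy batch, and the use of pad_sequence and enforce_sorted=False are assumptions made for the example.

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pad_sequence, pack_padded_sequence, pad_packed_sequence

# Three toy sequences of token ids with different lengths (0 is reserved for padding).
sequences = [torch.tensor([4, 4, 4, 4, 4]), torch.tensor([3, 3]), torch.tensor([1, 1, 1])]
lengths = torch.tensor([len(s) for s in sequences])

# Pad to a common length so the batch is one tensor of shape (batch, max_len).
padded = pad_sequence(sequences, batch_first=True, padding_value=0)

# Illustrative sizes only; the repository's models.py may use different ones.
embed = nn.Embedding(num_embeddings=5, embedding_dim=8, padding_idx=0)
lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)

# Pack so the LSTM skips the padded positions.
# enforce_sorted=False allows the batch to be in any length order.
packed = pack_padded_sequence(embed(padded), lengths, batch_first=True, enforce_sorted=False)
packed_out, (h_n, c_n) = lstm(packed)

# Unpack back to a padded tensor of shape (batch, max_len, hidden_size).
output, out_lengths = pad_packed_sequence(packed_out, batch_first=True)

# h_n[-1] holds the hidden state at each sequence's last real time step,
# which is a common input to a classification layer.
final_hidden = h_n[-1]
```

With packing, final_hidden for the sequence [3, 3] comes from its second time step rather than from a padded position.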

Datasets

There are currently two datasets. The first is a kind of identity function: given the input sequences [4,4,4,4,4] and [3,3], the model should learn to classify them as 4 and 3, respectively. You can increase the number of classes (values from 1 to class_no can appear in the input sequence), the number of samples, and the minimum and maximum input sequence length. Run this example with

python main.py --identity
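For intuition, samples of this kind could be generated roughly as follows. This is a sketch, not the repository's generator: the function name make_identity_dataset and its parameters other than class_no are hypothetical.

```python
import random

def make_identity_dataset(num_samples, class_no, min_len, max_len, seed=0):
    """Return (sequence, label) pairs where the sequence repeats its label.

    Hypothetical helper for illustration; labels range from 1 to class_no
    so that 0 stays free for padding.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(num_samples):
        label = rng.randint(1, class_no)        # inclusive on both ends
        length = rng.randint(min_len, max_len)  # variable sequence length
        samples.append(([label] * length, label))
    return samples

for seq, label in make_identity_dataset(3, class_no=4, min_len=2, max_len=5):
    print(seq, "->", label)
```

Raising class_no, num_samples, or the length bounds scales the difficulty and size of the task.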

The second example is to find the mode (the most frequent element) in a given sequence.

python main.py --mode
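A sketch of how such samples might be built is shown below. The helper names are hypothetical, and rejecting sequences with a tied mode is an assumption made here so that every label is unambiguous; the repository may handle ties differently.

```python
import random
from collections import Counter

def unique_mode(seq):
    """Return the most frequent element, or None if the top count is tied."""
    top = Counter(seq).most_common(2)
    if len(top) == 2 and top[0][1] == top[1][1]:
        return None
    return top[0][0]

def make_mode_dataset(num_samples, class_no, min_len, max_len, seed=0):
    """Return (sequence, label) pairs where the label is the sequence's mode.

    Hypothetical helper for illustration; sequences with tied modes are
    resampled (an assumption, not necessarily the repository's behavior).
    """
    rng = random.Random(seed)
    samples = []
    while len(samples) < num_samples:
        length = rng.randint(min_len, max_len)
        seq = [rng.randint(1, class_no) for _ in range(length)]
        label = unique_mode(seq)
        if label is not None:
            samples.append((seq, label))
    return samples

for seq, label in make_mode_dataset(3, class_no=4, min_len=3, max_len=7):
    print(seq, "->", label)
```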

Acknowledgment

Thanks to Egor Lakomkin and Chandrakant Bothe for their valuable feedback.

lstm_pytorch's Issues

scale_grad_by_freq issue in embedding

Line 19 in models.py (the nn.Embedding call) should be fixed.

Currently it is:

self.embed = nn.Embedding(V, self.embed_dim, max_norm=None, scale_grad_by_freq=False, padding_idx=0)

Change: pass max_norm=args.max_norm instead of None.

Hence, it should be:

self.embed = nn.Embedding(V, self.embed_dim, max_norm=args.max_norm, scale_grad_by_freq=False, padding_idx=0)
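To show what the change affects, the sketch below wires a command-line value into nn.Embedding and checks the norms of the looked-up vectors. The --max_norm flag definition, the sizes, and the inflated weights are assumptions for the demonstration; only the nn.Embedding arguments mirror the line above.

```python
import argparse
import torch
import torch.nn as nn

# Hypothetical argument definition; the issue only tells us args.max_norm exists.
parser = argparse.ArgumentParser()
parser.add_argument("--max_norm", type=float, default=None)
args = parser.parse_args(["--max_norm", "1.0"])

V, embed_dim = 10, 8  # illustrative vocabulary size and embedding width
embed = nn.Embedding(V, embed_dim, max_norm=args.max_norm,
                     scale_grad_by_freq=False, padding_idx=0)

# Inflate the weights so the effect of max_norm is visible.
with torch.no_grad():
    embed.weight[1:] *= 10

ids = torch.tensor([[1, 2, 3, 0]])
vectors = embed(ids)            # looked-up rows are renormalized during the lookup
norms = vectors.norm(dim=-1)    # one norm per token
print(norms)
```

With max_norm=None the same lookup would return the inflated vectors unchanged, so the command-line setting would have no effect.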

Repository: mazzamani / lstm_pytorch
