Coder Social home page Coder Social logo

keras-seq2seq-nmt-english-french's Introduction

Keras-SEQ2SEQ-NMT-english-french using pretrained word vectors

encoder-decoder model using LSTM 1.english to french using pretrained GLOVE (100d) vectors 2.French to english using pretrained word2vec vectors(200d)

Dataset: English to French sentence pairs. http://www.manythings.org/anki/fra-eng.zip

Pre-trained word embeddings: 1.English: the 100-dimensional GloVe (https://nlp.stanford.edu/projects/glove/) embeddings of 400k words computed on a 2014 dump of English Wikipedia. French: the 200-dimensional frWac2Vec(http://fauconnier.github.io/ )embeddings computed using Word2vec skip-gram approach on a 1.6 billion word corpus constructed from the Web limiting the crawl to the .fr domain.

Dependencies :

Python 3.6
Scikit-learn, Pandas, NumPy, Matplotlib
Keras >2.0
Either Theano or Tensorflow backend

Hyperparameters: batch_size = 64 # Batch size for training. epochs = 100 # Number of epochs to train for. Hidden units = 256 # Latent dimensionality of the encoding space. num_samples = 10000 # Number of samples to train on. max_vocab size = 10000

English to Fr -model Summary __________________________________________________________________________________________________ Layer (type) Output Shape Param # Connected to

encoder_inputs (InputLayer) (None, None, 100) 0


decoder_inputs (InputLayer) (None, None, 116) 0


encoder_lstm (LSTM) [(None, 256), (None, 365568 encoder_inputs[0][0]


decoder_lstm (LSTM) [(None, None, 256), 381952 decoder_inputs[0][0]
encoder_lstm[0][1]
encoder_lstm[0][2]


decoder_dense (Dense) (None, None, 116) 29812 decoder_lstm[0][0]

Total params: 777,332 Trainable params: 777,332 Non-trainable params: 0

Extensions working on: 1.Data Cleaning. 2.More Data 3.Layers 4.Attention

keras-seq2seq-nmt-english-french's People

Contributors

santhoshmani888 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.