This repository contains the implementation of a Transformer model for translating English (eng) to German (deu) using attention, based on the paper "Attention Is All You Need" (Vaswani et al., 2017).
The model architecture includes components such as cross-attention, decoder, encoder, feedforward, self-attention, and transformer.
The trained model is saved under `models/`, and visualizations of accuracy, loss, and learning rate are in `assets/images`.
Architecture diagrams for cross-attention, decoder, encoder, feedforward, self-attention, and the full transformer are in `assets/model architecture`.
Clone the repository:

```shell
git clone https://github.com/yourusername/transformer-deu-to-eng.git
cd transformer-deu-to-eng
```
The transformer model consists of the following components:
- Encoder: Processes the input sequence.
- Decoder: Generates the output sequence.
- Self-Attention: Allows the model to focus on different parts of the input sequence.
- Cross-Attention: Allows the decoder to focus on relevant parts of the input sequence.
- Feedforward: Adds non-linearity and complexity to the model.
- Transformer: The complete model that ties the encoder, decoder, attention, and feedforward components together.
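The self- and cross-attention components listed above both reduce to the same core operation, scaled dot-product attention. A minimal NumPy sketch (function and variable names here are illustrative, not taken from this repo's code):

```python
import numpy as np

def scaled_dot_product_attention(q, k, v, mask=None):
    """attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = q.shape[-1]
    scores = q @ k.swapaxes(-1, -2) / np.sqrt(d_k)   # (..., seq_q, seq_k)
    if mask is not None:
        scores = np.where(mask, scores, -1e9)        # suppress masked positions
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over the keys
    return weights @ v, weights

# Self-attention: queries, keys, and values all come from the same sequence.
# In cross-attention, q comes from the decoder and k, v from the encoder output.
x = np.random.rand(4, 8)   # 4 tokens, model dimension 8
out, w = scaled_dot_product_attention(x, x, x)
print(out.shape, w.shape)  # (4, 8) (4, 4)
```

Each row of `w` is a probability distribution over input positions, which is what lets the model "focus" on different parts of the sequence.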
- text_pairs.pickle: Paired German (deu) and English (eng) sentences used for training.
- vectorize.pickle: The fitted vectorization used to turn each sentence into token IDs.
- posenc-2048-512.pickle: Precomputed positional encodings (sequence length 2048, model dimension 512).
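The `posenc-2048-512.pickle` filename suggests a precomputed positional-encoding matrix of shape (2048, 512). Assuming the sinusoidal scheme from the paper, such a matrix can be generated like this (a sketch, not this repo's exact code):

```python
import numpy as np

def positional_encoding(max_len, d_model):
    """Sinusoidal positional encoding from "Attention Is All You Need"."""
    pos = np.arange(max_len)[:, None]    # (max_len, 1) token positions
    i = np.arange(d_model)[None, :]      # (1, d_model) embedding dimensions
    angle = pos / np.power(10000.0, (2 * (i // 2)) / d_model)
    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angle[:, 0::2])  # even dimensions use sine
    pe[:, 1::2] = np.cos(angle[:, 1::2])  # odd dimensions use cosine
    return pe

pe = positional_encoding(2048, 512)  # same shape the pickle name implies
print(pe.shape)  # (2048, 512)
```

The encoding is added to the token embeddings so the attention layers, which are otherwise order-invariant, can use word position.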
- The implementation is based on the paper "Attention Is All You Need" by Vaswani et al.
- Thanks to the contributors of open-source libraries such as TensorFlow.