Coder Social home page Coder Social logo

sharavsambuu / mongolian-text-classification Goto Github PK

View Code? Open in Web Editor NEW
32.0 9.0 10.0 38.67 MB

Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP experiments are included.

Home Page: https://github.com/tugstugi/mongolian-bert

Python 4.65% HTML 0.13% Jupyter Notebook 95.22%
mongolia text-classification mongolian-text-classification nlp news-classification bert fine-tuning nlu cyrillic

mongolian-text-classification's Introduction

mongolian-text-classification

Mongolian cyrillic text classification with modern tensorflow and some fine tuning on TugsTugi's BERT model.

Load Mongolian BERT in Tensorflow 2

Open In Colab

Generate text using Mongolian BERT

Open In Colab

Visualize Mongolian BERT using bertviz and pytorch model

Open In Colab

Alt text

Fine tuning TugsTugi's Mongolian BERT model

On TPU mode, loading checkpoints from the file system doesn't supported by the bert and bucket should be used.

Fine tuning mongolian BERT on TPU, You need own bucket in order to finetune on TPU Open In Colab

Fine tune a mongolian BERT on GPU, a lot of computation needed, a low batch size matters due to memory limit Open In Colab

Classifiers using simple neural networks

No 01, Simplest classifier Open In Colab

No 02, Pretrained Word2Vec initialization from Facebook's fasttext, kind of transfer learningish. Embedding layer is not trainable in this case Open In Colab and with trainable embedding layer Open In Colab

No 03, 1D Convolution Open In Colab and multiple 1D convnets Open In Colab

No 04, LSTM Open In Colab

Visualize RNN neuron firing in text generation Open In Colab

No 05, LSTM with Attention, visualization of attention scores in text classification Open In Colab

No 06, Classification with Mongolian BERT and Tensorflow 2.0, with frozen bert layers Open In Colab

No 07, Classification with Mongolian BERT large and HuggingFace and Tensorflow 2 Open In Colab

Mongolian sentence interpolation experiments

Sequence loss in keras and tf2 Open In Colab

Variational Auto Encoder for Mongolian text Open In Colab

Other experiments

Predict next word, greedy text generation Open In Colab

Series included(or will) followings

word2vec initialization, 1D Convolution, RNN variants, Attention, Some weights visualization for reasoning, Transformer, Techniques to handle longer texts and so on...

useful references and resources

Images and screenshots

Alt text Alt text Alt text Alt text

mongolian-text-classification's People

Contributors

dependabot[bot] avatar sharavsambuu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

mongolian-text-classification's Issues

Сайн байна уу

Танитай холбогдох гэсэн юмаа facebook -ээр холбогдож болох уу?
Facebook хаягаа өгөөч please

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.