Coder Social home page Coder Social logo

dimits-ts / text_analytics Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 1.0 70.71 MB

Language Modelling (text generation, spell correction) and Sentiment Analysis / POS Tagging with MLP, RNN, CNN and BERT models and LLM prompting

Jupyter Notebook 86.13% TeX 8.51% Python 5.35%
baseline-model bert cnn fasttext-embeddings language-modeling llm mlp-classifier pos-tagging prompting pytorch rnn sentiment-analysis sklearn spell-correction tensorflow text-generation transformers transformers-library

text_analytics's Introduction

Text Analytics

Language Moddeling

We create bigram, trigram and linear interpolation language models which are used for language generation and spell correction.

Source code Report

Sentiment Classification and POS Tagging tasks

We create deep learning models using the Transformers\Datasets, Pytorch and Tensorflow libraries. We also use the keras_tuner / transformers_trainer frameworks to optimize hyperparameters and model architecture.

We briefly mention additional tasks carried out:

  • Sentiment Analysis: Dataset selection, exploratory analysis, custom stopwords, data augmentation.
  • POS Taggging: Dataset selection, exploratory analysis, custom parsing, custom baseline ("smart dummy") model, local caching of heavy computations, automated results generation (python -> LaTeX).

Each task features two IPython notebooks containing the executed code, python source files for repeated custom tasks and a unified report.

The reports discuss in detail the design decisions for each classifier and include graphs and aggregated results comparing the current model to the previous models.

Simple MLP model

Sentiment classification POS Tagging Report

RNN Model

Sentiment classification POS Tagging Report

CNN Model

Sentiment classification POS Tagging Report

BERT Model

Sentiment classification POS Tagging Report

text_analytics's People

Contributors

dimits-ts avatar vassilisdrouzas avatar

Watchers

 avatar

Forkers

vassilisdrouzas

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.