Coder Social home page Coder Social logo

nlp-labs's Introduction

ACADEMIC - NATURAL LANGUAGE PROCESSING LABS

Done by: MOHAMED AMINE FAKHRE-EDDINE

Supervised by: Pr. Lotfi EL AACHAK

This repository contains the labs I've done during my academic year in the Natural Language Processing course.

Lab 1

The first lab focused on the basics of NLP, including web scraping, data storing, cleaning, stemming, lemmatization, POS tagging and NER.

You can find the report here or in pdf format here.

Lab 2

Objective: The main purpose behind this lab is to get familiar with NLP Rule based, Regex and NLP Word embedding.

Work to do:

Part 1: Rule Based NLP and Regex: Using Regex, write a Python code that can generate a bill from a text given by the user.

Use case: "I bought three Samsung smartphones 150 $ each, four kilos of fresh banana for 1,2 dollar a kilogram and one Hamburger with 4,5 dollar".

Generated Bill:

Product Quantity Unit Price Total Price
Samsung smartphone 3 150.0 450.0
Banana 4 1.2 3.2
Hamburger 1 4.5 4.5

Part 2: Word Embedding:

  1. Apply one-hot encoding, bag of words, and TF-IDF techniques on the Data vectors collected during the lab 1.
  2. Apply one Word2Vec Approach (Skip Gram, CBOW) on the same DataSet.
  3. Apply Glove and FastText approaches on the same DataSet.
  4. Plot all the encoded/vectorized vectors using the Tsne Algorithm, evaluate those approaches, and give a general conclusion.

You can find the report here or in pdf format here.

Lab 3

Objective: The main purpose behind this lab is first to get familiar with using word embeddings with regression and classification models.

Work to do:

Part 1: Language Modeling - Regression: In this part of the lab, we learned about language modeling using regression, to predict the score of an answer for a specific question.

Part 2: Language Modeling - Classification: In this part of the lab, we learned about language modeling using classification, to predict the sentiment of a tweet.

You can find the report here or in pdf format here.

Lab 4

Objective: The main purpose behind this lab is to predict the relevance of a text to a specific topic, using a recurrent neural network, in addition to fine-tuning with a pre-trained language model.

You can find the report here or in pdf format here.

nlp-labs's People

Contributors

al0new0lf avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.