Coder Social home page Coder Social logo

drknessfall / portuguese-natural-language-processing-with-python Goto Github PK

View Code? Open in Web Editor NEW

This project forked from rubensfernando/portuguese-natural-language-processing-with-python

0.0 2.0 0.0 13 KB

[WIP] This reporsitoy keeps an ongoing e-book about Natural Language Processing wiih Python with a focus on Portuguese Language (mainly Portugal and Brazil varieties)

Jupyter Notebook 100.00%

portuguese-natural-language-processing-with-python's Introduction

[WIP] Portuguese Natural Language Processing with Python

This reporsitoy keeps an ongoing e-book about Natural Language Processing wiih Python with a focus on Portuguese Language (mainly Portugal and Brazil varieties) .

The main goal for this e-book is to show how to use python and the several libraries available to process the Portuguese language.

Outline

0. Preface

1. Python for Natural Language Processing

  • Why Python?
  • Python 2 vs Python 3
  • Encodings
  • Best Practices

2. NLP Resources for Portuguese

  • Lexicons
    • Setiment Analysis
      • OpLexicon
      • LIWC
      • SentiLex
      • Priberam Subjective Lexicon
  • Gazetters
    • DBPedia
    • Other lists
  • Dicionaries
    • Unitex
    • Open Dictionary
    • Wiki Dictionary
  • Thesauri
    • TEP
    • Open Thesaurus
    • Onto.PT
  • Wordnets
    • WordNet.BR
    • WordNet.PT
    • Onto.PT
    • Open WordNet-PT
    • MultiWordNet
  • Corpora
    • PoS
      • MacMorpho
    • Parsing
      • Floresta Sinta(c)tica
    • Sentiment Analysis
      • Priberam Fine-Grained Opinion Corpus
      • ReLi

3. Language Statistics

  • Frequency Distribution
  • Lexical Variety
  • Zipf's law
  • Luhn cut
  • String distance
  • Fuzzy matching

4. NLP Processing

  • Speech Recognition
  • Language Detection
  • Sentence Tokenization
  • Word Tokenization
  • Morphological Analysis
  • Part of Speech Tagging
  • Parsing
  • Named Entity Recognition
  • Semantic Analysis

5. NLP Taks and Applications

  • Speller
  • Grammar checker
  • Term Extraction
  • Text Classification
  • Topic Detection
  • Automatic Summarization
  • Word-sense -Disambiguation
  • Machine Tranlation
  • Sentiment Analysis

6. Natural Language Generation

  • Random-text Generation

7. NLP Classification

Representations

  • Bag-of-words
  • Language Model
  • Word Embeddings

Classifiers

  • SVM
  • CRF
  • Random-forest

8. Information Extraction

9. Data Mining

10. Information Retrieval

Appendix

Python Libraries for NLP

  • chardet, unidecode
  • PyEnchant, hunspell
  • stemming
  • Soundex, Metaphone
  • Fuzzywuzzy, jellyfish
  • brasil.vocab, certografia
  • Whoosh, cwb-python, Pattern, Newspaper
  • NLTK, TextBlob, MontyLingua, spaCy, PyNLPl, polyglot, PyPLN
  • nlpnet, MBSP
  • gensim
  • sumy, textteaser, TextRank
  • MITIE, RAKE
  • scikit-learn, CRFsuite

Author

Pedro Paulo Balage pedrobalage.com

portuguese-natural-language-processing-with-python's People

Contributors

pedrobalage avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.