Coder Social home page Coder Social logo

masakhane-pos's Introduction

MasakhaPOS: Part-of-Speech Tagging for 20 African Languages

The code is based on HuggingFace implementation (License: Apache 2.0).

The license of the POS dataset is in CC-BY-4.0-NC, the monolingual data have difference licenses depending on the news website license.

Required dependencies

  • python
    • transformers : state-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
    • seqeval : testing framework for sequence labeling.
    • ptvsd : remote debugging server for Python support in Visual Studio and Visual Studio Code.
pip install transformers seqeval ptvsd

If you make use of this dataset, please cite us:

BibTeX entry and citation info

@inproceedings{Dione2023MasakhaPOSPT,
  title={MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages},
  author={Cheikh M. Bamba Dione and David Adelani and Peter Nabende and Jesujoba Alabi and Thapelo Sindane and Happy Buzaaba and Shamsuddeen Hassan Muhammad and Chris Chinenye Emezue and Perez Ogayo and Anuoluwapo Aremu and Catherine Gitau and Derguene Mbaye and Jonathan Mukiibi and Blessing Sibanda and Bonaventure F. P. Dossou and Andiswa Bukula and Rooweither Mabuya and Allahsera Auguste Tapo and Edwin Munkoh-Buabeng and victoire Memdjokam Koagne and Fatoumata Ouoba Kabore and Amelia Taylor and Godson Kalipe and Tebogo Macucwa and Vukosi Marivate and Tajuddeen Gwadabe and Mboning Tchiaze Elvis and Ikechukwu Onyenwe and Gratien Atindogbe and Tolulope Adelani and Idris Akinade and Olanrewaju Samuel and Marien Nahimana and Th'eogene Musabeyezu and Emile Niyomutabazi and Ester Chimhenga and Kudzai Gotosa and Patrick Mizha and Apelete Agbolo and Seydou Traore and Chinedu Uchechukwu and Aliyu Yusuf and Muhammad Abdullahi and Dietrich Klakow},
  year={2023}
}

masakhane-pos's People

Contributors

dadelani avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.