Coder Social home page Coder Social logo

cltk's Introduction

The Classical Language Toolkit

PyPi downloads Documentation Status DOI

Build Status Coverage Status codecov.io Code Issues

Join the chat at https://gitter.im/cltk/cltk

About

The Classical Language Toolkit (CLTK) offers natural language processing support for Classical languages. In some areas, it extends the NLTK. The goals of the CLTK are to:

  • compile analysis-friendly corpora in a variety of Classical languages (currently available for Chinese, Coptic, Greek, Latin, Pali, and Tibetan);
  • gather, improve, and generate linguistic data required for NLP (Greek and Latin are in progress, with more in the pipeline);
  • develop a free and open platform for generating reproducible, scientific research that advances the study of the languages and literatures of the ancient world.

Documentation

The docs are at docs.cltk.org.

Installation

CLTK supports Python version 3.4 and 3.5. The software only runs on POSIX–compliant operating systems (Linux, Mac OS X, FreeBSD, etc.).

$ pip install cltk

See docs for complete installation instructions.

The CLTK organization curates corpora which can be downloaded directly or, better, imported by the toolkit.

Contributing

See the Quickstart for contributors for an overview of the process. If you're looking to start with a small contribution, see the Issue tracker for "easy" jobs needing to be done. Bigger projects may be found at Project ideas page. Of course, new ideas are always welcome.

Citation

Each major release of the CLTK is given a DOI, a type of unique identity for digital documents. This DOI ought to be included in your citation, as it will allow researchers to reproduce your results should the CLTK's API or codebase change. To find the CLTK's current DOI, observe the blue DOI button in the repository's home on GitHub. To the end of your bibliographic entry, append DOI plus the current identifier. You may also add version/release number, located in the pypi button at the project's GitHub repository homepage.

Thus, please cite core software as something like:

Kyle P. Johnson et al.. (2014-2016). CLTK: The Classical Language Toolkit. DOI 10.5281/zenodo.<current_release_id>

A style-neutral BibTeX entry would look like this:

@Misc{johnson2014,
author = {Kyle P. Johnson et al.},
title = {CLTK: The Classical Language Toolkit},
howpublished = {\url{https://github.com/cltk/cltk}},
note = {{DOI} 10.5281/zenodo.<current_release_id>},
year = {2014--2016},
}

Many contributors have made substantial contributions to the CLTK. For scholarship about particular code, it might be proper to cite these individuals as authors of the work under discussion.

Gratitude

We are thankful for the following organizations that have offered support:

  • Google Summer of Code (sponsoring two students, 2016)
  • JetBrains (licenses for PyCharm and WebStorm)

License

The CLTK is Copyright (c) 2016 Kyle P. Johnson, under the MIT License. See LICENSE for details.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.