
castren-komi-wedding-laments's Introduction

Matthias Alexander Castrén's Komi Wedding Laments, sentence-aligned dataset

Matthias Alexander Castrén (1813–1852) collected seven wedding laments in the Komi language, presumably in 1843 somewhere in the Pechora region. The original manuscripts are archived in the National Library of Finland. They were published in 1873, with Finnish and German translations by T. G. Aminoff, in Acta Societatis Scientiarum Fennicae; digitized copies are available in the University of Helsinki Library and in the Internet Archive.

In this dataset, different versions of the text, especially the Komi and Finnish ones, are aligned with one another. The Komi transcription provided by Castrén and later edited by Aminoff is also presented in Standard Zyrian Komi orthography, or rather in the variety of it used in a recent dialect dictionary and in corpora.

The work of Niko Partanen was conducted within the Kone Foundation-funded research project Language Documentation Meets Language Technology: The Next Step in the Description of Komi. The materials used are in the public domain, and the author does not claim new copyright for the rearrangement of the text into XML files or for the creation of the orthographic variants. Citing the original source in Zenodo is, however, recommended and appreciated.

As the narrators of the texts are not known, and Castrén presumably collected them from several individuals, no places or persons other than Castrén are indicated anywhere. We are, however, glad to add new information to the collection if any can be found. The author of the dataset, Niko Partanen, can be reached by email at [email protected].

The data is provided in ELAN XML files so that it is easily compatible with other spoken Komi materials, even though in this case, naturally, no recording exists. Later, when more alignments with different manuscript versions are created, some other format may well be adopted.
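As a rough illustration of how sentence-aligned text might be read from an ELAN XML (`.eaf`) file, here is a minimal Python sketch using only the standard library. The element and attribute names (`TIER`, `TIER_ID`, `REF_ANNOTATION`, `ANNOTATION_REF`, `ANNOTATION_VALUE`) come from the ELAN file format. The tier names `komi-original` and `finnish-translation`, the sample sentences, and the assumption that translations are linked to the Komi text through reference annotations are all hypothetical and may not match the actual files in this dataset.

```python
import xml.etree.ElementTree as ET

# Hypothetical miniature EAF document. Tier names and sentence text are
# placeholders for illustration only, not content from the dataset.
SAMPLE_EAF = """
<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="1000"/>
  </TIME_ORDER>
  <TIER TIER_ID="komi-original">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
                            TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>komi sentence 1</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
  <TIER TIER_ID="finnish-translation" PARENT_REF="komi-original">
    <ANNOTATION>
      <REF_ANNOTATION ANNOTATION_ID="a2" ANNOTATION_REF="a1">
        <ANNOTATION_VALUE>finnish sentence 1</ANNOTATION_VALUE>
      </REF_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>
"""


def read_tier(root, tier_id):
    """Map annotation ID -> (referenced annotation ID or None, text) for one tier."""
    tier = root.find(f"./TIER[@TIER_ID='{tier_id}']")
    if tier is None:
        raise KeyError(f"no tier named {tier_id!r}")
    annotations = {}
    # Each ANNOTATION wraps either an ALIGNABLE_ANNOTATION or a REF_ANNOTATION.
    for ann in tier.findall("./ANNOTATION/*"):
        text = ann.findtext("ANNOTATION_VALUE", default="")
        annotations[ann.get("ANNOTATION_ID")] = (ann.get("ANNOTATION_REF"), text)
    return annotations


def aligned_pairs(root, source_tier, target_tier):
    """Pair each target annotation with the source annotation it refers to."""
    source = read_tier(root, source_tier)
    target = read_tier(root, target_tier)
    pairs = []
    for ref_id, text in target.values():
        if ref_id in source:
            pairs.append((source[ref_id][1], text))
    return pairs


root = ET.fromstring(SAMPLE_EAF)
pairs = aligned_pairs(root, "komi-original", "finnish-translation")
for komi, finnish in pairs:
    print(komi, "|", finnish)
```

To try this on a real file, `ET.parse(path).getroot()` could replace `ET.fromstring(SAMPLE_EAF)`, with the tier names adjusted to whatever the file actually contains.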

Citation

Please cite this dataset as:

Niko Partanen 2021: Matthias Alexander Castrén's Komi Wedding Laments, sentence-aligned dataset. 
