
Niko Partanen

I'm a linguist working primarily with the Uralic languages and language technology. I have worked with different archives and memory organizations, and serve as the librarian and archivist of the Finno-Ugrian Society. My research work has, besides linguistics, regularly addressed the use and digitization of archived materials.

I work as an information specialist at the National Library of Finland. My work is primarily related to minority language support in our digital services, especially for the Sámi languages spoken in Finland.


Information

  • 🔭 I'm currently finalizing my PhD thesis about morphological variation in the Komi language
  • 📔 I regularly work on the normalization of dialectal and historical texts
  • 📜 I know both R and Python at an advanced level
  • 👯 I'm looking for new collaborations on:
    • Speech technologies (forced alignment, speaker detection, speaker identification)
    • Dependency parsing
    • Linguistic data visualization and cartography
  • 💬 Ask me about text and speech recognition, or Uralic languages
  • 📫 How to reach me: [email protected]

🛠️ Collaboration

I currently work or collaborate with various organizations; the list below is not exhaustive:


🧑‍🏫 Courses & Workshops

I have regularly taught the following courses and workshops. Please contact me if you would like to organize something along these lines at your institution.

  • Data management and publishing best practices
  • Multimedia management in language documentation
  • Using natural language processing in the language documentation context
  • Advanced manipulation of ELAN corpora with Python and R
  • Linguistic data analysis with spoken language corpora
  • Text recognition tools: model fine-tuning and extracting data from recognition results

🗺 Location

I currently live in Helsinki, Finland. I have previously lived in:


Languages

  • Finnish
  • Komi
  • Russian
  • English
  • Italian
  • Please feel free to contact me also in: Northern Saami, Aanaar Saami, Skolt Saami, French, German, Estonian, Karelian, Udmurt and Swedish

Niko Partanen's Projects

aineistot

Materials for the autumn lectures (very much unfinished)

amazonian-uralic-collaboration

This is a web application, running in a Rahti container, that runs an audio processing pipeline and sends the resulting ELAN file by email

conllr

R scripts used to work with CoNLL-U files and their variants in spring/summer 2017.

conllu-vis

Just a test repository to visualize files in CoNLL-U format in RMarkdown

courses

Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1

cv

Niko Partanen's CV

doc-to-publication

Thoughts, scripts and Word Macros (🤮) that help when preparing documents for publication

finto-data

Vocabulary data and tools for the Finto service

giellagas

This repository contains example scripts used to process materials in the Saami Culture Archive, University of Oulu

helsinki20180106

Presentation at the Sukukansojen ystävät ry seminar "Ajankohtaista komin ja hantin kielen tilanteesta" ("Current issues in the situation of the Komi and Khanty languages")

iwclul2015

This folder contains example data for the ELAN tutorial at the Tromsø conference: http://gtweb.uit.no/iwclul2015/

izva_sibilants

This repository contains a draft of a paper Niko Partanen is writing about the pronunciation of different Russian elements in Komi, with a particular focus on sibilants

language_maps

This is a collection of polygons which represent areas where different languages are spoken

ma

Updated version of my master's thesis
