Coder Social home page Coder Social logo

adv_elan_draft's Introduction

Niko Partanen

I'm a linguist working primarily with the Uralic languages and language technology. I have worked with different archives and memory organizations, and serve as the librarian and archivist of the Finno-Ugrian Society. My research work has, besides linguistics, regularly addressed the use and digitization of archived materials.

I work as an information specialist in the National Library of Finland. My work is primarily related to the minority language support in our digital services, especially for the Sámi languages spoken in Finland.


Information

  • 🔭 I'm currently finalizing my PhD thesis about morphological variation in Komi language
  • 📔 I work regularly with normalization of dialectal and historical texts
  • 📜 I know both R and Python at an advanced level
  • 👯 I'm looking for new collaboration on:
    • Speech technologies (forced alignment, speaker detection, speaker identification)
    • Dependency parsing
    • Linguistic data visualization and cartography
  • 💬 Ask me about text and speech recognition, or Uralic languages
  • 📫 How to reach me: [email protected]

🛠️ Collaboration

I work or collaborate currently with various organizations, the list below is not exhaustive:


🧑‍🏫 Courses & Workshops

I have taught following courses and workshops regularly. Please contact me, if you would like to organize something in your institution along these lines.

  • Data management and publishing best practices
  • Multimedia management in language documentation
  • Using natural language processing in the language documentation context
  • Advanced manipulation of ELAN corpora with Python and R
  • Linguistic data analysis with spoken language corpora
  • Text recognition tools: model fine tuning & extracting the data from recognition result

🗺 Location

I live currently in Helsinki, Finland. I have previously lived in:


Languages

  • Finnish
  • Komi
  • Russian
  • English
  • Italian
  • Please free to contact me also in: Northern Saami, Aanaar Saami, Skolt Saami, French, German, Estonian, Karelian, Udmurt and Swedish

adv_elan_draft's People

Contributors

nikopartanen avatar

Watchers

 avatar  avatar

adv_elan_draft's Issues

Write about basic search methods

There has to be a part about basic most common search methods.

Something like:

Sequence a b:

corpus %>% filter(token == 'a') & lead(token) == 'b'))
corpus %>% filter(str_detect(token, 'a') & str_detect(lead(token), 'b'))

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.