Coder Social home page Coder Social logo

jvdzwaan / sonar2naf Goto Github PK

View Code? Open in Web Editor NEW

This project forked from cltl/sonar2naf

0.0 0.0 0.0 5.51 MB

Converter from Folia to NAF

License: Other

Shell 1.42% JavaScript 2.21% Python 21.94% Awk 0.22% CSS 3.35% HTML 70.85% Dockerfile 0.02%

sonar2naf's Introduction

since 20-02-2015

AUTHORS

GOAL

the SoNaR corpus is a large Dutch corpus (http://tst-centrale.org/producten/corpora/sonar-corpus/6-85) , of which a part has been annotated with Cornetto senses (http://www2.let.vu.nl/oz/cltl/cornetto/) in the DutchSemcor project (http://www2.let.vu.nl/oz/cltl/dutchsemcor/). the goal of this project is to:

USAGE

There are two main purposes of this github:

  • convert a folia xml file to NAF containing wf and term layer. cd to the scripts folder and call python FoliaToNaf.py -h for information on how to use it.
  • convert DutchSemcor to NAF. cd to the scripts folder and call python main.py -h for more information on how to use it.

Contents

Contents of this github:

  • folder 'scripts': contains python scripts to perform conversion
  • folder 'resources': contains 'base_naf.xml' which is used for the NAF conversion and 'cdb_syn_FILT.xml.lu-map', which is a mapping from Cornetto to ODWN1.0. It also contains the allwords xml files and its annotations. The folder allwords_NAF contains the processed all words files with annotations in naf.
  • folder 'dutch_pipeline': contains scripts to run naf file through dutch pipeline. only created for use on our personal server.
  • the all words part of dutchsemcor has been processed, but can not be distributed due to the license of SoNaR. Please contact the authors of this github for more information.

TODO list (in this order)

TODO list includes:

  • run full conversion to naf with pipeline (to be set up)

Code Documentation

All python code has been documented with the epydoc package (http://epydoc.sourceforge.net/) open script/html/index.html to inspect the documentation.

sonar2naf's People

Contributors

jvdzwaan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.