kuhumcst
Name: Centre for Language Technology, University of Copenhagen
Type: Organization
Location: Copenhagen, Denmark
Blog: http://cst.ku.dk/
Using supervised learning, create a set of affix rules for use by the CSTlemma lemmatiser.
Converts input text (UTF-8 encoded) to lowercase. Usage: all2lower <input> <output>
OpenCV-based plugin for the Anvil annotation software that tracks faces and creates annotations when velocity or acceleration thresholds are exceeded.
RDF, SPARQL and OWL for Clojure
Digital repository for the CLARIN-DK data centre
Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages. It uses affix rules (affix: prefix, infix, suffix, circumfix) that are obtained by supervised learning from a list of full forms and their lemmas.
Transform or scrape Hiccup with a declarative DSL.
A Danish semantic reasoning benchmark compiled from lexical semantic resources
Gold standard resource for evaluation of Danish word embedding models.
The Danish WordNet as an RDF graph.
Fetch the DK5 dataset and store it as EDN.
The implementation of fcs-korp-endpoint running on Alf.
The life of Louis Hjelmslev.
Simple implementation of a hash map using separate chaining. The table allocates more buckets if the load factor exceeds 100% and frees buckets if the load factor falls below 20%.
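The resizing policy described above can be sketched in a few lines. This is a minimal illustration in Python, not the repository's code: the class name `ChainedHashMap`, the minimum of 8 buckets, and the doubling/halving growth steps are all assumptions made for the example. Only the 100% and 20% load-factor thresholds come from the description.

```python
class ChainedHashMap:
    """Hash map using separate chaining with load-factor-driven resizing."""

    MIN_BUCKETS = 8  # assumed floor; the description does not state one

    def __init__(self):
        self._buckets = [[] for _ in range(self.MIN_BUCKETS)]
        self._size = 0

    def _chain(self, key):
        # Each bucket is a list ("chain") of (key, value) pairs.
        return self._buckets[hash(key) % len(self._buckets)]

    def _resize(self, n_buckets):
        # Rehash every entry into a fresh table of the requested size.
        old = self._buckets
        self._buckets = [[] for _ in range(n_buckets)]
        for chain in old:
            for key, value in chain:
                self._chain(key).append((key, value))

    def put(self, key, value):
        chain = self._chain(key)
        for i, (k, _) in enumerate(chain):
            if k == key:
                chain[i] = (key, value)  # overwrite an existing key
                return
        chain.append((key, value))
        self._size += 1
        if self._size > len(self._buckets):  # load factor above 100%
            self._resize(2 * len(self._buckets))  # doubling is an assumption

    def get(self, key, default=None):
        for k, v in self._chain(key):
            if k == key:
                return v
        return default

    def remove(self, key):
        chain = self._chain(key)
        for i, (k, _) in enumerate(chain):
            if k == key:
                del chain[i]
                self._size -= 1
                n = len(self._buckets)
                # Load factor below 20%: halve, but never below the floor.
                if n > self.MIN_BUCKETS and self._size < 0.2 * n:
                    self._resize(n // 2)
                return True
        return False

    def bucket_count(self):
        return len(self._buckets)

    def __len__(self):
        return self._size
```

Keeping the shrink threshold (20%) well below half of the grow threshold (100%) means a table that has just been halved sits at a load factor under 40%, so alternating inserts and removals do not trigger a resize on every call.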
Jupyter notebooks and training data containing manual head movement annotations, speech data and velocity, acceleration and jerk data.
Various functions for manipulating Hiccup data.
Analyses the movement of two points in the x-y plane (in this case, nose tip data from OpenPoseDemo.exe) and computes the velocity, acceleration and jerk of the points.
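Velocity, acceleration and jerk are the first, second and third time derivatives of position. One common way to estimate them from sampled coordinates is repeated forward differencing, sketched below in Python. The function names, the constant frame rate, and the choice of forward differences are assumptions for illustration; the repository may use a different scheme (for example smoothing or central differences).

```python
def differentiate(samples, dt):
    """Forward difference of a list of (x, y) samples taken dt seconds apart."""
    return [((x1 - x0) / dt, (y1 - y0) / dt)
            for (x0, y0), (x1, y1) in zip(samples, samples[1:])]


def kinematics(positions, fps):
    """Return (velocity, acceleration, jerk) for one tracked point.

    positions: list of (x, y) coordinates, one per video frame.
    fps: frames per second, assumed constant (hypothetical parameter).
    Each derivative is one element shorter than the series it is taken from.
    """
    dt = 1.0 / fps
    velocity = differentiate(positions, dt)
    acceleration = differentiate(velocity, dt)
    jerk = differentiate(acceleration, dt)
    return velocity, acceleration, jerk
```

For two tracked points, `kinematics` would simply be called once per point. Because each differencing step amplifies measurement noise, jerk estimates from raw pose data tend to be much noisier than velocity estimates.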
Docker setups for all Korp installations maintained by NorS.
Frontend demos.
Lemmatiser with an extra: it predicts lemmas as well as classes (e.g. parts of speech), based on the morphology of the input word.
Functions for upper/lower casing, for testing whether a character is a letter, and for conversion between the Unicode encodings UTF-8 and UTF-16.
Converts UTF-16 (BE/LE), UTF-32 (BE/LE) and ISO-8859-N to UTF-8. Removes the BOM and surrogate pairs from UTF-8, converting a high surrogate (a code point between U+D800 and U+DBFF) followed by a low surrogate (a code point between U+DC00 and U+DFFF) to one valid code point above U+FFFF.
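The surrogate-pair repair mentioned above follows the standard UTF-16 arithmetic: the high surrogate contributes the upper 10 bits and the low surrogate the lower 10 bits of an offset from U+10000. The Python sketch below shows that calculation on a list of code points. The function names are hypothetical and this is not the tool's own code, which also handles byte-level decoding and BOM removal.

```python
def combine_surrogates(high, low):
    """Combine a high and a low surrogate into one code point above U+FFFF."""
    if not 0xD800 <= high <= 0xDBFF:
        raise ValueError("not a high surrogate: U+%04X" % high)
    if not 0xDC00 <= low <= 0xDFFF:
        raise ValueError("not a low surrogate: U+%04X" % low)
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def merge_surrogate_pairs(codepoints):
    """Replace each high+low surrogate pair in a code point list by one code point.

    Unpaired surrogates are passed through unchanged in this sketch.
    """
    out = []
    i = 0
    while i < len(codepoints):
        cp = codepoints[i]
        has_next = i + 1 < len(codepoints)
        if (0xD800 <= cp <= 0xDBFF and has_next
                and 0xDC00 <= codepoints[i + 1] <= 0xDFFF):
            out.append(combine_surrogates(cp, codepoints[i + 1]))
            i += 2  # consumed both halves of the pair
        else:
            out.append(cp)
            i += 1
    return out
```

For example, the pair U+D83D, U+DE00 combines to U+1F600, which then encodes as a single four-byte UTF-8 sequence instead of two invalid three-byte sequences.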
Web service that wraps around Bernd Bohnet's graph-based parser
Web service that wraps around the mate POS tagger
Public repository of the META-SHARE software
Web service that wraps around the OpenNLP POS tagger