kuhumcst / rtfreader Goto Github PK
View Code? Open in Web Editor NEWText segmenter and tokeniser for Danish, English and other languages. Reads an RTF or flat text file and outputs the text, one line per sentence & optionally tokenized.
License: GNU General Public License v2.0