apertium / apertium-rus-ukr Goto Github PK
View Code? Open in Web Editor NEWApertium translation pair for Russian and Ukrainian
License: GNU General Public License v2.0
Apertium translation pair for Russian and Ukrainian
License: GNU General Public License v2.0
Russian and Ukranian apertium-rus-ukr =============================================================================== This is an Apertium language pair for translating between Russian and Ukranian. What you can use this language package for: * Translating between Russian and Ukranian * Morphological analysis of Russian and Ukranian * Part-of-speech tagging of Russian and Ukranian For information on the latter two points, see subheading "For more information" below Requirements =============================================================================== You will need the following software installed: * lttoolbox (>= 3.1.2) * apertium (>= 3.2.0) * vislcg3 (>= 0.9.7.5129) * apertium-lex-tools * apertium-rus * apertium-ukr If this does not make any sense, we recommend you look at: apertium.org Compiling =============================================================================== Given the requirements being installed, you should be able to just run: $ ./configure $ make # make install You can use ./autogen.sh instead of ./configure you're compiling from SVN. If you're using a --prefix to ./configure, make sure it's the same one you used to install apertium itself. Testing =============================================================================== If you are in the source directory after running make, the following commands should work: $ echo "здичавіння" | apertium -d . ukr-rus одичание After installing somewhere in $PATH, you should be able to do eg. $ echo "здичавіння" | apertium ukr-rus одичание The following command runs tests which are on the Apertium wiki page: $ ./regression-tests.sh Files and data =============================================================================== For monolingual data, see the modules apertium-rus and apertium-ukr. * apertium-rus-ukr.rus-ukr.lrx - Lexical selection rules for translating into Ukranian * apertium-rus-ukr.rus-ukr.t1x - Transfer rules for translating into Ukranian (first pass) * apertium-rus-ukr.rus-ukr.t2x - Transfer rules for translating into Ukranian (second pass) * apertium-rus-ukr.rus-ukr.t3x - Transfer rules for translating into Ukranian (third pass) * apertium-rus-ukr.rus-ukr.dix - Bilingual dictionary * apertium-rus-ukr.ukr-rus.lrx - Lexical selection rules for translating into Russian * apertium-rus-ukr.ukr-rus.t1x - Transfer rules for translating into Russian (first pass) * apertium-rus-ukr.ukr-rus.t2x - Transfer rules for translating into Russian (second pass) * apertium-rus-ukr.ukr-rus.t3x - Transfer rules for translating into Russian (third pass) * modes.xml - Translation modes For more information =============================================================================== * https://wiki.apertium.org/wiki/Installation * https://wiki.apertium.org/wiki/Ukranian_and_Russian * https://wiki.apertium.org/wiki/Using_an_lttoolbox_dictionary * https://wiki.apertium.org/wiki/Constraint_Grammar Help and support =============================================================================== If you need help using this language pair or data, you can contact: * Mailing list: [email protected] * IRC: #apertium on irc.oftc.net See also the file AUTHORS included in this distribution.
Hi,
The word "Russian" is sometimes translated as "москаль". I understand that this word exists in Ukrainian and that in some very particular contexts it may be a correct translation, but most of the time, it's just incorrect and unnecessarily offensive.
Here's a consistently reproducible example.
Russian input: "Вход в чаты свободный (однако не забудьте представиться и кратко рассказать о себе, если пришли впервые), рабочий язык — русский."
Ukrainian output: "Вхід у чаты свободный (однак не забудьте представиться #і коротке розказати о собі, якщо прийшли вперше), робітниця мову — москаль."
There are some other mistakes here, but the last word is the worst.
This segfaults: echo 'Boxberry с 2010 года работает на российском рынке доставки. В отделениях компании можно отправлять посылки и письма по России, Армении, Таджикистану, Беларуси, ...' | lt-proc -w '/usr/share/apertium/apertium-rus-ukr/rus-ukr.automorf.bin' | cg-proc -w '/usr/share/apertium/apertium-rus-ukr/rus-ukr.rlx.bin' | apertium-tagger -g '/usr/share/apertium/apertium-rus-ukr/rus-ukr.prob'
Whether it is a problem of apertium-tagger or the .prob file, I have no clue.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.