Comments (10)
You're better off asking on the mailing list. Hardly anyone pays attention to this forum.
http://mailman.mit.edu/mailman/listinfo/moses-support
please subscribe before you post
from mosesdecoder.
closing this. Looks like no-one's responding to this forum
from mosesdecoder.
Why close, if it is essentially open?
from mosesdecoder.
I'll reopen it, but don't be surprised if you get no response.
from mosesdecoder.
don't worry, I'll refer to it on the mailing list that you suggested :-)
from mosesdecoder.
I know nothing about Moses' sentence splitter, but give eserix a try. I have used it from time to time.
from mosesdecoder.
Thanks @tomekd, do you know whether it accommodates different languages, vs. being useful just for English? We're looking for something covering a wide range of languages, not that the Moses script was necessarily perfect at that.
from mosesdecoder.
Hi,
it supports the most popular languages:
- English
- French
- Spanish
- German
- Polish
- Russian
- Arabic
- Chinese
- Croatian
Note that it's a really simple tool that uses SRX files.
from mosesdecoder.
Well, I guess it's good to learn of SRX (Segmentation Rules eXchange) now :-) Other than reading the dry spec, may I assume that the implied algorithm is a two-step flow, where first a candidate break is matched by any of the break=yes rules, and then the break is suppressed if it also matches any of the break=no rules? Are there any notable libraries that execute the rules, or notable rule repositories? I see version 2.0 of the standard is supposed to be "safer", and Java is lagging in the regex support required for it.
Essentially, the Perl script here has a similar flow, although it seems to struggle with introducing extra spaces that it later needs to discard, and it is arguably a bit of a hack when it comes to adapting to special domains or language registers.
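To make the question concrete, here is a minimal Python sketch of the two-step flow I'm guessing at: a position is a candidate break if any break=yes rule matches there, and the candidate is dropped if any break=no rule also matches. This only illustrates my assumption; the SRX spec may well define rule precedence differently (for example, by rule order), and the rules and names below (`BREAK_RULES`, `NO_BREAK_RULES`, `split_sentences`) are made up for illustration rather than taken from any real SRX file or library.

```python
import re

# Hypothetical rules for illustration only; not taken from any real SRX file.
# Each rule is a pair: (regex for text before the break, regex for text after it).
BREAK_RULES = [
    (r"[.?!]+", r"\s+"),
]
NO_BREAK_RULES = [
    (r"\b(?:Mr|Mrs|Dr|etc)\.", r"\s+"),
]


def _matches_at(rules, text, pos):
    """Return True if any rule's 'before' pattern ends exactly at pos
    and its 'after' pattern starts exactly at pos."""
    for before, after in rules:
        before_ok = re.search(r"(?:" + before + r")\Z", text[:pos])
        after_ok = re.match(after, text[pos:])
        if before_ok and after_ok:
            return True
    return False


def split_sentences(text):
    """Assumed two-step flow: break=yes rules propose, break=no rules veto."""
    sentences, start = [], 0
    for pos in range(1, len(text)):
        is_candidate = _matches_at(BREAK_RULES, text, pos)
        if is_candidate and not _matches_at(NO_BREAK_RULES, text, pos):
            sentences.append(text[start:pos].strip())
            start = pos
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences
```

With these toy rules, `split_sentences("Dr. Smith arrived. He sat down.")` keeps "Dr." attached to the first sentence and splits after "arrived.". Scanning every position like this is quadratic in the text length, so it is only meant to show the flow, not to be used as is.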
from mosesdecoder.
looks like the mailing list got you good responses. Closing now
from mosesdecoder.