tuetschek / en-deep Goto Github PK
View Code? Open in Web Editor NEWMLProcess – a framework for batch parallel processing of various NLP tasks
MLProcess – a framework for batch parallel processing of various NLP tasks
The line numbers doesn't even reflect the actual position (i.e. the
currently parsed task).
Original issue reported on code.google.com by [email protected]
on 23 Apr 2010 at 3:56
Reduce the usage of obsolete synchronized collections.
Original issue reported on code.google.com by [email protected]
on 21 Jun 2010 at 3:16
There should be a "legal" way to stop the running workers, such as special file
that the task should check for existence upon each task retrieval.
Original issue reported on code.google.com by [email protected]
on 9 Jul 2010 at 5:03
StToArff should clear all the output files before each run.
Original issue reported on code.google.com by [email protected]
on 16 Apr 2010 at 11:42
Task reset feature should be reviewed, its behavior is sometimes strange (for a
prefix, it resets the whole plan).
Original issue reported on code.google.com by [email protected]
on 9 Jul 2010 at 5:04
If there is a change in the scenario file, it should be recognized upon
task reset (just for the tasks that are to be reset).
Original issue reported on code.google.com by [email protected]
on 16 Apr 2010 at 10:09
Issue warnings if there may be a pattern collision (one pattern is a
subpattern of another) in order to prevent problems in scenario reruns.
E.g.: file*.txt and file1*.txt exist in two different tasks, but file1*.txt
is produced later. If file1*.txt is already produced and the task that
produces file*.txt gets reset, the pattern expansion includes also
file1*.txt and it may get messy.
Original issue reported on code.google.com by [email protected]
on 30 Apr 2010 at 12:22
There should be some switch with path prefix that will be considered to be
local, e.g. /tmp. All tasks which have this prefix in their I/O specs and
depend on each other should then be lined up for computation on the same
machine.
Original issue reported on code.google.com by [email protected]
on 17 Jul 2010 at 8:33
In the current setting, the expanded task is copied along with all the
dependencies of the original task, which are then removed. This increases
complexity and reduces performance with more than ca. 10000's of tasks.
Original issue reported on code.google.com by [email protected]
on 24 Jul 2010 at 5:48
The program is yet unable to handle UTF-8 characters in the input ST files.
Original issue reported on code.google.com by [email protected]
on 21 Aug 2010 at 11:38
If an input like this is provided:
params: lang_conf="st-en.conf", omit_semclass="1", predicted="1",
pred_only="1" generate="Children";
then the parser doesn't report an error, but the last parameter is not
recognized at all.
The parser should report an error.
Original issue reported on code.google.com by [email protected]
on 16 Apr 2010 at 10:07
GreedyAttributeSearch does not work well together with attribute rankings that
do not contain all attributes.
Original issue reported on code.google.com by [email protected]
on 31 Jul 2010 at 9:49
Add new generated feature: children patterns without function words.
Original issue reported on code.google.com by [email protected]
on 16 Apr 2010 at 10:49
There should be an option, which, when selected, just parses the scenario file
and end the whole program. It would be useful just to check for errors in the
plan file before launching the process.
Original issue reported on code.google.com by [email protected]
on 9 Jul 2010 at 5:49
For a task that outputs a-**.txt, some of which are a-*-x.txt, there is no way
to capture just the a-*-x.txt in the input of another task.
There should be something like a-*|-x|.txt, which would depend on a task
producing a-**.txt, but take only a-*-x.txt as input
Original issue reported on code.google.com by [email protected]
on 7 Jul 2010 at 3:52
In the current version, sub-specifications of prefixes and suffixes for the
expansions do not work (only if they're at the beginning of the expansion
transitive line).
Original issue reported on code.google.com by [email protected]
on 26 Jul 2010 at 4:45
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.