johnmm / jatetoolkit
Automatically exported from code.google.com/p/jatetoolkit
I am new to this project. Could you provide some basic steps to build it?
Original issue reported on code.google.com by [email protected]
on 15 May 2015 at 8:25
What steps will reproduce the problem?
java -Xmx1024m -classpath /Users/sarnobat/trash/jatetoolkit-read-only/dist/:/Users/sarnobat/trash/jatetoolkit-read-only/libs/apache-log4j-1.2.15/log4j-1.2.15.jar:/Users/sarnobat/trash/jatetoolkit-read-only/libs/apache-opennlp-1.51/jwnl-1.3.3.jar:/Users/sarnobat/trash/jatetoolkit-read-only/libs/apache-opennlp-1.51/opennlp-maxent-3.0.1-incubating.jar:/Users/sarnobat/trash/jatetoolkit-read-only/libs/apache-opennlp-1.51/opennlp-tools-1.5.1-incubating.jar:/Users/sarnobat/trash/jatetoolkit-read-only/libs/dragon/dragontool.jar:/Users/sarnobat/trash/jatetoolkit-read-only/libs/hsqldb2.2.3/hsqldb.jar:/Users/sarnobat/trash/jatetoolkit-read-only/libs/hsqldb2.2.3/sqltool.jar:/Users/sarnobat/trash/jatetoolkit-read-only/libs/wit-commons/wit-commons.jar: uk.ac.shef.dcs.oak.jate.test.AlgorithmTester /Users/sarnobat/trash/jatetoolkit-read-only/nlp_resources/ test/example/ test/output
What is the expected output? What do you see instead?
don't know
What version of the product are you using? On what operating system?
trunk
Please provide any additional information below.
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
at java.util.Arrays.copyOf(Arrays.java:2734)
at java.util.ArrayList.ensureCapacity(ArrayList.java:167)
at java.util.ArrayList.add(ArrayList.java:351)
at uk.ac.shef.dcs.oak.jate.core.npextractor.NGramExtractor.getNGram(NGramExtractor.java:123)
at uk.ac.shef.dcs.oak.jate.core.npextractor.NGramExtractor.extract(NGramExtractor.java:67)
at uk.ac.shef.dcs.oak.jate.core.npextractor.NGramExtractor.extract(NGramExtractor.java:49)
at uk.ac.shef.dcs.oak.jate.core.feature.indexer.GlobalIndexBuilderMem.build(GlobalIndexBuilderMem.java:53)
at uk.ac.shef.dcs.oak.jate.test.AlgorithmTester.main(AlgorithmTester.java:83)
Original issue reported on code.google.com by [email protected]
on 12 Jun 2012 at 4:46
What steps will reproduce the problem?
java -Xmx1024m -classpath /home/minev/jate/dist/jate.jar:/home/minev/jate/libs/apache-log4j-1.2.15/log4j-1.2.15.jar:/home/minev/jate/libs/apache-opennlp-1.51/jwnl-1.3.3.jar:/home/minev/jate/libs/apache-opennlp-1.51/opennlp-maxent-3.0.1-incubating.jar:/home/minev/jate/libs/apache-opennlp-1.51/opennlp-tools-1.5.1-incubating.jar:/home/minev/jate/libs/dragon/dragontool.jar:/home/minev/jate/libs/hsqldb2.2.3/hsqldb.jar:/home/minev/jate/libs/hsqldb2.2.3/sqltool.jar:/home/minev/jate/libs/wit-commons/wit-commons.jar: uk.ac.shef.dcs.oak.jate.test.TestFrequency /home/minev/jate/nlp_resources/ test/example test/output
What is the expected output? What do you see instead?
Exception in thread "main" java.io.FileNotFoundException: D:/work/JATR_SDK/jate_googlecode/nlp_resources/stoplist.txt (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:120)
at java.io.FileReader.<init>(FileReader.java:55)
at uk.ac.shef.dcs.oak.jate.util.control.StopList.loadStopList(StopList.java:44)
at uk.ac.shef.dcs.oak.jate.util.control.StopList.<init>(StopList.java:30)
at uk.ac.shef.dcs.oak.jate.test.TestFrequency.main(TestFrequency.java:70)
What version of the product are you using? On what operating system?
Product version: 1.1
OS: Mac OS 10.6 and Ubuntu Linux 11.04
Please provide any additional information below.
I've updated the config file jate_1.1.properties (both the copy in the root dir and the one in /src).
Original issue reported on code.google.com by [email protected]
on 21 Feb 2013 at 4:38
What steps will reproduce the problem?
1. In AlgorithmTester, activate the option to write a persistent HSQL database
2. Check the database script
What is the expected output? What do you see instead?
The term2doc table has only one document for each term
The doc2term table has only one term for each document
The term2var table has only one var for each term
What version of the product are you using? On what operating system?
1.1 / ubuntu 12.04
Please provide any additional information below.
In GlobalIndexWriterHSQL.persist there is a "=" instead of a "+=" in the
inner for loops of the code blocks
//persist term2DocMap, //persist doc2TermMap and //persist termId2VarIds map,
i.e.
for (Map.Entry<Integer, Set<Integer>> e : index.getTerm2Docs().entrySet()) {
    pstmt.setInt(1, e.getKey());
    String docIdStr = "";
    for (Integer i : e.getValue()) {
        // Before (bug): "=" overwrites docIdStr, so only the last id is kept.
        //docIdStr=HSQLDBUtil.VALUE_SEPARATOR+i;
        // After (fix): "+=" appends every id.
        docIdStr += HSQLDBUtil.VALUE_SEPARATOR + i;
    }
    // ... (rest of the loop body is not shown in this report)
}
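To make the effect of the one-character difference concrete, here is a minimal standalone sketch. It does not use JATE or HSQLDB: the class name, the method names, and the "," separator are assumptions for illustration only (the real value of HSQLDBUtil.VALUE_SEPARATOR is not shown in this report).

```java
import java.util.LinkedHashSet;
import java.util.Set;

public class JoinIdsSketch {
    // Assumption: stand-in for HSQLDBUtil.VALUE_SEPARATOR; the real value is not shown in the report.
    static final String VALUE_SEPARATOR = ",";

    // Buggy variant: "=" replaces the string on every iteration, so only the last id survives.
    static String joinOverwriting(Set<Integer> ids) {
        String s = "";
        for (Integer i : ids) {
            s = VALUE_SEPARATOR + i;
        }
        return s;
    }

    // Fixed variant: every id is appended (StringBuilder is equivalent to "+=" here).
    static String joinAppending(Set<Integer> ids) {
        StringBuilder sb = new StringBuilder();
        for (Integer i : ids) {
            sb.append(VALUE_SEPARATOR).append(i);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // LinkedHashSet keeps insertion order, so the output is deterministic.
        Set<Integer> docIds = new LinkedHashSet<>();
        docIds.add(3);
        docIds.add(7);
        docIds.add(9);
        System.out.println(joinOverwriting(docIds)); // prints ",9"
        System.out.println(joinAppending(docIds));   // prints ",3,7,9"
    }
}
```

With the overwriting variant each row would store a single id, which matches the symptom described above (one document per term, one term per document).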
Original issue reported on code.google.com by [email protected]
on 29 Jan 2013 at 10:57
What steps will reproduce the problem?
1. TestFrequency counter for verb phrases
2. I've created a class analogous to NounPhraseExtractorOpenNLP and updated
the strings to B-VP and I-VP
3. The algorithm still counts/extracts the noun phrases
What is the expected output? What do you see instead?
Verb Phrase + counter
What version of the product are you using? On what operating system?
1.11
Please provide any additional information below.
Which properties must also be changed so that only verb phrases are counted?
The OpenNLP parser supports this type of annotation.
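For reference, this is roughly what filtering on the B-VP / I-VP tags amounts to. It is a minimal sketch that is independent of both JATE and OpenNLP: the class and method names are hypothetical, and the tokens and chunk tags are hand-written rather than produced by a real chunker. It does not answer which JATE properties need changing.

```java
import java.util.ArrayList;
import java.util.List;

public class VerbPhraseChunkSketch {
    // Groups tokens into phrases of the given chunk type ("VP", "NP", ...),
    // assuming BIO-style tags: "B-<type>" starts a phrase, "I-<type>" continues it.
    static List<String> extractPhrases(String[] tokens, String[] chunkTags, String type) {
        List<String> phrases = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < tokens.length; i++) {
            String tag = chunkTags[i];
            if (tag.equals("B-" + type)) {
                // A new phrase starts; flush any phrase still open.
                if (current.length() > 0) phrases.add(current.toString());
                current = new StringBuilder(tokens[i]);
            } else if (tag.equals("I-" + type) && current.length() > 0) {
                current.append(' ').append(tokens[i]);
            } else {
                // Any other tag closes the open phrase, if there is one.
                if (current.length() > 0) phrases.add(current.toString());
                current = new StringBuilder();
            }
        }
        if (current.length() > 0) phrases.add(current.toString());
        return phrases;
    }

    public static void main(String[] args) {
        // Hand-written example input (an assumption, not real chunker output).
        String[] tokens = {"The", "parser", "has", "been", "extracting", "noun", "phrases"};
        String[] tags   = {"B-NP", "I-NP", "B-VP", "I-VP", "I-VP", "B-NP", "I-NP"};
        System.out.println(extractPhrases(tokens, tags, "VP")); // prints "[has been extracting]"
        System.out.println(extractPhrases(tokens, tags, "NP")); // prints "[The parser, noun phrases]"
    }
}
```

If a custom extractor built this way still yields noun phrases, one thing worth checking is whether the test class is actually instantiating the new extractor rather than the original one.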
Original issue reported on code.google.com by [email protected]
on 19 Sep 2013 at 4:55
When I ran this on 6000 small documents it ran for 6+ hours without completing.
With the attached performance fixes that same test runs in just a few minutes.
Original issue reported on code.google.com by [email protected]
on 6 Aug 2012 at 9:32
Attachments:
Hi,
could you provide some more information about the way you compiled the BNC
reference list? Is there any reference to the source if it's external?
mihail
Original issue reported on code.google.com by [email protected]
on 22 Jul 2013 at 10:44
This is a question/feature request. In my testing, it seems like the main
bottleneck is the building of the GlobalIndex, as opposed to using the
FeatureBuilder classes for doing the counts. However, while there is a
multithreaded version of the FeatureBuilder classes, there is none for the
GlobalIndex builders. Are there plans to implement parallel versions of these
builders? I am not very experienced with Java, but I might try to implement
them if it is feasible to do so.
Matt
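As a rough idea of what a parallel builder could look like, here is a minimal sketch of building a term-to-documents map concurrently. It is a generic Java 8+ pattern, not JATE code: the class name, extractTerms (plain whitespace tokenization standing in for real candidate extraction), and the use of list positions as document ids are all assumptions for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.concurrent.ConcurrentHashMap;
import java.util.stream.IntStream;

public class ParallelIndexSketch {
    // Hypothetical stand-in for candidate term extraction: lowercased whitespace tokens.
    static String[] extractTerms(String doc) {
        return doc.toLowerCase().split("\\s+");
    }

    // Builds term -> set of document ids, processing documents in parallel.
    // Thread safety comes from ConcurrentHashMap and its concurrent key sets.
    static Map<String, Set<Integer>> buildTerm2Docs(List<String> docs) {
        Map<String, Set<Integer>> term2Docs = new ConcurrentHashMap<>();
        IntStream.range(0, docs.size()).parallel().forEach(docId -> {
            for (String term : extractTerms(docs.get(docId))) {
                term2Docs.computeIfAbsent(term, k -> ConcurrentHashMap.newKeySet()).add(docId);
            }
        });
        return term2Docs;
    }

    public static void main(String[] args) {
        List<String> docs = Arrays.asList("term extraction toolkit", "term index", "index builder");
        Map<String, Set<Integer>> index = buildTerm2Docs(docs);
        // TreeSet is used only to print the ids in a stable order.
        System.out.println(new TreeSet<>(index.get("term")));  // prints "[0, 1]"
        System.out.println(new TreeSet<>(index.get("index"))); // prints "[1, 2]"
    }
}
```

Whether this helps in practice depends on where the time goes: if the per-document extraction step dominates, parallelizing over documents should scale, but shared non-thread-safe resources (such as a single chunker instance) would need one instance per thread.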
Original issue reported on code.google.com by [email protected]
on 12 Sep 2014 at 1:15