hunterhector / dbpedia-spotlight

This project forked from dbpedia-spotlight/dbpedia-spotlight

DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.

Home Page: http://spotlight.dbpedia.org/

Shell 1.54% Python 1.44% Java 46.23% Scala 50.79%

dbpedia-spotlight's People

Contributors

dav009, dirkweissenborn, golicar, halin, hunterhector, jimkont, jodaiber, kenda, maxjakob, mvnural, pablomendes, sandroacoelho, zaknarfen

dbpedia-spotlight's Issues

NoSuchMethodError when running ExtractOccsFromWikipedia

This error occurs only when running the ExtractOccsFromWikipedia process on English Wikipedia; the other processes in index.sh work fine.

The process did produce some output, though: about 3686 lines. The first line is:

Anarchism-p2l1  Family_resemblance  family resemblance   Anarchism is a political philosophy which considers the state undesirable, unnecessary and harmful, and instead promotes a stateless society, or anarchy. It seeks to diminish or even abolish authority in the conduct of human relations. Anarchists may widely disagree on what additional criteria are required in anarchism. Oxford Companion to Philosophy says, "there is no single defining position that all anarchists hold, and those considered anarchists at best share a certain family resemblance."    481 
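The sample line reads as one tab-separated record. Below is a minimal sketch of splitting such a line, assuming five columns in the order occurrence id, DBpedia resource, surface form, context, and offset. The column names, the `OccurrenceLine` class, and the sample record in `main` are illustrative assumptions, not taken from the Spotlight code base.

```java
/** Hypothetical holder for one line of occs.tsv; field names are assumptions. */
final class OccurrenceLine {
    final String id;
    final String resource;
    final String surfaceForm;
    final String context;
    final int offset;

    OccurrenceLine(String id, String resource, String surfaceForm, String context, int offset) {
        this.id = id;
        this.resource = resource;
        this.surfaceForm = surfaceForm;
        this.context = context;
        this.offset = offset;
    }

    /** Splits a tab-separated line into the five assumed columns. */
    static OccurrenceLine parse(String line) {
        // limit -1 keeps trailing empty columns instead of silently dropping them
        String[] cols = line.split("\t", -1);
        if (cols.length < 5) {
            throw new IllegalArgumentException(
                "expected 5 tab-separated columns, got " + cols.length);
        }
        // the last column may carry trailing whitespace, so trim before parsing
        return new OccurrenceLine(cols[0], cols[1], cols[2], cols[3],
                                  Integer.parseInt(cols[4].trim()));
    }

    public static void main(String[] args) {
        // Made-up record for illustration only, not real extractor output
        String sample = "Doc-p1l1\tSome_Resource\tsome resource\t"
                      + "A context mentioning some resource here.\t21";
        OccurrenceLine occ = parse(sample);
        System.out.println(occ.resource + " at offset " + occ.offset);
    }
}
```

A quick check like this over the roughly 3686 lines that were written could show whether the output is well-formed up to the point where the process failed.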

echo $INDEX_CONFIG_FILE prints ../conf/indexing.properties, as expected.

The output of mvn -version is:

Apache Maven 2.2.1 (r801777; 2009-08-07 03:16:01+0800)
Java version: 1.7.0
Java home: /opt/java/jre
Default locale: en_US, platform encoding: UTF-8
OS name: "linux" version: "3.3.5-1-arch" arch: "amd64" Family: "unix"

The following is the full stack trace of the error.

[hector@hector-arch index]$ mvn scala:run -DmainClass=org.dbpedia.spotlight.lucene.index.ExtractOccsFromWikipedia "-DaddArgs=$INDEX_CONFIG_FILE|output/occs.tsv"
[INFO] Scanning for projects...
[INFO] ------------------------------------------------------------------------
[INFO] Building DBpedia Spotlight Indexing
[INFO]    task-segment: [scala:run]
[INFO] ------------------------------------------------------------------------
[INFO] Preparing scala:run
[INFO] [install:install-file {execution: install-weka-jar}]
[INFO] Installing /home/hector/Researches/nlp/DBpedia_Spotlight/dbpedia-spotlight/index/../lib/weka-trunk.jar to /home/hector/.m2/repository/weka/weka/3.7.3/weka-3.7.3.jar
[INFO] [resources:resources {execution: default-resources}]
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] skip non existing resourceDirectory /home/hector/Researches/nlp/DBpedia_Spotlight/dbpedia-spotlight/index/src/main/resources
[INFO] [scala:add-source {execution: scala-compile-first}]
[INFO] Add Source directory: /home/hector/Researches/nlp/DBpedia_Spotlight/dbpedia-spotlight/index/src/main/scala
[INFO] Add Test Source directory: /home/hector/Researches/nlp/DBpedia_Spotlight/dbpedia-spotlight/index/src/test/scala
[INFO] [scala:compile {execution: scala-compile-first}]
[INFO] Checking for multiple versions of scala
[INFO] includes = [**/*.scala,**/*.java,]
[INFO] excludes = []
[INFO] Nothing to compile - all classes are up to date
[INFO] [compiler:compile {execution: default-compile}]
[INFO] Nothing to compile - all classes are up to date
[INFO] [resources:testResources {execution: default-testResources}]
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] skip non existing resourceDirectory /home/hector/Researches/nlp/DBpedia_Spotlight/dbpedia-spotlight/index/src/test/resources
[INFO] [compiler:testCompile {execution: default-testCompile}]
[INFO] No sources to compile
[INFO] [scala:run {execution: default-cli}]
[INFO] Checking for multiple versions of scala
 INFO 2012-05-18 14:10:10,642 main [IndexingConfiguration] - Loading configuration file ../conf/indexing.properties
 INFO 2012-05-18 14:10:10,748 main [ExtractOccsFromWikipedia$] - Loading concept URIs from output/conceptURIs.list...
 INFO 2012-05-18 14:10:17,567 main [ExtractOccsFromWikipedia$] - Loading redirects transitive closure from output/redirects_tc.tsv...
 INFO 2012-05-18 14:10:27,077 main [FileOccurrenceSource$] - Writing occurrences to file output/occs.tsv ...
java.lang.reflect.InvocationTargetException
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:601)
    at org_scala_tools_maven_executions.MainHelper.runMain(MainHelper.java:161)
    at org_scala_tools_maven_executions.MainWithArgsInFile.main(MainWithArgsInFile.java:26)
Caused by: java.lang.NoSuchMethodError: scala.collection.immutable.StringOps.slice(II)Ljava/lang/String;
    at org.dbpedia.spotlight.io.DisambiguationContextSource$.getOccurrence(DisambiguationContextSource.scala:153)
    at org.dbpedia.spotlight.io.AllOccurrenceSource$AllOccurrenceSource$$anonfun$foreach$1$$anonfun$apply$1.apply(AllOccurrenceSource.scala:101)
    at org.dbpedia.spotlight.io.AllOccurrenceSource$AllOccurrenceSource$$anonfun$foreach$1$$anonfun$apply$1.apply(AllOccurrenceSource.scala:97)
    at scala.collection.LinearSeqOptimized$class.foreach(LinearSeqOptimized.scala:59)
    at scala.collection.immutable.List.foreach(List.scala:45)
    at org.dbpedia.spotlight.io.AllOccurrenceSource$AllOccurrenceSource$$anonfun$foreach$1.apply(AllOccurrenceSource.scala:97)
    at org.dbpedia.spotlight.io.AllOccurrenceSource$AllOccurrenceSource$$anonfun$foreach$1.apply(AllOccurrenceSource.scala:78)
    at org.dbpedia.extraction.sources.WikipediaDumpParser.readPage(WikipediaDumpParser.java:218)
    at org.dbpedia.extraction.sources.WikipediaDumpParser.readPages(WikipediaDumpParser.java:159)
    at org.dbpedia.extraction.sources.WikipediaDumpParser.readDump(WikipediaDumpParser.java:107)
    at org.dbpedia.extraction.sources.WikipediaDumpParser.run(WikipediaDumpParser.java:87)
    at org.dbpedia.extraction.sources.XMLSource$XMLFileSource.foreach(XMLSource.scala:40)
    at org.dbpedia.spotlight.io.AllOccurrenceSource$AllOccurrenceSource.foreach(AllOccurrenceSource.scala:78)
    at org.dbpedia.spotlight.filter.Filter$FilteredOccs.foreach(Filter.scala:53)
    at org.dbpedia.spotlight.filter.Filter$FilteredOccs.foreach(Filter.scala:53)
    at org.dbpedia.spotlight.filter.Filter$FilteredOccs.foreach(Filter.scala:53)
    at org.dbpedia.spotlight.io.FileOccurrenceSource$.writeToFile(FileOccurrenceSource.scala:59)
    at org.dbpedia.spotlight.lucene.index.ExtractOccsFromWikipedia$.main(ExtractOccsFromWikipedia.scala:79)
    at org.dbpedia.spotlight.lucene.index.ExtractOccsFromWikipedia.main(ExtractOccsFromWikipedia.scala)
    ... 6 more
[INFO] ------------------------------------------------------------------------
[ERROR] BUILD ERROR
[INFO] ------------------------------------------------------------------------
[INFO] wrap: org.apache.commons.exec.ExecuteException: Process exited with an error: 240(Exit value: 240)

[INFO] ------------------------------------------------------------------------
[INFO] For more information, run Maven with the -e switch
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 24 seconds
[INFO] Finished at: Fri May 18 14:10:33 HKT 2012
[INFO] Final Memory: 34M/344M
[INFO] ------------------------------------------------------------------------
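The "Caused by" line names the missing method in JVM descriptor syntax: `slice(II)Ljava/lang/String;`. The sketch below decodes such a descriptor into readable form. `DescriptorDecoder` is a hypothetical helper written for this note, not part of Spotlight or Scala. A NoSuchMethodError means the class loaded at run time lacks a method the calling code was compiled against. One common cause is a library version on the runtime classpath that differs from the compile-time one, though nothing in this report confirms that this is the cause here.

```java
/** Hypothetical helper: turns a JVM method descriptor into a readable signature. */
final class DescriptorDecoder {

    /** Decodes e.g. "(II)Ljava/lang/String;" to "(int, int) -> java.lang.String". */
    static String decode(String descriptor) {
        int close = descriptor.indexOf(')');
        if (!descriptor.startsWith("(") || close < 0) {
            throw new IllegalArgumentException("not a method descriptor: " + descriptor);
        }
        StringBuilder params = new StringBuilder();
        int[] pos = {1}; // cursor shared with readType, starts after '('
        while (pos[0] < close) {
            if (params.length() > 0) {
                params.append(", ");
            }
            params.append(readType(descriptor, pos));
        }
        pos[0] = close + 1; // the return type follows ')'
        return "(" + params + ") -> " + readType(descriptor, pos);
    }

    /** Reads one field type at pos[0] and advances the cursor past it. */
    private static String readType(String d, int[] pos) {
        int dims = 0;
        while (d.charAt(pos[0]) == '[') { // each '[' adds one array dimension
            dims++;
            pos[0]++;
        }
        char code = d.charAt(pos[0]++);
        String base;
        switch (code) {
            case 'B': base = "byte"; break;
            case 'C': base = "char"; break;
            case 'D': base = "double"; break;
            case 'F': base = "float"; break;
            case 'I': base = "int"; break;
            case 'J': base = "long"; break;
            case 'S': base = "short"; break;
            case 'Z': base = "boolean"; break;
            case 'V': base = "void"; break;
            case 'L': { // object type: L<internal/class/name>;
                int end = d.indexOf(';', pos[0]);
                if (end < 0) {
                    throw new IllegalArgumentException("unterminated class name in: " + d);
                }
                base = d.substring(pos[0], end).replace('/', '.');
                pos[0] = end + 1;
                break;
            }
            default:
                throw new IllegalArgumentException("unknown type code: " + code);
        }
        StringBuilder sb = new StringBuilder(base);
        for (int i = 0; i < dims; i++) {
            sb.append("[]");
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Descriptor copied from the "Caused by" line of the trace
        System.out.println(decode("(II)Ljava/lang/String;"));
        // prints: (int, int) -> java.lang.String
    }
}
```

Read this way, the JVM looked for `StringOps.slice(int, int)` returning `java.lang.String` and did not find a method with exactly that signature in the `StringOps` class it loaded.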
