nirmal-mewada / textrank Goto Github PK
View Code? Open in Web Editor NEWLicense: BSD 3-Clause "New" or "Revised" License
License: BSD 3-Clause "New" or "Revised" License
Open source Java implementation of the TextRank algorithm by Mihalcea, et al. Note that this code only implements key phrase extraction based on keyword co-occurance described in section 3 of the Mihalcea-Tarau paper. This code does not yet implement the sentence extraction described in section 4 of that paper. See also: http://lit.csci.unt.edu/index.php/Graph-based_NLP GitHub code repo: http://github.com/ceteri/textrank/ GoogleGroups discussion: http://groups.google.com/group/textrank-dev Paco NATHAN [email protected] @pacoid http://www.google.com/profiles/ceteri NB: There is a known issue with use of JWNL (Java libraries for WordNet) such that if the graph size exceeds a particular threshold, then low-level Java I/O reads to the WordNet database on disk will cause Java thread to block -- even though JVM tools show no blocked threads. A potential remedy is to dump WordNet, or at least the parts of it used here, into some DBD structure with an in-memory cache. --------- simple test: ant run test with a specific data file FOO.txt ant -Ddata.file=FOO.txt run build the JAR for export to another project: ant jar --------- Sources for third-party JAR files: commons-logging-1.1.1.jar http://commons.apache.org/downloads/download_logging.cgi commons-math-1.2.jar http://commons.apache.org/downloads/download_math.cgi log4j-1.2.15.jar http://logging.apache.org/log4j/1.2/download.html porterstemmer.jar http://snowball.tartarus.org/download.php opennlp-tools-1.3.0.jar http://opennlp.sourceforge.net/ maxent-2.4.0.jar https://sourceforge.net/projects/maxent/ sptoolkit.jar http://text0.mib.man.ac.uk:8080/scottpiao/sent_detector trove-2.0.2.jar http://trove4j.sourceforge.net/ jwnl-1.4rc1.jar http://sourceforge.net/projects/jwordnet jdom-1-1.jar http://jdom.org/downloads/index.html
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.