Coder Social home page Coder Social logo

Andreas Wagner's Projects

tinystats icon tinystats

Statistics about data (cardinality estimation, frequent item detection, approximate counting,...)

tokenreplacer icon tokenreplacer

Token Replacer is a simple and small Java Library that helps replacing tokens in strings. You can replace the tokens with static values or create values "on-the-fly" by calling a generator. You can even pass arguments to the generator which makes it pretty powerful.

topk-setsimilarityjoin icon topk-setsimilarityjoin

JAVA implementation of Top-kSimilarityJoin (algorithm 3) from C. Xiao, W. Wang, X. Lin, and H. Shang. Top-k set similarity joins. In ICDE, pages 916–927, 2009.

trie icon trie

A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.

trie4j icon trie4j

PATRICIA, Double Array, LOUDS Trie implementations for Java

unificator icon unificator

Phonetisation of french words in noisy context - Modified version of Phonetic by E. Berger

universal-recommender icon universal-recommender

Java™ Programming Language™ library for recommendation engine implementation and scientific evaluation (2009–2010)

upm-full icon upm-full

Unsupervised Products Matching via Clustering, Combinatorics and Verification

user-agent-ml icon user-agent-ml

A hybrid approach that uses rules and machine learning for detecting whether a user agent string refers to a bot or not.

vec4ir icon vec4ir

Word Embeddings for Information Retrieval

vectorai icon vectorai

Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.

vectorz icon vectorz

Fast and flexible numerical library for Java featuring N-dimensional arrays

vico icon vico

Multi-sense word embeddings from visual co-occurrences

viterbi icon viterbi

An implementation of HMM-Viterbi Algorithm 通用的维特比算法实现

watset-java icon watset-java

An implementation of the Watset clustering algorithm in Java.

wned icon wned

A sytem for Named Entity Disambiguation based on Random Walks and Learning to Rank.

word icon word

Java分布式中文分词组件 - word分词

word2vecfjava icon word2vecfjava

Word2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions

wordcorr icon wordcorr

A simple ligh-weight Java based in-memory word co-occurrence calculator

words-grouping icon words-grouping

tool for listing most common words from a file with given tolerance for each group (using Levenshtein distance)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.