andy-wagner
Name: Andreas Wagner
Type: User
Company: [email protected]
Bio: Serial entrepreneur, data, information retrieval and machine learning geek
Location: Germany
Blog: www.searchhub.io
Proxy Elasticsearch requests and add similarity sorting
Similarity search combining query relaxation and diversification. Web, structs2
Similarity or Distance Metrics, e.g. Levenshtein, for Java
SimMetrics is a Similarity Metric Library for strings (as of http://sourceforge.net/projects/simmetrics/)
Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
Draft of a simplified ontology format specially designed from a search and retrieval perspective
An implementation of the SimRank algorithm in Java
Semantic Category Disambiguation using SimString, large lexical resources and LibLinear
This is a Java implementation of a fast approximate string matching algorithm (SimString).
This project is the Java implementation for SimString
Serverless Stanford Named Entity Recognizer
A boosting dismax query parser for Apache Solr. The bmax query parser relies on field types and tokenizer chains to parse the user query, discover synonyms, and boost or penalize terms at query time, which makes it highly configurable. The resulting Lucene query is a boosted and reranked dismax query with a minimum must match of 100%.
Elevation Query Extension
Apache Solr add-ons for detecting and managing quantities
Elasticsearch sort script
Space Saving algorithm implementation (StreamSummary) in Java, used to solve heavy hitters / topk items.
👑 spaCy building blocks and visualizers for Streamlit apps
Fuzzy matching and more functionality for spaCy.
Metaphone is a phonetic algorithm, published in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar. As with Soundex, similar-sounding words should share the same keys.
Spell: Archive / Drafts [Not current / Not being updated]
A seq2seq model that can correct spelling mistakes.
Tutorial on creating a spelling correction Python application using Gingerit and Streamlit
Symmetric Delete spelling correction algorithm using Java
Spell-correct entire sentences using NLTK FreqDist and SymSpell
REST API for spell checking and auto completion using R-Way Trie
SpellGCN
Multi-language, multi-word, UTF-8 spelling correction server using a Levenshtein automaton and a trie.
Implementation of an isolated word spelling error corrector based on the noisy channel model
A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index build, search, and distributed online serving toolkits for large-scale vector search scenarios.