arvinzhuang Goto Github PK
Name: Shengyao Zhuang
Type: User
Company: The University of Queensland
Bio: Interested in IR and NLP.
Twitter: ShengyaoZhuang
Location: Brisbane
Name: Shengyao Zhuang
Type: User
Company: The University of Queensland
Bio: Interested in IR and NLP.
Twitter: ShengyaoZhuang
Location: Brisbane
Experiment code for Adaptive Exploration in Online Learning to Rank
Anserini is a Lucene toolkit for reproducible information retrieval research
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
A python module to scrape arxiv.org for specific date range and categories
Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
NAACL2021 - COIL Contextualized Lexical Retriever
Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.
The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, Guido Zuccon and Daxin Jiang.
A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"
Markdown - you can mark up titles, lists, tables, etc., in a much cleaner, readable and accurate way if you do it with HTML.
Submission archive for the MS MARCO document ranking leaderboard
MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, and passage ranking. A variant of this task will be the part of TREC and AFIRM 2019. For Updates about TREC 2019 please follow This Repository Passage Reranking task Task Given a query q and a the 1000 most relevant passages P = p1, p2, p3,... p1000, as retrieved by BM25 a succeful system is expected to rerank the most relevant passage as high as possible. For this task not all 1000 relevant items have a human labeled relevant passage. Evaluation will be done using MRR
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
An onlinel learning to rank python codebase.
a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
A Python framework for performing information retrieval experiments, building on http://terrier.org/
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Information Retrieval Relevance Judging System
Build Text Rerankers with Deep Language Models
Multilingual Sentence & Image Embeddings with BERT
Code and documentation to train Stanford's Alpaca models, and generate the data.
Tevatron - A flexible toolkit for dense retrieval research and development.
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
Train transformer language models with reinforcement learning.
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.
utilities for decoding deep representations (like sentence embeddings) back to text
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.