The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, Guido Zuccon and Daxin Jiang.

dsi-transformers

A Hugging Face Transformers implementation of "Transformer Memory as a Differentiable Search Index"

ir-superproject-2023

markdown_readme

Markdown - you can mark up titles, lists, tables, etc., in a much cleaner, more readable and more accurate way than if you do it with HTML.

msmarco-document-ranking-submissions

Submission archive for the MS MARCO document ranking leaderboard

msmarco-passage-ranking

MS MARCO (Microsoft Machine Reading Comprehension) is a large-scale dataset focused on machine reading comprehension, question answering, and passage ranking. A variant of this task will be part of TREC and AFIRM 2019. For updates about TREC 2019, please follow this repository. Passage reranking task: given a query q and the 1000 most relevant passages P = p1, p2, p3, ..., p1000, as retrieved by BM25, a successful system is expected to rerank the most relevant passage as high as possible. For this task, not all 1000 relevant items have a human-labeled relevant passage. Evaluation will be done using MRR.
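As an illustration of the MRR metric named above, here is a minimal sketch rather than the official evaluation script. The function name `mean_reciprocal_rank`, the dictionary-based input layout, and the optional `cutoff` parameter are assumptions made for this example. Queries with no relevant passage in the ranked list contribute a reciprocal rank of 0.

```python
def mean_reciprocal_rank(rankings, relevant, cutoff=None):
    """Compute MRR over a set of queries.

    rankings: dict mapping query id -> list of passage ids, best first
              (hypothetical layout chosen for this sketch).
    relevant: dict mapping query id -> set of relevant passage ids.
    cutoff:   if given, only the top `cutoff` passages are considered.
    """
    if not rankings:
        return 0.0
    total = 0.0
    for qid, ranked in rankings.items():
        candidates = ranked if cutoff is None else ranked[:cutoff]
        gold = relevant.get(qid, set())
        # Reciprocal rank of the first relevant passage, 0 if none is found.
        for rank, pid in enumerate(candidates, start=1):
            if pid in gold:
                total += 1.0 / rank
                break
    return total / len(rankings)


# Toy data: q1 finds its relevant passage at rank 2, q2 at rank 1.
rankings = {"q1": ["p3", "p1", "p2"], "q2": ["p5", "p4"]}
relevant = {"q1": {"p1"}, "q2": {"p5"}}
print(mean_reciprocal_rank(rankings, relevant))  # 0.75
```

Averaging over all queries, including those with no relevant passage retrieved, means a reranker is penalized when the first-stage BM25 candidates miss the labeled passage.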

natural-questions

Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.

oltr

An online learning to rank Python codebase.

pygaggle

a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

pyterrier

A Python framework for performing information retrieval experiments, building on http://terrier.org/

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

relevation

Information Retrieval Relevance Judging System

reranker

Build Text Rerankers with Deep Language Models

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

tevatron

Tevatron - A flexible toolkit for dense retrieval research and development.

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.

trl

Train transformer language models with reinforcement learning.

tydiqa

TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.

typos-aware-bert

vec2text

utilities for decoding deep representations (like sentence embeddings) back to text

arvinzhuang

Shengyao Zhuang's Projects
