Coder Social home page Coder Social logo

tsbert's Introduction

TSBERT:Text Similarity using BERT 基于BERT的文本相似度模型

无监督学习:向量白化、对比学习

bertwhitening:bert输出向量白化
论文:Whitening Sentence Representations for Better Semantics and Faster Retrieval
训练数据:lcqmc随机选取10000语句,抛弃标签。

SimCSE_unsupervised:采用与论文相同的损失函数
论文:SimCSE: Simple Contrastive Learning of Sentence Embeddings
训练数据:lcqmc随机选取10000语句,抛弃标签。

SimCSE_unsupervised_sp:采用与苏剑林相同的损失函数
训练数据:同上

SimCSE_unsupervised_sp_simplified:采用与苏剑林相同的损失函数,从transformers加载bert
训练数据:同上

SimCSE_unsupervised_simplified:采用与论文相同的损失函数,从transformers加载bert
训练数据:同上

ConSERT_unsupervised_shuffle:对posids进行shuffle
论文:ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
训练数据:同上

ESimCSE_unsupervised_endpoints: 采用与论文相同的损失函数
论文:ESimCSE: Enhanced Sample Building Method for Contrastive Learning of Unsupervised Sentence Embedding
训练数据:同上

监督学习:双塔模型、对比学习

SBERT:SentenceBERT
论文:Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
训练数据:lcqmc训练集

SBERT:SentenceBERT_simplified, 从transformers加载bert
论文:同上
训练数据:同上

SimCSE_supervised:采用与论文相同的损失函数
训练数据:snli随机选取10000条数据,数据格式[sentence,sentence_entailment,sentence_contradiction]

tsbert's People

Contributors

drzqb avatar

Stargazers

 avatar  avatar zxyscz avatar 21. avatar Kewen avatar JustinLee avatar

Watchers

 avatar

Forkers

zxyscz lcuacm

tsbert's Issues

无法找到预训练模型

您好,
我想使用您预训练的SBERT进行句子相似度的计算,但是运行SentenceBERT.py代码时提示无法找到预训练模型,请问可以提供一下相应的预训练模型文件吗?
谢谢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.