As your paper has noted, you used L2 similarity during end to end retrieval, but in yo

train colbert with L2 similarity, but rerank with cosine similarity? about colbert HOT 4 CLOSED

stanford-futuredata commented on June 24, 2024

train colbert with L2 similarity, but rerank with cosine similarity?

from colbert.

Comments (4)

okhat commented on June 24, 2024 1

Good question! Short answer is that both of our similarity options are identical for ranking. They only differ with a linear transformation during training, but that has no impact on the order/ranking of the passages during retrieval.

Cosine is faster, so we use it consistently in retrieval.

from colbert.

okhat commented on June 24, 2024 1

This is a valid choice.

You can train with cosine or L2, both are good although not identical. I think the results are very close.

FAISS uses L2 internally and ranking uses cosine, yes, but that's okay because the vectors are normalized.

from colbert.

wuyaoxuehun commented on June 24, 2024

Good question! Short answer is that both of our similarity options are identical for ranking. They only differ with a linear transformation during training, but that has no impact on the order/ranking of the passages during retrieval.

Cosine is faster, so we use it consistently in retrieval.

Thanks! So train and rerank both use cosine similarity but index with faiss use L2, is this right?

from colbert.

okhat commented on June 24, 2024

Closing as this seems resolved. But feel free to re-open if needed.

from colbert.

train colbert with L2 similarity, but rerank with cosine similarity? about colbert HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent