What's the easiest way to use ColBERT without loading

can't load full index into memory,about stanford-futuredata/colbert

Comments (7)

okhat commented on June 24, 2024

You just need to use batch-mode retrieval and ranking!

Just keep in mind it's two steps, not one. There are some instructions in the README. Let me know if you face issues using them.

Batch retrieval loads only the compress FAISS index and retrieves the initial (unsorted) set of passages. Batch re-ranking streams over the index one part at a time, so it uses a tiny fraction of memory at any point.

from colbert.

JamesDeAntonis commented on June 24, 2024

Very cool!

By two-step, you're referring to how the second (re-ranking) step in end-to-end isn't implemented yet? As suggested here

from colbert.

okhat commented on June 24, 2024

The second step is implemented. You just need to use a different script colbert.retrieve then colbert.rerank (give it the output topk).

What isn't implemented is two steps from one script, which would be nice to have eventually. But this shouldn't affect your goals above!

from colbert.

JamesDeAntonis commented on June 24, 2024

Yeah, to clarify I meant that we can't fully do end-to-end in one shot, but currently we instead have to call retrieve and then rerank (I think that's what you said)

from colbert.

okhat commented on June 24, 2024

Precisely! Give it a run. It should be really fast and smooth, I hope :D

from colbert.

JamesDeAntonis commented on June 24, 2024

This seems to be working properly!

I am also having some pains due to trying to use huggingface's model. I noticed that in the paper it is said that the used output dimension is 128, and that is the default in this repo, but the HF pretrained model uses 768. I plan to use 128 because I don't have space for 768, so I'll probably nix huggingface entirely, outside of how it's used in this repo.

Do you have the dim 128 model saved anywhere, as used in the paper?

from colbert.

okhat commented on June 24, 2024

Not sure if we corresponded about this by email, but as I mentioned to some other folks, I'm happy to share a checkpoint with you if you reach out by email!

from colbert.

Recommend Projects

can't load full index into memory about colbert HOT 7 CLOSED

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent