Coder Social home page Coder Social logo

Comments (7)

okhat avatar okhat commented on June 24, 2024

You just need to use batch-mode retrieval and ranking!

Just keep in mind it's two steps, not one. There are some instructions in the README. Let me know if you face issues using them.

Batch retrieval loads only the compress FAISS index and retrieves the initial (unsorted) set of passages. Batch re-ranking streams over the index one part at a time, so it uses a tiny fraction of memory at any point.

from colbert.

JamesDeAntonis avatar JamesDeAntonis commented on June 24, 2024

Very cool!

By two-step, you're referring to how the second (re-ranking) step in end-to-end isn't implemented yet? As suggested here

from colbert.

okhat avatar okhat commented on June 24, 2024

The second step is implemented. You just need to use a different script colbert.retrieve then colbert.rerank (give it the output topk).

What isn't implemented is two steps from one script, which would be nice to have eventually. But this shouldn't affect your goals above!

from colbert.

JamesDeAntonis avatar JamesDeAntonis commented on June 24, 2024

Yeah, to clarify I meant that we can't fully do end-to-end in one shot, but currently we instead have to call retrieve and then rerank (I think that's what you said)

from colbert.

okhat avatar okhat commented on June 24, 2024

Precisely! Give it a run. It should be really fast and smooth, I hope :D

from colbert.

JamesDeAntonis avatar JamesDeAntonis commented on June 24, 2024

This seems to be working properly!

I am also having some pains due to trying to use huggingface's model. I noticed that in the paper it is said that the used output dimension is 128, and that is the default in this repo, but the HF pretrained model uses 768. I plan to use 128 because I don't have space for 768, so I'll probably nix huggingface entirely, outside of how it's used in this repo.

Do you have the dim 128 model saved anywhere, as used in the paper?

from colbert.

okhat avatar okhat commented on June 24, 2024

Not sure if we corresponded about this by email, but as I mentioned to some other folks, I'm happy to share a checkpoint with you if you reach out by email!

from colbert.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.