
Comments (5)

okhat commented on July 22, 2024

Closing. But feel free to reopen if needed.

We're considering hosting a 2023 Wikipedia index instead and fixing some of the issues in the DPR corpus in it. Would this be helpful to you?

from dspy.

okhat commented on July 22, 2024

This is actually a common request! A member of the team will merge a version soon. Could you paste the same request in a new issue? I'll forward it to him.


okhat commented on July 22, 2024

Thanks for the report! I've looked into this. Basically, the page on "2015 Diamond Head Classic" (and also the 2016, 2017, and 2018 pages) isn't in the downloaded corpus, possibly because the crawler/parser decided it was too short and removed it. The corpus is the DPR wiki-100 corpus, in case you'd like to use it directly.

In my experience, whenever such a direct query fails to find the document, 90% of the time it's just not in the index (or, a bit less likely, the passage splitting is unfavorable).
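If you want to check this yourself, here is a minimal sketch that scans a local copy of the corpus for a page title. It assumes the corpus is a tab-separated file with a header row and `id`, `text`, and `title` columns (I believe that is how the DPR passage dump is laid out, but please verify against your download). The function name `find_passages_by_title` is just something I made up for illustration.

```python
import csv
from typing import Iterator, Tuple


def find_passages_by_title(tsv_path: str, needle: str) -> Iterator[Tuple[str, str]]:
    """Yield (passage_id, title) for every row whose title contains `needle`.

    Assumption: the file is tab-separated with a header row that names
    the columns `id`, `text`, and `title`.
    """
    needle = needle.lower()
    with open(tsv_path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f, delimiter="\t")
        for row in reader:
            title = row.get("title") or ""
            # Case-insensitive substring match on the page title only.
            if needle in title.lower():
                yield row["id"], title
```

Calling `list(find_passages_by_title(path_to_corpus, "2015 Diamond Head Classic"))` on your copy and getting an empty list would suggest the page never made it into the corpus, so no retriever built on it could return it. If the title does show up, the passage splitting is the next thing to look at.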


abhinavkulkarni commented on July 22, 2024

Hey @okhat,

A newer version of Wikipedia would undoubtedly help, but I am currently only trying out a few ideas; I could work with the 2019 corpus.

I couldn't find how to set up a remote ColBERTv2 server like the one used in the DSP notebook. All I could find in the ColBERTv2 README was the Python API. Could you please elaborate on that?

I am trying to set up a small ColBERTv2 server on a remote GPU-enabled machine and would like to query it from my laptop for experimentation.

Thanks!


abhinavkulkarni commented on July 22, 2024

Thanks @okhat, I have added the issue here: stanford-futuredata/ColBERT#173

