Comments (5)
Closing. But feel free to reopen if needed.
We're considering hosting a 2023 Wikipedia index instead, and fixing some of the issues in the DPR corpus while we're at it. Would this be helpful to you?
from dspy.
This is actually a common request! A member of the team will merge a version soon. Could you paste the same request in a new issue? I'll forward it to him.
Thanks for the report! I've looked into this. The page on "2015 Diamond Head Classic" (and also 2016, 2017, 2018) isn't in the downloaded corpus, possibly because the crawler/parser decided it was too short and removed it. The corpus is DPR wiki-100, in case you'd like to use it directly.
In my experience, whenever such a direct query fails to find the document, 90% of the time the document is simply not in the index (or, a bit less likely, the passage splitting is unfavorable).
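One way to check the "not in the index" case yourself is to scan the corpus file for the title before debugging the retriever. The snippet below is only a rough sketch: it assumes a tab-separated file with a header row that includes a `title` column, and the function name `find_titles` and the sample rows are made up for illustration. Adjust the column name if your copy of the corpus differs.

```python
import csv
import io


def find_titles(tsv_file, needle, limit=5):
    """Return up to `limit` distinct titles containing `needle` (case-insensitive).

    Assumes `tsv_file` is an open text file whose first row is a header
    with a `title` column (an assumption about the corpus layout).
    """
    reader = csv.reader(tsv_file, delimiter="\t")
    header = next(reader)
    title_col = header.index("title")
    needle = needle.lower()
    hits = []
    for row in reader:
        if len(row) <= title_col:
            continue  # skip malformed rows
        title = row[title_col]
        if needle in title.lower() and title not in hits:
            hits.append(title)
            if len(hits) >= limit:
                break
    return hits


# Made-up sample rows standing in for the real corpus file.
sample = io.StringIO(
    "id\ttext\ttitle\n"
    "1\tHonolulu is the capital of Hawaii.\tHonolulu\n"
    "2\tDiamond Head is a volcanic cone on Oahu.\tDiamond Head, Hawaii\n"
)
print(find_titles(sample, "2015 Diamond Head Classic"))
```

An empty result for the exact title suggests the page never made it into the corpus, so no retriever setting will surface it. With a real file, pass `open(path, newline="", encoding="utf-8")` instead of the in-memory sample.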
Hey @okhat,
A newer version of Wikipedia would undoubtedly help, but I am currently only trying out a few ideas; I could work with the 2019 corpus.
I couldn't find how to set up a remote ColBERTv2 server like the one used in the DSP notebook. All I could find in the ColBERTv2 README was the Python API. Could you elaborate on that?
I am trying to set up a small ColBERTv2 server on a remote GPU-enabled machine and would like to query it from my laptop for experimentation.
Thanks!
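To make the request concrete, here is a rough sketch of the kind of setup described above: a small HTTP endpoint on the remote machine and a client call from the laptop. It uses only the Python standard library and does not use ColBERT's API. The `/search` path, the `query` and `k` parameters, the JSON response shape, and the `search` function are all hypothetical placeholders; on the GPU machine, `search` would be replaced by a call into a loaded ColBERTv2 index.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs, urlencode, urlparse
from urllib.request import urlopen

# Placeholder passages; a real server would query a ColBERTv2 index instead.
PASSAGES = [
    "Honolulu is the capital of Hawaii.",
    "Diamond Head is a volcanic cone on Oahu.",
]


def search(query, k):
    """Hypothetical stand-in ranker: scores passages by shared lowercase words."""
    words = set(query.lower().split())
    scored = [(len(words & set(p.lower().split())), p) for p in PASSAGES]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [{"text": p, "score": s} for s, p in scored[:k]]


class SearchHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        url = urlparse(self.path)
        if url.path != "/search":
            self.send_error(404)
            return
        params = parse_qs(url.query)
        query = params.get("query", [""])[0]
        k = int(params.get("k", ["3"])[0])
        body = json.dumps({"query": query, "topk": search(query, k)}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # silence per-request logging in this demo


def query_server(base_url, query, k=3):
    """Client side: what the laptop would run against the remote machine."""
    with urlopen(f"{base_url}/search?{urlencode({'query': query, 'k': k})}") as resp:
        return json.load(resp)


def demo():
    """Start the server on a free local port, send one query, then shut down."""
    server = HTTPServer(("127.0.0.1", 0), SearchHandler)
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    try:
        port = server.server_address[1]
        return query_server(f"http://127.0.0.1:{port}", "Diamond Head volcanic cone", k=1)
    finally:
        server.shutdown()
        server.server_close()


print(demo())
```

In a real deployment the server would bind to an address reachable from the laptop (or be reached through an SSH tunnel), and `query_server` would be pointed at that address instead of the loopback one used in `demo`.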
Thanks @okhat, I have added the issue here: stanford-futuredata/ColBERT#173