Comments (9)
You’re right about this. We are pushing a number of fixes for HF server and clients soon. We’re just starting to play with them — expect many rough edges for a couple of weeks, but then should become something stable
from dspy.
Update: I just figured out the reason it is not working was somehow related to my usage of ngrok, if I have the the server and client running on the same machine it works fine. I will try to get it running through ngrok as well now.
from dspy.
Thanks for looking into this! This HF side of DSP is not very mature right now, so let us know if you face any issues. The FLAN models may not be ideal here based on our initial testing
from dspy.
Why are the FLAN models not ideal? Just because they lack the "emergent abilities" to really make use of DSP programs or some technical reason? I wasn't planning on using FLAN for my experiments but Llama and Open Assistant type models. So I am wondering if there is anything i should watch out for.
from dspy.
Yeah, you may see better results with decoder-only models, like the ones you mentioned, especially if you use closed LLMs as the demonstrator. We’ll share more soon
from dspy.
Very nice, I am excited to see you further progress.
(I hope "compiling" LoRa adapters on consumer GPUs is somewhere on your roadmap, since that would be a really exciting feature for me)
In my fork of DSP I have now adapted the hf_client.py a little bit so that you can pass an ngrok url to the object and it will just work with the hf_server.py.
This is very useful for me since I am serving the models in a HPC cluster an then ngrok them out to the public internet so I can access my models from any device so I think this may be useful for others as well :)
from dspy.
Yup we already have these LoRAs — @paridhimaheshwari2708 maybe we should merge something soon?
from dspy.
I noticed that when using hf_client that my kwargs are not sent to the hf_server. This is a problem when trying the self-consistency use cases. Thus in hf_client in the generate function I changed:
From:
payload = {"prompt": prompt, **kwargs}
To:
payload = {"prompt": prompt, "kwargs": kwargs}
This seemed to fix the problem.
Can you tell me the reason why the ** operator was there initially?
from dspy.
By the way things are much more stable now with TGI client in DSPy. Feel free to let us know if you face problems in the new version of the framework.
from dspy.
Related Issues (20)
- AttributeError: 'dict' object has no attribute '_store' HOT 1
- Can't able to import the dspy package HOT 1
- Error when running Evaluate: 'str' object has no attribute '_store' HOT 3
- Upgrade pydantic to ^2.6 HOT 2
- Support python 3.12 HOT 2
- Adding `self.history = []` in the custom LM doc example HOT 1
- Pickle error when trying to compile module that contains ChromadbRM HOT 4
- max_tokens error
- format_handler error when setting non-string as type for OutputField HOT 4
- How to remove and disable cache? HOT 1
- Azure OpenAI (1.12.0) `Completions.create() got an unexpected keyword argument 'api_base'` Error HOT 2
- Conflicting openai dependencies
- Docs Discrepancy / Clarification Needed HOT 3
- Questionable Ruff Errors - Define Code Style? HOT 1
- How to change the generation length
- BootstrapFewShot failing due to lm.copy for AzureOpenAI
- Deeplake retriever import issue for openai.error
- colab note: An error with suggestion was encountered during execution HOT 2
- Tag releases
- multihot_finetune colab notebook bug when `4) Compile the program with Llama2-13b-chat`
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dspy.