Comments (3)
Looks like vLLM changed a parameter name. Here's my fix to get this project working with vLLM 0.4.1:
--- a/llama3-8b-instruct/bentovllm_openai/utils.py
+++ b/llama3-8b-instruct/bentovllm_openai/utils.py
@@ -51,7 +51,7 @@ def openai_endpoints(
chat_template=None,
):
super(OpenAIServingChat, self).__init__(
- engine=engine, served_model=served_model,
+ engine=engine, served_model_names=served_model,
lora_modules=None,
)
self.response_role = response_role
@@ -75,7 +75,7 @@ def openai_endpoints(
return super()._load_chat_template(chat_template)
self.openai_serving_completion = OpenAIServingCompletion(
- engine=self.engine, served_model=served_model,
+ engine=self.engine, served_model_names=served_model,
)
self.chat_template = chat_template
diff --git a/llama3-8b-instruct/requirements.txt b/llama3-8b-instruct/requirements.txt
index fce4600..194a74e 100644
--- a/llama3-8b-instruct/requirements.txt
+++ b/llama3-8b-instruct/requirements.txt
@@ -1,6 +1,6 @@
accelerate==0.29.3
-bentoml>=1.2.11
+bentoml>=1.2.12
packaging==24.0
-torch==2.1.2
-transformers==4.39.3
-vllm==0.4.0.post1
+torch==2.2.1
+transformers==4.40.0
+vllm==0.4.1
diff --git a/llama3-8b-instruct/service.py b/llama3-8b-instruct/service.py
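If the same code has to run against both the old and the new vLLM, one option is to pick the keyword name at runtime instead of hard-coding it. This is only a minimal sketch: `served_model_kwarg`, `OldServing` and `NewServing` are hypothetical names made up for illustration, and the two classes are stand-ins for the real vLLM serving classes, whose full signatures are not shown here. The value is passed through unchanged, exactly as in the diff above.

```python
import inspect


def served_model_kwarg(cls, served_model):
    """Return the keyword argument cls.__init__ accepts for the served model.

    Hypothetical helper: it checks only the parameter *name*, based on the
    rename shown in the diff (served_model -> served_model_names).
    """
    params = inspect.signature(cls.__init__).parameters
    if "served_model_names" in params:
        # Newer signature (the name used after the rename).
        return {"served_model_names": served_model}
    # Older signature (the name used before the rename).
    return {"served_model": served_model}


# Stand-in classes so the sketch runs without vLLM installed (assumptions).
class OldServing:
    def __init__(self, engine, served_model, lora_modules=None):
        self.served = served_model


class NewServing:
    def __init__(self, engine, served_model_names, lora_modules=None):
        self.served = served_model_names


# Usage: build the keyword arguments once, then splat them into the call.
old = OldServing(engine=None, **served_model_kwarg(OldServing, "llama3"))
new = NewServing(engine=None, **served_model_kwarg(NewServing, "llama3"))
```

In `utils.py` the call would then look like `super().__init__(engine=engine, lora_modules=None, **served_model_kwarg(...))`, but pinning `vllm==0.4.1` as in the requirements change above is the simpler route if only one version needs to be supported.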
from bentovllm.
I'm not a Git expert, but I attempted to create a PR for you:
#39
@ProVega thanks for finding this! We need to update the same file in multiple folders. I will create another PR to fix this later.