Comments (9)
I've opened a PR that should address this issue using the method outlined in my previous comment. I've run this locally with no issues since it circumvents the need for compiling entirely.
from serge.
I have the same problem.
from serge.
This issue seems to arise while compiling https://github.com/ggerganov/llama.cpp from source. It seems like the simple solution here would be to simply use the prebuilt Docker image from that repo as a build step as opposed to compiling from source.
from serge.
I've noticed that the errors refer to /usr/local/lib/gcc of which doesn't exist for me, but gcc and build-essentials are installed.
from serge.
I've opened a PR that should address this issue using the method outlined in my previous comment. I've run this locally with no issues since it circumvents the need for compiling entirely.
I'm sorry for not understanding but what is done with the repository that you located in your other comment
from serge.
Seems related to ggerganov/llama.cpp#196 and by extension ggerganov/llama.cpp#535.
I tweaked the VMware EVC CPU Mode to "Haswell" in my environment and was able to complete the compilation.
from serge.
Hey everyone! If that's still an issue, anyone willing to try out the changes in the following PR: #109
This should hopefully fix things, by switching to llama-rs. Just do:
git checkout feature/llama-rs-caching
docker compose up -d --build
from serge.
I deployed this PR as a test and I'm still running into it on my Dell PowerEdge R620, but I believe it doesn't support the AVX instruction set so this may be moot as far as it is concerned. I can test deploy on my known working setup of Ubuntu VM on MacBook if you need.
from serge.
@willjasen did you figured out the issue?
from serge.
Related Issues (20)
- 🚀 [Feature]: Add OpenVino / OpenVino Model Server HOT 1
- 🐛 [Bug]: Web interface does not render properly on mobile devices HOT 1
- 🚀 [Feature]: Add LINCE-Mistal model HOT 1
- No models running at all HOT 1
- 🚀 [Feature]: add vigogne and Marx-3B models HOT 4
- 🐛 [Bug]: Seems like support for CPU-only is gone with the new version of serge? HOT 5
- 🐛 [Bug]: Chat responses show up in all chats during response time HOT 2
- 🚀 [Feature]: Ability to rename chats HOT 1
- 🚀 [Feature]: Ability to stop response in progress
- 🤗 [Question]: Is there an easy way to pass a prompt in a URL and get the answer returned? HOT 1
- 🤗 [Question]: jetson agx orin HOT 1
- 🐛 [Bug]: serge crashes with pydantic.error_wrappers.ValidationError: 1 validation error for Chat | params -> n_gpu_layers | field required (type=value_error.missing) HOT 3
- 🚀 [Feature]: Ability to pin chats
- 🚀 [Feature]: Mistral AI
- 🐛 [Bug]: So insanely slow, even on high CPU and RAM settings. HOT 2
- 🐛 [Bug]: CodeLlama-7B instruct download stuck HOT 7
- 🤗 [Question]: I cannot use a custom model HOT 2
- 🚀 [Feature]: User Management for Privacy HOT 1
- 🐛 [Bug]: DLLAMA_BLAS_VENDOR=OpenBLAS build with pip is not enabling OpenBlas HOT 3
- how to use mixtral-8x7b-v0.1🤗 [Question]: HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from serge.