Comments (6)
@apstrom yes, the site is being worked on by Mudler and cannot be updated at this time. For now, our how-tos are being updated on my site. Thank you <3
Please check out our updated how-tos - https://io.midori-ai.xyz/howtos/
Installing an embedding model - https://io.midori-ai.xyz/howtos/easy-setup-embeddings/
Installing an LLM, SD, Embedding, TTS, and other starting models - https://io.midori-ai.xyz/howtos/easy-model-installer/
from localai.
@lunamidori5
I am happy to hear about the updated documentation. That's exciting news.
I've got models installed; the issue that I am having is with the embedding model, and with any custom embedding models that I would like to use. Only one of these issues is related to LocalAI -- the other issue is that many Hugging Face PyTorch embedding models do not seem to work natively with LocalAI. This issue is a conversion problem that I am trying to solve. I would like to run law-specific embedding models for my applications.
The issue that I detailed above with logs is that the bert-embedding model, which is installed and which does run on LocalAI, crashes. The error looks like a gRPC error. A similar error was reported in #1198.
from localai.
@apstrom did you reinstall using the model installer or the updated documentation on how to install it? The previous documentation was completely wrong, and I didn't catch it in time. I do apologize for that; that is my fault.
from localai.
@lunamidori5
The reinstalled model responds to basic (short) texts; it does not respond to long texts. I get the above error again.
Based on the behaviour, I am guessing that I need to set a context size for the model to handle longer texts.
I am looking at your new documentation. The presentation is clearer. I will provide you with some feedback on the other thread.
from localai.
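If the crash really is tied to input length (this is only the guess made in the comment above, not a confirmed cause), one way to test it is to split a long text into shorter pieces and embed each piece separately. The sketch below is a workaround, not a LocalAI feature: `chunk_words` and the 128-word default are hypothetical names and values chosen for illustration.

```python
def chunk_words(text: str, max_words: int = 128) -> list[str]:
    """Split text into consecutive pieces of at most max_words words.

    max_words is an illustrative default, not a limit documented by LocalAI.
    """
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    # Step through the word list in strides of max_words and rejoin each slice.
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]
```

If short chunks embed cleanly while the full text still fails, that points toward an input-length limit rather than a problem with the model install.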
@apstrom feedback is always welcome too. Sometimes I'm not sure where the issues lie until they are reported. I did notice that somebody mentioned API keys. I'm thinking of making a how-to that explains each variable, though I'm not sure yet. Is that something you're interested in?
from localai.
@apstrom did you try one of the examples in the quickstart? This works quite well for me:
docker run -ti -p 8080:8080 localai/localai:v2.5.1-ffmpeg all-minilm-l6-v2
You can also set that with an environment variable, for example:
MODELS=all-minilm-l6-v2
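Once the container is running, the model can be checked with a single embeddings request. This is a minimal sketch, assuming the server is reachable at `http://localhost:8080` (the port published in the command above) and that it answers on an OpenAI-style `/v1/embeddings` route with the vector under `data[0].embedding`. The helper names `build_embedding_request` and `embed` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8080"  # assumption: port published by the docker command
MODEL = "all-minilm-l6-v2"          # model name taken from the command above


def build_embedding_request(text, model=MODEL, base_url=BASE_URL):
    """Build (but do not send) a POST request for one embedding."""
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/v1/embeddings",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(text):
    """Send the request and return the embedding vector as a list of floats.

    Assumes an OpenAI-style response body: {"data": [{"embedding": [...]}]}.
    """
    request = build_embedding_request(text)
    with urllib.request.urlopen(request, timeout=30) as response:
        payload = json.load(response)
    return payload["data"][0]["embedding"]
```

Calling `embed("A short test sentence.")` first and then a much longer input makes it easy to see whether the failure depends on text length.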
from localai.