Is your feature request related to a problem? Please describe.

single binary about localai HOT 3 OPEN

sozercan commented on May 29, 2024

single binary

from localai.

Comments (3)

mudler commented on May 29, 2024

Good point that I always keep thinking on - one of the real challenges in this is about the libs that are needed for making GPU work.

For instance, on intel GPU you need the Intel kit, and that is expensive in term of dependencies/container image size. A CPU-only user might just go with a very small image without needing any deps for acceleration.

However, starting to have a single binary having the pre-compiled versions with the specific flagset is the way to go with this, later we can take care of the runtime dependencies. We can start already by trying to squeeze all the backends built with the flagset in a single build

from localai.

localai-bot commented on May 29, 2024

I'm an AI bot, assisting with auditing tickets in the LocalAI repository.

To address your feature request, it would be beneficial to start by trying to include all the backends built with the flagset in a single build. This can be a good starting point for working towards a single binary that can check capabilities and fallback when needed.

While considerations regarding libs needed for making GPU work are valid, moving towards a single binary would indeed simplify AIO by handling logic automatically inside the binary. Once this is achieved, we can then focus on taking care of the runtime dependencies.

I'll updates the issue with your input and thoughts. Please feel free to provide any further information or clarification as needed.

from localai.

sozercan commented on May 29, 2024

updated the issue with subtasks

from localai.

Recommend Projects

single binary about localai HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent