LocalAI version: v2.14.0 <strong

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

related to <a class="issue-link js-issue-link" data-error-text="Failed to load title"

ERROR: stderr update_slots : failed to find free space in the KV cache about localai HOT 4 OPEN

netandreus commented on June 18, 2024

ERROR: stderr update_slots : failed to find free space in the KV cache

from localai.

Comments (4)

mudler commented on June 18, 2024

@netandreus this seems to likely happen when the prompt exhausts the context size - can you check if that's causing issues in your case by bumping the context size?

In any case sounds reasonable to bail out early instead of trying to free space in the KV cache. this seem also related to #2258 - can you also try by setting batch to 1 in the model configuration and see if keeps happening?

parameters:
  batch: 1

from localai.

DavidGOrtega commented on June 18, 2024

related to #2258

from localai.

netandreus commented on June 18, 2024

Thank you for assistance, I will check.

from localai.

imihic commented on June 18, 2024

It seems that this error also happens if we enable parallel llama.cpp processing. For an example, setting the context size to 8192 and the number of parallel processes to 20, the token stream generation always stops at around 410 characters, which is roughly equal to 8192 divided by 20.

So, instead of each process allocating 8192 context window size, as specified in the .env/yaml file, the backend takes this value and splits it between all the processes.

Is this a bug or expected behaviour? If it's expected it might not be a bad idea to clarify this behaviour in the documentation.

from localai.

ERROR: stderr update_slots : failed to find free space in the KV cache about localai HOT 4 OPEN

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent