Comments (3)
input_dict = {
    "inputs": test_data[idx],
    "parameters": {
        "adapter_id": "merror/llama_13b_lora_beauty",
        "max_new_tokens": 256,
        "top_p": 0.7
    }
}
If the adapter_id points to the local filesystem, "adapter_source": "local"
is also required.
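For illustration, here is a minimal sketch of such a local-adapter payload; the filesystem path and the prompt string below are hypothetical placeholders, not values from this thread:

```python
# Sketch of a request payload for an adapter loaded from the local
# filesystem (path below is a made-up example).
input_dict = {
    "inputs": "What items are often bought together with mascara?",
    "parameters": {
        # local path instead of a Hub repo id (hypothetical)
        "adapter_id": "/data/adapters/llama_13b_lora_beauty",
        # required whenever adapter_id is a filesystem path
        "adapter_source": "local",
        "max_new_tokens": 256,
        "top_p": 0.7,
    },
}
```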
from lorax.
Hi, the adapter is from the Hub.
I wonder why it looks for this repo: llama-13b-hf.
Hey @sleepwalker2017, I think I see the issue here. Looking at the adapter_config.json for this adapter, it states the base_model_name_or_path as decapoda-research/llama-13b-hf. Because your base model is /data/vicuna-13b/vicuna-13b-v1.5 (different from decapoda-research/llama-13b-hf), LoRAX will attempt an additional check here to see whether the architectures of the adapter and the base model are compatible.
In this case, the architecture compatibility check fails because decapoda-research/llama-13b-hf no longer exists (or was made private).
I think the architecture check should not cause a hard failure if it can't be performed. I'll put together a PR to make this check non-fatal in this case.
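To illustrate the proposed behavior, here is a minimal sketch (not LoRAX's actual implementation; `fetch_config` and the function name are made up for this example) of a compatibility check that degrades to a warning when the adapter's declared base repo can no longer be fetched:

```python
# Sketch: make the architecture compatibility check non-fatal when it
# cannot be performed (e.g. the adapter's declared base repo was deleted
# or made private). `fetch_config` is a hypothetical callable that
# returns a model's config dict, raising on missing/private repos.
import warnings

def architectures_compatible(adapter_base_id, base_model_id, fetch_config):
    """Return True if the architectures match, or if the check can't run."""
    if adapter_base_id == base_model_id:
        return True  # same repo, nothing to verify
    try:
        adapter_cfg = fetch_config(adapter_base_id)
        base_cfg = fetch_config(base_model_id)
    except Exception as err:  # repo missing/private, network failure, etc.
        warnings.warn(f"skipping architecture check: {err}")
        return True  # an unverifiable check is treated as non-fatal
    return adapter_cfg.get("architectures") == base_cfg.get("architectures")
```

With a `fetch_config` that raises for decapoda-research/llama-13b-hf, this version warns and lets the adapter load proceed instead of aborting it.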
Related Issues (20)
- Add all launcher args as optional in the Helm charts
- AutoTokenizer.from_pretrained needs setting with `trust_remote_code` inside `load_module_map`
- Ensure api_token is not included in the response on error
- [QUESTION] How to change HuggingFace model download Path in Lorax When deployed to Kubernetes through HelmChart
- Bug Report: lorax-launcher failed with --source "s3" for model_id "mistralai/Mistral-7B-Instruct-v0.2"
- Improve warmup checking for max new tokens when using speculative decoding
- Support inference on INF2 instance
- Reject unknown fields from API requests
- When caching adapters, cache the adapter ID + the API token pair
- Add HTTP status codes to docs
- Quantized KV Cache
- `make install` insufficient for running llama3-8B-Instruct
- Fail to run Phi-3
- Quickstart example not working
- AssertionError when using model "google/gemma-2b" with multi-gpus
- can't run lorax with docker.
- Why are qlora (4bit) and lora (16bit) adapter file sizes the same?
- Fail to load special token in phi-3
- Add Support for AutoModelForSequenceClassification Models
- can't start my local llama3 model server with docker