Comments (7)
First, thanks for open-sourcing the Qwen-7B model. I implemented QLoRA multi-turn dialogue fine-tuning on top of it. Project: https://github.com/hiyouga/LLaMA-Efficient-Tuning
QLoRA instruction fine-tuning:

CUDA_VISIBLE_DEVICES=0 python src/train_bash.py \
    --stage sft \
    --model_name_or_path Qwen/Qwen-7B-Chat \
    --do_train \
    --dataset sharegpt_zh \
    --template chatml \
    --finetuning_type lora \
    --lora_target c_attn \
    --output_dir qwen_lora \
    --per_device_train_batch_size 4 \
    --gradient_accumulation_steps 4 \
    --lr_scheduler_type cosine \
    --logging_steps 10 \
    --save_steps 100 \
    --learning_rate 3e-5 \
    --num_train_epochs 1.0 \
    --quantization_bit 4 \
    --fp16

Web Demo:

python src/web_demo.py \
    --model_name_or_path Qwen/Qwen-7B-Chat \
    --template chatml

API deployment (OpenAI-compatible format):

python src/api_demo.py \
    --model_name_or_path Qwen/Qwen-7B-Chat \
    --template chatml
Also, I hope the developers can fix the tokenizer's _decode method so that the skip_special_tokens argument actually takes effect; it is currently accepted but ignored, which is inconvenient for downstream development. (Fixed in the latest version.) Source location: huggingface.co/Qwen/Qwen-7B-Chat/blob/5e7f6a3f41724e7cb8ea3e3be7a1faf2bd5d6a38/tokenization_qwen.py#L228

def _decode(
    self,
    token_ids: Union[int, List[int]],
    skip_special_tokens: bool = False,
    clean_up_tokenization_spaces: bool = None,
    **kwargs,
) -> str:
    if isinstance(token_ids, int):
        token_ids = [token_ids]
    # note: skip_special_tokens is never consulted below
    return self.tokenizer.decode(token_ids)
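The fix only needs to filter out special token ids before handing off to the underlying decoder. A minimal sketch of that idea with a stand-in tokenizer (the real class wraps a tiktoken encoding; all names below are illustrative, not the actual Qwen implementation):

```python
from typing import List, Union

class ToyTokenizer:
    """Stand-in for the tiktoken-backed tokenizer the Qwen wrapper delegates to."""
    def __init__(self):
        self.vocab = {0: "<|im_start|>", 1: "<|im_end|>", 2: "Hello", 3: " world"}
        self.special_ids = {0, 1}  # ids of special tokens

    def decode(self, token_ids: List[int]) -> str:
        return "".join(self.vocab[i] for i in token_ids)

class PatchedWrapper:
    def __init__(self):
        self.tokenizer = ToyTokenizer()

    def _decode(
        self,
        token_ids: Union[int, List[int]],
        skip_special_tokens: bool = False,
        **kwargs,
    ) -> str:
        if isinstance(token_ids, int):
            token_ids = [token_ids]
        if skip_special_tokens:
            # drop special token ids instead of silently ignoring the flag
            token_ids = [i for i in token_ids if i not in self.tokenizer.special_ids]
        return self.tokenizer.decode(token_ids)

wrapper = PatchedWrapper()
print(wrapper._decode([0, 2, 3, 1]))                            # <|im_start|>Hello world<|im_end|>
print(wrapper._decode([0, 2, 3, 1], skip_special_tokens=True))  # Hello world
```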
ValueError: Encountered text corresponding to disallowed special token '<|im_start|>'.
If you want this text to be encoded as a special token, pass it to allowed_special, e.g. allowed_special={'<|im_start|>', ...}.
If you want this text to be encoded as normal text, disable the check for this token by passing disallowed_special=(enc.special_tokens_set - {'<|im_start|>'}).
To disable this check for all special tokens, pass disallowed_special=().
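This error comes from tiktoken's encode-time safety check: by default every special token string found in plain input text is disallowed unless explicitly allowed. A simplified pure-Python sketch of that check (the real logic lives inside tiktoken; the token set here is illustrative):

```python
SPECIAL_TOKENS = {"<|im_start|>", "<|im_end|>", "<|endoftext|>"}

def check_special(text, allowed_special=frozenset(), disallowed_special="all"):
    """Mirror tiktoken's guard: disallowed_special='all' forbids every
    special token that is not explicitly listed in allowed_special."""
    if disallowed_special == "all":
        disallowed = SPECIAL_TOKENS - set(allowed_special)
    else:
        disallowed = set(disallowed_special)
    for tok in disallowed:
        if tok in text:
            raise ValueError(
                f"Encountered text corresponding to disallowed special token {tok!r}."
            )

check_special("hello")                                             # passes
check_special("<|im_start|>hi", allowed_special={"<|im_start|>"})  # passes
check_special("<|im_start|>hi", disallowed_special=())             # passes
try:
    check_special("<|im_start|>hi")
except ValueError as e:
    print(e)  # Encountered text corresponding to disallowed special token '<|im_start|>'.
```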
from qwen.
I can run it successfully and loaded Qwen-7B. But for the OpenAI-format API, what should the client fill in as the API key?
@stuarthe Leave it blank.
OK, connected successfully. Thanks!
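For reference, a client request against the OpenAI-format endpoint served by api_demo.py can be built as sketched below. The base URL and port are assumptions (adjust to your deployment), and the key is an arbitrary placeholder, since the server does not validate it:

```python
import json
import urllib.request

# Assumed local endpoint exposed by api_demo.py; adjust host/port as needed.
API_BASE = "http://localhost:8000/v1"

payload = {
    "model": "Qwen-7B-Chat",  # informational; the server runs whichever model it loaded
    "messages": [{"role": "user", "content": "Hello"}],
}
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer EMPTY",  # any placeholder works; the key is not checked
}

request = urllib.request.Request(
    f"{API_BASE}/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers=headers,
)
# Uncomment to actually send once the server is running:
# with urllib.request.urlopen(request) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
print(request.full_url)  # http://localhost:8000/v1/chat/completions
```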
Please merge the PR in LLaMA-Efficient-Tuning; it fixes the bos token issue.
mark
The LoRA fine-tuned Qwen model cannot call the chat interface directly at all! Error: generation_config is missing the chat_ml field.
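Qwen's chat() helper validates fields on generation_config before generating, so a merged checkpoint whose generation_config lost the chat-format field fails this guard. A minimal stand-in sketch of the failure and the workaround of restoring the field from the base model's config; the exact field name and check are assumptions inferred from the error above:

```python
from types import SimpleNamespace

def chat_guard(generation_config):
    """Stand-in for the check Qwen's chat() performs on generation_config."""
    chat_format = getattr(generation_config, "chat_format", None)
    if chat_format != "chatml":
        raise ValueError("generation_config is missing the chatml chat format")
    return "ok"

# A LoRA-merged checkpoint whose generation_config lost the field:
broken_cfg = SimpleNamespace(max_new_tokens=512)
try:
    chat_guard(broken_cfg)
except ValueError as e:
    print(e)

# Workaround: restore the field from the base model's generation_config
# (with transformers this would come from
# GenerationConfig.from_pretrained("Qwen/Qwen-7B-Chat")).
broken_cfg.chat_format = "chatml"
print(chat_guard(broken_cfg))  # ok
```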
Related Issues (20)
- [BUG] Calling the Qwen model via fastchat + vLLM + OpenAI API: doesn't the data need preprocessing first? HOT 1
- Local deployment runs very slowly HOT 4
- When will 2.5 be open-sourced? HOT 1
- [BUG] AttributeError: 'NoneType' object has no attribute '__dict__' in get_peft_model() when running finetune.py HOT 2
- Why does Qwen-14B without fine-tuning give inconsistent outputs for the same question, even with temperature set to 0? HOT 2
- [BUG] torch.cuda.OutOfMemoryError: CUDA out of memory. HOT 1
- Qwen pretraining prints some content and then stops; unsure whether training completed HOT 2
- [BUG] Error when converting Qwen1.5-14B HOT 1
- How to organize multi-turn dialogue training data HOT 1
- [BUG] Questionable embedding feature shape extracted from Qwen-7B-Chat HOT 2
- [BUG] Command-line argument parsing error
- During tool calls, the model hallucinates parameters the user never provided HOT 2
- [BUG] In the model's forward function, if attention_mask[i, 0] == 0, every logit of sequence i comes out as NaN HOT 6
- What does the model's TEMPLATE look like? HOT 1
- [BUG] Full-parameter fine-tuning of qwen-14b-chat hangs HOT 1
- Q&A is laggy when running web_demo.py HOT 1
- [BUG] Does Qwen plan to support OpenAI-style function call? HOT 1
- [BUG] RuntimeError: Expected attn_mask dtype to be bool or to match query dtype, but got attn_mask.dtype: c10::BFloat16 and query.dtype: c10::Half instead. HOT 1
- Questions about quantization details HOT 1
- [BUG] The model seems stubborn: even when the prompt explicitly says not to do something, the output still ignores the instruction HOT 1