Coder Social home page Coder Social logo

Comments (4)

nuocheng avatar nuocheng commented on June 2, 2024

CUDA_VISIBLE_DEVICES=1 python3 uniform_finetune.py --model_type chatglm --model_name_or_path huggingface.co/THUDM/chatglm3-6b --data ./data/formatted_cot_data/aqua_train.json ./data/formatted_cot_data/ecqa_train.json ./data/formatted_cot_data/esnli_train.json --lora_target_modules query_key_value --lora_r 32 --lora_alpha 32 --lora_dropout 0.1
运行uniform_finetune.py这个脚本进行进行微调,出现了
image
这个问题。
使用的数据是库中自带的几个jsno数据

from alpaca-cot.

PhoebusSi avatar PhoebusSi commented on June 2, 2024

【1】 其实3是可有可无的一步,在1微调得到lora权重后,2inference时同时加载llm和lora的权重即可完成推理。而3则是将lora合进llm的操作,用lora替代原有llm中的矩阵,得到一个新的llm,可直接用新llm完成推理,不再需要同时再加载llm和lora权重了。因此3并不是一个必要操作,基本上1和2就满足了训练和测试的需求。

【2】用uniform_finetune.py跑llama-13应该是能跑起来的,是不是本地显存过低?

【3】tabular_LLM主要是提供了相关tabular数据,模型训练相关的代码请参考main分支。

from alpaca-cot.

PhoebusSi avatar PhoebusSi commented on June 2, 2024

CUDA_VISIBLE_DEVICES=1 python3 uniform_finetune.py --model_type chatglm --model_name_or_path huggingface.co/THUDM/chatglm3-6b --data ./data/formatted_cot_data/aqua_train.json ./data/formatted_cot_data/ecqa_train.json ./data/formatted_cot_data/esnli_train.json --lora_target_modules query_key_value --lora_r 32 --lora_alpha 32 --lora_dropout 0.1 运行uniform_finetune.py这个脚本进行进行微调,出现了 image 这个问题。 使用的数据是库中自带的几个jsno数据

暂时还不支持chatglm3 可以跑下chatglm2就不会有这个报错了

from alpaca-cot.

nuocheng avatar nuocheng commented on June 2, 2024

CUDA_VISIBLE_DEVICES=1 python3 uniform_finetune.py --model_type chatglm --model_name_or_path huggingface.co/THUDM/chatglm3-6b --data ./data/formatted_cot_data/aqua_train.json ./data/formatted_cot_data/ecqa_train.json ./data/formatted_cot_data/esnli_train.json --lora_target_modules query_key_value --lora_r 32 --lora_alpha 32 --lora_dropout 0.1 运行uniform_finetune.py这个脚本进行进行微调,出现了 image 这个问题。 使用的数据是库中自带的几个jsno数据

暂时还不支持chatglm3 可以跑下chatglm2就不会有这个报错了

好的,我试一下2

from alpaca-cot.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.