Comments (1)
Here is the answer from @pppppM:
- LLaVA fine-tuned by xtuner cannot be loaded the way Qwen-VL is. The Qwen-VL developers also ship a model file (modeling_qwen.py) in their Hugging Face repo, which is what lets their model be loaded that way. However, it also means Qwen-VL supports only one fixed model architecture. By contrast, LLaVA fine-tuned by xtuner supports different model architectures, such as CLIP+Vicuna, CLIP+internlm, CLIP+internlm2, and Dinov2+internlm.
- LLaVA fine-tuned by xtuner can be deployed in another way (pull #317, still under development). You can deploy the LLaVA model with the huggingface llava chatbot (based on Huggingface transformers) or the lmdeploy llava chatbot (based on LMDeploy Turbomind). The two chatbots share the same interface.
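For context, the Qwen-VL loading style mentioned in the first point can be sketched roughly as follows. This is a minimal sketch, not taken from the original answer: it assumes the `transformers` library is installed and that the repo id `Qwen/Qwen-VL-Chat` is the checkpoint in question. The helper name `load_qwen_vl` is hypothetical. The key detail is `trust_remote_code=True`, which tells transformers to run the modeling file shipped inside the repo.

```python
def load_qwen_vl(repo_id: str = "Qwen/Qwen-VL-Chat"):
    """Load a model whose architecture is defined by code shipped in its repo.

    Hypothetical helper for illustration; the repo id is an assumption.
    """
    # Imported lazily so that defining this helper does not require
    # transformers to be installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # trust_remote_code=True allows transformers to execute the custom
    # modeling/tokenization code (e.g. modeling_qwen.py) stored in the repo.
    tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
    return tokenizer, model
```

Calling `load_qwen_vl()` downloads the checkpoint, so it needs network access and enough memory. Because the architecture lives in that single modeling file, this style fits one fixed architecture, which is why it does not carry over directly to an xtuner LLaVA checkpoint that can pair different visual encoders and LLMs.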