Comments (1)
环境:
Jinja2 - 2.11.3
MarkupSafe - 2.1.5
Werkzeug - 3.0.1
最后报错:
{'loss': 0.0, 'learning_rate': 7.692307692307693e-09, 'epoch': 426.23}
{'train_runtime': 39478.1948, 'train_samples_per_second': 7.903, 'train_steps_per_second': 1.317, 'train_loss': 0.0005290311501030626, 'epoch': 426.23}
Traceback (most recent call last):
File "finetune.py", line 132, in <module>
main()
File "finetune.py", line 128, in main
model.save_pretrained(training_args.output_dir)
File "/opt/conda/lib/python3.8/site-packages/peft/peft_model.py", line 210, in save_pretrained
self.create_or_update_model_card(save_directory)
File "/opt/conda/lib/python3.8/site-packages/peft/peft_model.py", line 795, in create_or_update_model_card
card = ModelCard.load(filename) if os.path.exists(filename) else ModelCard.from_template(ModelCardData())
File "/opt/conda/lib/python3.8/site-packages/huggingface_hub/repocard.py", line 405, in from_template
return super().from_template(card_data, template_path, **template_kwargs)
File "/opt/conda/lib/python3.8/site-packages/huggingface_hub/repocard.py", line 314, in from_template
import jinja2
File "/opt/conda/lib/python3.8/site-packages/jinja2/__init__.py", line 12, in <module>
from .environment import Environment
File "/opt/conda/lib/python3.8/site-packages/jinja2/environment.py", line 25, in <module>
from .defaults import BLOCK_END_STRING
File "/opt/conda/lib/python3.8/site-packages/jinja2/defaults.py", line 3, in <module>
from .filters import FILTERS as DEFAULT_FILTERS # noqa: F401
File "/opt/conda/lib/python3.8/site-packages/jinja2/filters.py", line 13, in <module>
from markupsafe import soft_unicode
ImportError: cannot import name 'soft_unicode' from 'markupsafe' (/opt/conda/lib/python3.8/site-packages/markupsafe/__init__.py)
这是包冲突的问题吗
from chatglm-tuning.
Related Issues (20)
- 请问大佬是否有计划可以支持下qlora? HOT 1
- 修改max_seq_length好像并没有生效? HOT 1
- 如何支持多卡跑
- 请教一个问题,data_collator中不需要实现attention mask么? HOT 2
- ChatGLM LoRA微调之后,量化quantize=8显存、推理耗时都反向增加 HOT 1
- finetune数据使用data_collator时报错 KeyError:seq_len HOT 2
- 微调语料格式转换出现乱码 HOT 1
- 请问如何读取checkpoint继续训练? HOT 1
- AttributeError: 'ChatGLMModel' object has no attribute 'lm_head' HOT 3
- 请问下如果想让模型学到某个领域的数据集,大概需要多大的数据量呢?
- 这个项目停更了吗
- 问题请教
- 问题请教:将prompt token设置为-100即可不计算loss
- [数据预处理-tokenization时报错] datasets.builder.DatasetGenerationError
- 请问这个项目支持chatglm3吗
- 请问在训练过程中输出的日志中loss、learning_rate和epoch分别代表什么含义
- 在colab上运行finetune.ipynb的时候会报一个huggingface登录的错误,有人遇到同样的错误吗? HOT 1
- 关于保存的adapter_model.bin无实际推理效果的问题 HOT 2
- 基于3af1bfd提交在3090上跑起来的requirements.txt
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from chatglm-tuning.