Comments (9)
RuntimeError: Error(s) in loading state_dict for NormalNerModel: size mismatch for linear.weight: copying a param with shape torch.Size([33, 256]) from checkpoint, the shape in current model is torch.Size([25, 256]). size mismatch for linear.bias: copying a param with shape torch.Size([33]) from checkpoint, the shape in current model is torch.Size([25]). size mismatch for crf.start_transitions: copying a param with shape torch.Size([33]) from checkpoint, the shape in current model is torch.Size([25]). size mismatch for crf.end_transitions: copying a param with shape torch.Size([33]) from checkpoint, the shape in current model is torch.Size([25]). size mismatch for crf.transitions: copying a param with shape torch.Size([33, 33]) from checkpoint, the shape in current model is torch.Size([25, 25]).
实体的标签数目不对。
from pytorch_bert_bilstm_crf_ner.
from pytorch_bert_bilstm_crf_ner.
请问要怎么解决呀!我是小白一枚,实在不知道改哪里! 发自我的iPhone
…
------------------ 原始邮件 ------------------ 发件人: 西西嘛呦 @.> 发送时间: 2022年11月17日 19:30 收件人: taishan1994/pytorch_bert_bilstm_crf_ner @.> 抄送: shitft @.>, Author @.> 主题: 回复:[taishan1994/pytorch_bert_bilstm_crf_ner] 作者大大您好!我在训练我自己的数据时,出现了以下的问题,我尝试着用pop函数来解决,但始一直解决不了,麻烦你帮我看看问题所在!! (Issue #26) RuntimeError: Error(s) in loading state_dict for NormalNerModel: size mismatch for linear.weight: copying a param with shape torch.Size([33, 256]) from checkpoint, the shape in current model is torch.Size([25, 256]). size mismatch for linear.bias: copying a param with shape torch.Size([33]) from checkpoint, the shape in current model is torch.Size([25]). size mismatch for crf.start_transitions: copying a param with shape torch.Size([33]) from checkpoint, the shape in current model is torch.Size([25]). size mismatch for crf.end_transitions: copying a param with shape torch.Size([33]) from checkpoint, the shape in current model is torch.Size([25]). size mismatch for crf.transitions: copying a param with shape torch.Size([33, 33]) from checkpoint, the shape in current model is torch.Size([25, 25]). 实体的标签数目不对。 — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: @.***>
nor_ent2id.json里面有多少个标签就设置num_tags=多少
from pytorch_bert_bilstm_crf_ner.
from pytorch_bert_bilstm_crf_ner.
又来打扰您啦!
我在main.sh和bert_ner_model中修改了mum=tags的参数,没有解决问题。我是在windows中运行项目的,下载了git后在pycharm终端已经能正常运行main.sh文件了,但是一运行main.py还是会出现size mismatch的错误!
您有空时帮我看看
from pytorch_bert_bilstm_crf_ner.
又来打扰您啦! 我在main.sh和bert_ner_model中修改了mum=tags的参数,没有解决问题。我是在windows中运行项目的,下载了git后在pycharm终端已经能正常运行main.sh文件了,但是一运行main.py还是会出现size mismatch的错误! 您有空时帮我看看
不要在pycharm里面直接运行,将指令复制出来在终端里面运行。
from pytorch_bert_bilstm_crf_ner.
作者大大,我在将model_name设为bilstm、idcnn、crf训练时会出现model.pt不生成的问题,之前用您的数据训练时,也出现了checkpoints下有的模型有.pt文件,有的没有的问题,请问是因为什么呐
另外就是,我的显卡是rtx2060s 8G的,我把model_name修改为bert,Albert等模型时,就会报错说我的内存不足,我看了csdn上的几个解决策略,都没能解决我目前的问题!请问除了换显卡,还能通过其他方法来解决吗
from pytorch_bert_bilstm_crf_ner.
作者大大,我在将model_name设为bilstm、idcnn、crf训练时会出现model.pt不生成的问题,之前用您的数据训练时,也出现了checkpoints下有的模型有.pt文件,有的没有的问题,请问是因为什么呐
另外就是,我的显卡是rtx2060s 8G的,我把model_name修改为bert,Albert等模型时,就会报错说我的内存不足,我看了csdn上的几个解决策略,都没能解决我目前的问题!请问除了换显卡,还能通过其他方法来解决吗
1、打印下train里面保存模型的那里看看是否有执行保存模型。train里面有个eval_steps,如果总的step小于它是不会有模型生成的。
2、调小train_batch_size和eval_batch_size直到显存够用为止。
from pytorch_bert_bilstm_crf_ner.
from pytorch_bert_bilstm_crf_ner.
Related Issues (20)
- 大佬您好,使用您的cner任务,使用的是最初始的参数,use crf打开后,训练89轮报错,无法进行eval和保存模型 HOT 14
- 大哥你好。我想问一下File "/home/vrlab/wwt/pytorch_bert_bilstm_crf_ner-main/bert_base_model.py", line 11, in __init__ assert os.path.exists(bert_dir) and os.path.exists(config_path), \ AssertionError: pretrained bert file does not exist是为啥 HOT 5
- 运行问题 HOT 6
- 您好 HOT 9
- 大佬您好,可以帮忙看看这个bug吗 HOT 13
- 换成BIO类型的数据应该怎么做? HOT 7
- 导出onnx问题 Error(s) in loading state_dict for BertNerModel: Unexpected key(s) in state_dict: "linear.weight", "linear.bias". HOT 2
- 对BERT模型进行继续预训练对提高性能是否有帮助? HOT 4
- 训练自己的数据,内存占用一直增长,直到吃满内存 HOT 2
- 请问一小pkl文件怎么处理得到? HOT 1
- > 我加你qq吧,你说下。 HOT 2
- 关于使用CRF文件将BERT+CRF模型转换为ONNX的问题 HOT 6
- 网页问题 HOT 1
- 英文实体识别的问题 HOT 1
- RuntimeError: expected predicate to be bool, got torch.uint8 HOT 1
- Albert问题 HOT 2
- AssertionError: pretrained bert file does not exist HOT 1
- 更换数据集后报CUDA error: device-side assert triggered HOT 1
- 求一份分词数据集 HOT 1
- 我的checkpoints文件夹是空的 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pytorch_bert_bilstm_crf_ner.