The e2e-joint-coref from cheniison

中文指代消解模型收敛很慢

请问在中文指代消解的时候，如果要复现论文结果，具体参数要如何设置？按照代码中默认参数，模型的损失一直无法下降，f1距离论文描述效果相去甚远。
`# best config
best_config = {
"embedding_dim": 768,
"max_span_width": 20,
# max training sentences depends on size of memery
"max_training_sentences": 11,
# max seq length
"max_seq_length": 128,
"bert_max_seq_length": 512,

"device": "cuda",
# "device": "cpu",
"checkpoint_path": "./data/checkpoint",
"lr": 0.0002,
"weight_decay": 0.0005,
"dropout": 0.3,

"report_frequency": 50,
"eval_frequency": 200,

# ontonotes dir
# 英文
# "ontonotes_root_dir": "./data/ontonotes",
# 中文零指代
"ontonotes_root_dir": "./data/ontonotes/data/files/data/chinese/annotations",
"train_file_path": "./data/train.json",
"test_file_path": "./data/test.json",
"val_file_path": "./data/val.json",

# max candidate mentions size in first/second stage
"top_unit_ratio": 0.5,
"max_top_antecedents": 50,
# use coarse to fine pruning
"coarse_to_fine": True,
# high order coref depth
"coref_depth": 2,

# FFNN config
"ffnn_depth": 1,
"ffnn_size": 3000,

# use span features, such as distance
"use_features": True,
"feature_dim": 20,
"model_heads": True,
# use metadata, such as genre and speaker info
"use_metadata": True,
"genres": ["bc", "bn", "mz", "nw", "tc", "wb"],

# 选择topk时是否考虑单元互斥
"extract_units": True,

# 指代检测损失权重
"unit_detection_loss_weight": 0.2,

# interaction among units
"use_units_interaction_before_score": True,  # 计算得分前的交互
"use_units_interaction_after_score": True,  # 计算得分后的交互
"interaction_method_after_score": "max",  # 计算得分后的交互方式max/mean

# 带权self attention
"wsa_depth": 1,
"wsa_layer_num": 1,
"wsa_dropout": 0.3,
"wsa_pwff_size": 3072,
"wsa_head_num": 8,
"wsa_lr": 0.0002,

# transformer model
# "transformer_model_name": './data/bert-base-chinese',
"transformer_model_name": 'bert-base-chinese',
"transformer_lr": 0.00001,

}
`

关于数据集

你好，感谢开源这么棒的工作。
我想问一下经过数据处理之后我的验证集有252个文档，和论文中汇报的数量对不上。。。
谢谢！

bug in `predict.py`

Hi,

我在运行你的python predict.py文件时发现了一些bug，针对测试样本的推理部分好像没有写完是吗？

cheniison / e2e-joint-coref Goto Github PK

e2e-joint-coref's People

Contributors

Stargazers

Watchers

Forkers

e2e-joint-coref's Issues

中文指代消解模型收敛很慢

关于数据集

bug in `predict.py`

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent