genezc / asgcn

Code and preprocessed dataset for EMNLP 2019 paper titled "Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks"

asgcn's Introduction

ASGCN

ASGCN - Aspect-Specific Graph Convolutional Network

Chen Zhang, Qiuchi Li and Dawei Song.

Updates

11/11/2020: I introduce a new ASTCN model which contains a bidirectional graph convolutional network over directed dependency trees.
10/5/2020: Many of you may face reproducibility issues owing to word vectors corrupted during download (glove.840B.300d.txt is a very large file). We have therefore released a trimmed version of the word embeddings for the rest14 dataset as a pickled file, along with the vocabulary, so that you can verify reproducibility.
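
A quick way to check whether a downloaded embedding file is corrupted is to verify that every line ends with exactly the expected number of float values. The sketch below is not part of this repository: `check_glove_lines` and its parameters are hypothetical names, and it assumes the usual GloVe text layout of a token followed by space-separated floats. Because a few tokens in the 840B file are reported to contain spaces, the vector is read from the right-hand end of each line.

```python
from typing import Iterable, List, Tuple


def check_glove_lines(lines: Iterable[str], dim: int = 300) -> Tuple[int, List[int]]:
    """Return (number of well-formed lines, 1-based numbers of malformed lines)."""
    good = 0
    bad: List[int] = []
    for lineno, line in enumerate(lines, start=1):
        parts = line.rstrip().split(" ")
        values = parts[-dim:]            # the last `dim` fields should be the vector
        token = " ".join(parts[:-dim])   # everything before them is the token
        try:
            vector = [float(v) for v in values]
        except ValueError:
            vector = []                  # a non-numeric field marks the line as malformed
        if token and len(vector) == dim:
            good += 1
        else:
            bad.append(lineno)
    return good, bad


sample = ["good 0.1 0.2 0.3", "trunc 0.1 0.2", "bad 0.1 x 0.3"]
print(check_glove_lines(sample, dim=3))  # (1, [2, 3])
```

For a real file, the same function can be given an open file handle, since iterating over a text file yields its lines one at a time.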

Requirements

Python 3.6
PyTorch 1.0.0
SpaCy 2.0.18
numpy 1.15.4

Usage

Install SpaCy package and language models with

pip install spacy

and

python -m spacy download en

Generate graph data with

python dependency_graph.py

Download pretrained GloVe embeddings with this link and extract glove.840B.300d.txt into glove/.
Train with the command below; optional arguments can be found in train.py

python train.py --model_name asgcn --dataset rest14 --save True

Infer with infer.py

Model

We propose to build a Graph Convolutional Network (GCN) over the dependency tree of a sentence to exploit syntactic information and word dependencies. Based on it, a novel aspect-specific sentiment classification framework is proposed.

An overview of our proposed model is given below

Citation

If you use the code in your paper, please kindly star this repo and cite our paper

@inproceedings{zhang-etal-2019-aspect, 
    title = "Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks", 
    author = "Zhang, Chen and Li, Qiuchi and Song, Dawei", 
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)", 
    month = nov, year = "2019", 
    address = "Hong Kong, China", 
    publisher = "Association for Computational Linguistics", 
    url = "https://www.aclweb.org/anthology/D19-1464", 
    doi = "10.18653/v1/D19-1464", 
    pages = "4560--4570",
}

Credits

Code of this repo heavily relies on ABSA-PyTorch, in which I am one of the contributors.
For any issues or suggestions about this work, don't hesitate to create an issue or directly contact me via [email protected] !

asgcn's Issues

After three repeated runs, the metrics get worse each time?

repeat: 1
max_test_acc: 0.887987012987013, max_test_f1: 0.7024944579664142
repeat: 2
max_test_acc: 0.887987012987013, max_test_f1: 0.6555064644327216
repeat: 3
max_test_acc: 0.8814935064935064, max_test_f1: 0.5690872648781992
max_test_acc_avg: 0.8858225108225107, max_test_f1_avg: 0.642362729092445
The metrics get worse with each run. What is the most likely cause?

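Selecting the best model

Hello. In this code, model selection during training does not seem to use a validation set. Instead, the model is evaluated on the test set after every round of training, and test-set performance decides when to stop training. Is this approach reasonable? Thanks~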
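
One common alternative to selecting checkpoints on the test set is to hold out part of the training data as a validation set and pick the epoch by its validation score. The sketch below is not taken from train.py: `split_train_val` and `select_best_epoch` are hypothetical helpers, and the validation ratio and seed are arbitrary choices.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_train_val(examples: Sequence[T], val_ratio: float = 0.1,
                    seed: int = 0) -> Tuple[List[T], List[T]]:
    """Shuffle a copy of the training examples and hold out a fraction for validation."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)          # seeded so the split is repeatable
    n_val = max(1, int(len(shuffled) * val_ratio))
    return shuffled[n_val:], shuffled[:n_val]


def select_best_epoch(val_scores: Sequence[float]) -> int:
    """Return the 0-based index of the epoch with the highest validation score."""
    return max(range(len(val_scores)), key=lambda i: val_scores[i])


train, val = split_train_val(list(range(100)), val_ratio=0.1)
best = select_best_epoch([0.70, 0.74, 0.73])       # hypothetical validation accuracies
print(len(train), len(val), best)
```

With this setup the test set is evaluated only once, using the checkpoint from the selected epoch.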

Hello, why is a mask used?

I would like to ask why a mask is used so that only the aspect words are kept. What is the consideration behind this? Thanks!

When I changed the network to "ASCNN", the following error appeared when running train.py, how to solve it?

Traceback (most recent call last):
File "C:/Users/3403/PycharmProjects/ASGCN-master/train.py", line 226, in
ins.run()
File "C:/Users/3403/PycharmProjects/ASGCN-master/train.py", line 149, in run
max_test_acc, max_test_f1 = self._train(criterion, optimizer)
File "C:/Users/3403/PycharmProjects/ASGCN-master/train.py", line 74, in _train
outputs = self.model(inputs)
File "C:\Anaconda\Anaconda3\envs\pt\lib\site-packages\torch\nn\modules\module.py", line 550, in call
result = self.forward(*input, **kwargs)
File "C:\Users\3403\PycharmProjects\ASGCN-master\models\ascnn.py", line 84, in forward
x = F.relu(self.conv1(self.position_weight(text_out, aspect_double_idx, text_len, aspect_len).transpose(1, 2))) #whs
File "C:\Anaconda\Anaconda3\envs\pt\lib\site-packages\torch\nn\modules\module.py", line 550, in call
result = self.forward(*input, **kwargs)
File "C:\Anaconda\Anaconda3\envs\pt\lib\site-packages\torch\nn\modules\conv.py", line 208, in forward
self.padding, self.dilation, self.groups)
RuntimeError: Expected object of scalar type Double but got scalar type Float for argument #3 'mat1' in call to th_addmm

Process finished with exit code 1

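Model sharing request

Your model is an outstanding representative of GloVe-based models. Would you be interested in porting it to https://github.com/yangheng95/PyABSA/tree/release/pyabsa/tasks/glove_apc/models ? Some current ABSA repositories are not maintained promptly and often throw errors, so I spent some time building PyABSA, whose main goal is to address ease of use. However, with limited time and energy, I ran into some problems when porting the model.
Would you be willing to port your model to PyABSA? Thank you, and best wishes!

There may be some errors in the dataset

Thanks to the author for providing the rest15 and rest16 datasets. I found some errors in them: the symbol $ t$ appears in many sentences, which happens when a sentence contains multiple aspects. I think it may be a problem in the dataset processing code. I modified it locally, but I am not sure whether I introduced other errors. If you have time, could you take a look?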

Formulation 5 in your paper

Dear @GeneZC ,
I am sorry for disturbing you, but I wonder about the formulation 5 about the function F() in your paper. You use this function for calculating the position-aware weight. I understand that when r+1 <= i <= r+m, q_i should be 1, instead of 0 as in your paper?
Thank you very much for your time!
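
The piecewise weighting discussed in this question can be sketched as follows. The sketch assumes the form described above: zero weight on the aspect positions r+1 to r+m, and a weight that decays linearly with distance from the aspect elsewhere. The exact decay terms are an assumption and should be checked against Equation 5 in the paper. `position_weights` is a hypothetical helper, not a function from this repository.

```python
from typing import List


def position_weights(n: int, r: int, m: int) -> List[float]:
    """Weights q_1..q_n for n tokens whose aspect covers positions r+1..r+m (1-based)."""
    weights = []
    for i in range(1, n + 1):
        if i < r + 1:              # context to the left of the aspect
            q = 1 - (r + 1 - i) / n
        elif i <= r + m:           # aspect tokens receive zero weight
            q = 0.0
        else:                      # context to the right of the aspect
            q = 1 - (i - r - m) / n
        weights.append(q)
    return weights


# Six tokens, aspect at positions 3 and 4.
print(position_weights(n=6, r=2, m=2))
```

Under this reading, the aspect tokens themselves are suppressed and the nearest context words receive the largest weights.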

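Which part of the input does left_indices refer to?

Adjacency matrix in the graph convolutional network

In the graph convolutional network formula, why is it d_i + 1? Why add 1 to the degree? Thanks~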
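
One effect of dividing by d_i + 1 rather than d_i is that the denominator can never be zero, even for a node with no edges (for example, a zero-padded row of the adjacency matrix). Whether this is the authors' motivation is not stated here. The sketch only illustrates the arithmetic: `normalized_aggregate` is a hypothetical helper, not a function from this repository, and the learned weights and bias are omitted.

```python
from typing import List

Matrix = List[List[float]]


def normalized_aggregate(adj: Matrix, features: Matrix) -> Matrix:
    """Sum neighbour features for each node and divide by (degree + 1)."""
    dim = len(features[0])
    out = []
    for row in adj:
        degree = sum(row)
        summed = [sum(row[j] * features[j][k] for j in range(len(row)))
                  for k in range(dim)]
        out.append([s / (degree + 1) for s in summed])  # never divides by zero
    return out


# The last row is all zeros, as a padded position would be.
adj = [[1, 1, 0],
       [1, 1, 0],
       [0, 0, 0]]
features = [[2.0, 4.0], [4.0, 8.0], [0.0, 0.0]]
print(normalized_aggregate(adj, features))  # [[2.0, 4.0], [2.0, 4.0], [0.0, 0.0]]
```

Dividing by the plain degree would raise a ZeroDivisionError on the last row.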

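Question about the loss function

In the loss function in your paper, is c each review in the dataset C? Is p(hat) the ground-truth label of each c? Does Pp(hat) denote the probability of the p(hat)-th class? It may also be that my translation is inaccurate. Your code uses criterion = nn.CrossEntropyLoss(), and I have not figured out whether these two formulas are the same.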
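
For reference, PyTorch documents nn.CrossEntropyLoss as a log-softmax followed by a negative log-likelihood, averaged over the batch by default. In other words, it is the mean of the negative log-probability that the model assigns to each example's true class. The pure-Python sketch below spells out that arithmetic for the classification term only. `cross_entropy` is a hypothetical helper written for illustration, not code from this repository.

```python
import math
from typing import Sequence


def cross_entropy(logits: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Mean of -log softmax(logits)[label] over the batch."""
    total = 0.0
    for row, label in zip(logits, labels):
        peak = max(row)  # subtract the max before exponentiating, for numerical stability
        log_norm = peak + math.log(sum(math.exp(x - peak) for x in row))
        total += log_norm - row[label]   # equals -log(probability of the true class)
    return total / len(labels)


# Three equally likely classes: the loss equals log(3).
print(cross_entropy([[0.0, 0.0, 0.0]], [1]))
```

Here each row of `logits` holds the unnormalized class scores for one example, and `labels` holds the index of its true class.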

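Dataset question

Is there any data processing code, that is, the preprocessing code that converts the original XML data into the data format used in your repository?

dataset

Hello, how is the data in dataset converted to .raw? I don't quite understand it.

The .graph and .tree files in the project

Hello, in the .graph and .tree files of the project, what do 0 and 1 mean? And what is the difference between the .graph and .tree files?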

result

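Hello, I ran the source code following your steps, but the results did not reach those reported in the paper. I used the command you provided, python train.py --model_name asgcn --dataset rest14 --save True. Before running it I downloaded the GloVe data. Since you already provide the constructed graphs, I ran the command directly. Is there anything else I need to change? I would appreciate a reply if convenient. Thanks.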

How should I change the data?

I noticed that your work provides data such as .raw. If I want to replace it with my own related data set, what are the requirements? How should I replace other datasets?
Looking forward to your reply, thank you

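About the .graph files in the source code

Hello, the paper mentions two models, ASGCN-DG and ASGCN-DT, which differ in their adjacency matrices. dependency_graph.py is presumably used to generate and store the adjacency matrices, and the source code also provides the resulting adjacency matrix files, such as laptop_test.raw.graph. I would like to know whether they correspond to ASGCN-DG or ASGCN-DT.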
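
As a rough illustration of the distinction, assuming DG denotes an undirected dependency graph and DT a directed dependency tree, the undirected variant yields a symmetric matrix while the directed one keeps a single direction per edge. The sketch below does not reproduce dependency_graph.py. The head-index input, the head-to-dependent direction, and the self-loops are assumptions made for the example, and `adjacency_from_heads` is a hypothetical name. In such a matrix, 1 marks a connected pair of words and 0 an unconnected pair.

```python
from typing import List


def adjacency_from_heads(heads: List[int], directed: bool) -> List[List[int]]:
    """Build a 0/1 adjacency matrix from head indices; heads[i] == i marks the root."""
    n = len(heads)
    matrix = [[0] * n for _ in range(n)]
    for child, head in enumerate(heads):
        matrix[child][child] = 1          # every word is adjacent to itself
        if head != child:
            matrix[head][child] = 1       # edge from head to dependent
            if not directed:
                matrix[child][head] = 1   # mirror the edge for the undirected graph
    return matrix


# Hypothetical parse of "food was great": "was" (index 1) is the root.
heads = [1, 1, 1]
print(adjacency_from_heads(heads, directed=False))  # [[1, 1, 0], [1, 1, 1], [0, 1, 1]]
print(adjacency_from_heads(heads, directed=True))   # [[1, 0, 0], [1, 1, 1], [0, 0, 1]]
```

Checking whether a stored matrix is symmetric is therefore one way to tell which variant a given file holds, under the same assumptions.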

RuntimeError: cuDNN error: CUDNN_STATUS_EXECUTION_FAILED

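Which version mismatch is likely to cause this error? RuntimeError: cuDNN error: CUDNN_STATUS_EXECUTION_FAILED

Graph generation

Hello author, what hardware did you use to generate the dependency tree graphs? I switched to another dataset and the memory requirement is too large.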

left_indices

May I ask what left_indices means?

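Text length of the model input

Why is the length of the text fed into the model different each time? Is it possible to change the code so that the input text length is fixed?

If so, which part of the source code should I modify? If it is a lot of trouble, there is no need to be very specific; just point me to the general area and I will make the changes myself.

Thanks for your answer.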
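
Variable input length often arises when each batch is padded only to its own longest sentence. A fixed length can be obtained by padding or truncating every token sequence, and its adjacency matrix, to one chosen size before batching. The helpers below are hypothetical and not part of this repository; `max_len` and the padding id 0 are assumed values.

```python
from typing import List


def pad_or_truncate(token_ids: List[int], max_len: int, pad_id: int = 0) -> List[int]:
    """Return token_ids cut or right-padded to exactly max_len items."""
    return token_ids[:max_len] + [pad_id] * max(0, max_len - len(token_ids))


def pad_adjacency(adj: List[List[int]], max_len: int) -> List[List[int]]:
    """Cut or zero-pad a square adjacency matrix to max_len x max_len."""
    rows = [pad_or_truncate(row, max_len) for row in adj[:max_len]]
    rows += [[0] * max_len for _ in range(max_len - len(rows))]  # add all-zero rows
    return rows


print(pad_or_truncate([5, 8, 2], max_len=5))        # [5, 8, 2, 0, 0]
print(pad_adjacency([[1, 1], [1, 1]], max_len=3))   # [[1, 1, 0], [1, 1, 0], [0, 0, 0]]
```

Truncation discards words beyond `max_len`, so the chosen length should cover the longest sentence, and any aspect span, in the dataset.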

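How was the data in Figure 4 of your paper split by the number of aspects?

Which versions of spaCy and en_core_web_sm were used to generate the .graph files in the first version of asgcn (released 2 years ago)?

I would like to ask the author which versions of spaCy and en_core_web_sm were used to generate the .graph files in the first version of asgcn (released 2 years ago). The model version has quite a large impact on accuracy.

Hello, how was the colored attention visualization table (Table 4) in the paper drawn?

Question about the SemEval2014 dataset

With the laptops and restaurants datasets from SemEval2014, does this work address subtask 2 of SemEval2014 Task 4?

That is, classification of aspect term polarity rather than aspect category polarity?

Thank you for your answer.

My ASCNN results on the LAP14 dataset are much higher than those reported in your paper.

Hello, when I reproduced your experiments, most results were almost identical to yours. On LAP14, however, my results with the ASCNN model are much higher than those in your paper (paper: Acc=72.62, F1=66.72; mine: Acc=75.13, F1=70.73). I did not change the hyperparameters and used the default values in train.py. What do you think is the reason for this?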

How to generate the dependency tree for German or Chinese

Thanks for sharing, it's really helpful. Could you tell me how to generate the dependency tree for German or Chinese? Your code only processes English datasets. Thanks again ^_^