ISSN 1000-1239 CN 11-1777/TP

• 人工智能 •

### 基于图注意力网络的因果关系抽取

1. 1(吉林大学计算机科学与技术学院 长春 130012);2(符号计算与知识工程教育部重点实验室(吉林大学) 长春 130012) (xujh17@mails.jlu.edu.cn)
• 出版日期: 2020-01-01
• 基金资助:
国家自然科学基金项目(61976103，61872161)；吉林省技术攻关项目(20190302029GX)；吉林省自然科学基金项目(20180101330JC，2018101328JC)；吉林省发改委项目(2019C053-8)

### Causal Relation Extraction Based on Graph Attention Networks

Xu Jinghang1, Zuo Wanli1,2, Liang Shining1, Wang Ying1,2

1. 1(College of Computer Science and Technology, Jilin University, Changchun 130012);2(Key Laboratory of Symbol Computation and Knowledge Engineering (Jilin University), Ministry of Education, Changchun 130012)
• Online: 2020-01-01
• Supported by:
This work was supported by the National Natural Science Foundation of China(61976103, 61872161), the Project of Technical Tackle-key-problem of Jilin Province of China(20190302029GX), the Natural Science Foundation of Jilin Province of China(20180101330JC, 2018101328JC), and the Project of the Development and Reform Commission of Jilin Province (2019C053-8).

Abstract: Causality represents a kind of correlation between cause and effect, where the happening of cause will leads to the happening of effect. As the most important type of relationship between entities, causality plays a vital role in many fields such as automatic reasoning and scenario generation. Therefore, extracting causal relation becomes a basic task in natural language processing and text mining. Different from traditional text classification methods or relation extraction methods, this paper proposes a sequence labeling method to extract causal entity in text and identify direction of causality, without relying on feature engineering or causal background knowledge. The main contributions of this paper can be summarized as follows: 1) we extend syntactic dependency tree to the syntactic dependency graph, adopt graph attention networks in natural language processing, and introduce the concept of S-GAT(graph attention network based on syntactic dependency graph); 2) Bi-LSTM+CRF+S-GAT model for causal extraction is proposed, which generates causal label of each word in sentence based on input word vectors; 3) SemEval data set is modified and extended, and rules are defined to relabel experimental data with an aim of overcoming defects of the original labeling method. Extensive experiments are conducted on the expanded SemEval dataset, which shows that our model achieves 0.064 improvement over state-of-the-art model Bi-LSTM+CRF+self-ATT in terms of prediction accuracy.