针对情境感知的自然语言的因果去偏推理方法

张大操; 张琨; 吴乐; 汪萌

doi:10.7544/issn1000-1239.202330248

针对情境感知的自然语言的因果去偏推理方法

Causal-Based Debiased Reasoning Method for Grounded Textual Entailment

摘要

摘要: 情境感知的自然语言推理任务要求模型能够根据给定情境信息判断前提句子与假设句子之间的语义推理关系. 大量的研究工作通过利用情境信息增强对输入句子的语义表征学习，取得了显著的效果. 然而，这些方法忽略了情境信息以及输入句子之间存在的虚假关联，导致模型存在泛化性及鲁棒性差的问题. 同时，已有的去偏方法未能充分考虑语义推理过程中情境信息的影响，造成情境信息利用不充分、虚假关联识别不准确的问题. 针对以上问题，通过融合因果推断方法，提出一种全新的因果去偏推理方法CBDRM（causal-based debiased reasoning method），在充分考虑情境信息的条件下，缓解模型在推理过程中受到的有偏信息的影响. 具体而言，首先通过统计分析为输入数据构建因果图，实现对输入数据中的不同变量之间的关系的准确刻画；在此基础上，利用预训练模型的有偏训练实现输入数据对预测结果的总因果效应的计算. 同时，利用因果反事实方法实现计算数据中的虚假关联所导致的直接因果效应. 通过从总因果效应中去除虚假关联所带来的直接因果效应，实现了对输入句子的语义推理关系的无偏预测. 更进一步，考虑到在语义推理过程中情境信息对语义表达的影响，设计了一个全新的对比学习模块，实现了在考虑情境信息的情况下输入文本的语义表示，进一步提升了模型的无偏推理性能. 最后，在公开数据集上进行了大量的实验验证. 实验结果充分证明了所提出的方法的有效性. 为了对无偏自然语言推理方法进行更好的评估，构建并公开了一个无偏的情境感知的自然语言推理挑战集，用于推动该领域的相关研究.

Abstract: Grounded textual entailment (GTE) requires an agent to distinguish the inference relations between premise and hypothesis sentences based on given context. While significant progress has been made to enhance representation learning by using contextual information. However, current methods overlook spurious correlations between context and input sentences, leading to poor model generalization and robustness. Moreover, existing debiasing techniques fail to fully consider the impact of contextual information on inference processes, resulting in inaccurate identification of spurious correlations. To address these issues, we propose a novel causal-based debiased reasoning method (CBDRM) that integrates causal inference methods while fully considering contextual information. Specifically, we first construct a causal graph through statistical analysis to accurately describe the relationship between different variables among the input data. Then, we calculate the total causal effect of input data on the prediction results by using a biased pre-training model. Additionally, the direct causal effect caused by spurious correlations are calculated by using counterfactual methods. By removing the direct causal effect from the total causal effect, CBDRM achieves unbiased inference relation prediction. Furthermore, we take the impact of context into consideration and design a novel contrastive learning module to improve the unbiased inference performance of CBDRM. Finally, extensive experiments over publicly available datasets demonstrate the superiority and effectiveness of our proposed CBDRM. Moreover, we construct and release an unbiased GTE challenge set to promote the related research.

HTML全文

参考文献(46)

施引文献

资源附件(0)