Research on Document Grounded Conversations
-
摘要: 基于文档的对话是目前对话领域一个新兴的热点任务.与以往的任务不同,其需要将对话信息和文档信息综合进行考虑.然而,先前的工作着重考虑二者之间的关系,却忽略了对话信息中的句子对回复生成的作用具有差异性.针对这一问题,提出了一种新的辩证看待对话历史的方法,在编码阶段讨论利用历史和忽略历史2种情况进行语义信息的建模,并采用辩证整合的方式进行分支信息的汇总.由此避免了在历史信息与当前对话不相关时,其作为噪声被引入进而损害模型性能,同时也强化了当前对话对信息筛选的指导作用.实验结果表明,该模型与现有基线模型相比,能够生成更为符合当前语境且信息量更加丰富的回复,从而说明其能够更好地理解对话信息并进行知识筛选.并且通过进行消融实验,也验证了各模块在建模过程中的有效性.
-
关键词:
- 基于文档的对话 /
- 回复生成 /
- 信息筛选 /
- Transformer模型 /
- 注意力机制
Abstract: Document grounded conversations is an emerging hot task in the field of dialogue system. Different from previous tasks, it needs to consider both the utterances and the given document. However, previous work focused on the relationship between the two, but ignored the utterances’ difference in the effect of response generation. To solve this problem, a new dialectical approach to the dialogue history, which means the utterances before the last one, is proposed in this paper. At the encoding step, it divides the modeling of the semantic information into two parts: using history and ignoring history, and then uses the comparative integration method to summarize the branch results. In this way, when the dialogue history is not related to the current utterance, it can avoid being introduced as noise which will damage the performance of the model. Besides, it also strengthens the guiding role of the current utterance in the information filtering process. Experimental results show that compared with the existing baselines, this model can generate responses that are more in line with the current context and more informative, indicating that it can better understand dialogue information and conduct knowledge filtering. And through the ablation study, the effectiveness of each module in the modeling process is also verified. -
-
期刊类型引用(6)
1. 郭晓龙,牛晋宇,杜永萍. 基于树莓派的高效卷积优化方法. 计算机技术与发展. 2023(05): 96-104 . 百度学术
2. 辛明勇,祝健杨,徐长宝,姚浩,刘德宏. 基于循环神经网络的多核处理器层次化存储技术. 电子设计工程. 2023(22): 121-124+129 . 百度学术
3. 王利伟,玄志武,徐洪洲,刘学. Windows环境下遥测数据并行拼接处理方法研究. 电子设计工程. 2021(02): 10-15 . 百度学术
4. 孟慧玲,王耀彬,李凌,杨洋,王欣夷,刘志勤. TACLeBench中内核程序循环级推测并行性分析. 计算机应用. 2021(09): 2652-2657 . 百度学术
5. 于海心,王晶,李晓锋. 基于改进RMS算法的多核嵌入式系统总线周期调度表优化设计. 火炮发射与控制学报. 2021(03): 71-75 . 百度学术
6. 丁艳,张海文,孙永彦. 基于多网格技术的电网工程造价数据信息分析方法研究. 电子设计工程. 2021(19): 35-39 . 百度学术
其他类型引用(8)
计量
- 文章访问数: 572
- HTML全文浏览量: 5
- PDF下载量: 216
- 被引次数: 14