高级检索

    利用模式及语言学特征提高阅读理解性能

    Using Pattern and Linguistic Features to Improve Reading Comprehension Performance

    • 摘要: 阅读理解(reading comprehension,RC)任务的目的在于理解一篇文档并对提出的问题返回答案句.提出了一种充分利用外部资源来提高RC系统性能的方法,使得RC系统性能在Remedia和ChungHwa两种语料上均得到提高.特别地,在对基于Remedia语料RC系统的性能分析表明,24.1%的性能提高归因于基于Web的答案模式匹配的运用,11.1%的性能提高归因于语言学特征匹配策略运用.同时也进行了t-test,结果表明答案模式匹配、语言学特征匹配和词汇语义关联推理的运用所得到的性能提高是显著的.

       

      Abstract: A reading comprehension (RC) system aims to understand a single document (i.e. story or passage) in order to be able to automatically answer questions about it. RC resembles the ad hoc question answering (QA) task that aims to extract an answer from a collection of documents when posed with a question. However, since RC focuses only on a single document, the system needs to draw upon external knowledge sources to achieve deep analysis of passage sentences for answer sentence extraction. Proposed in this paper is an approach towards RC that attempts to utilize external knowledge to improve performance, including (i) automatic acquisition of Web-based answer patterns and its application in answer sentence matching; (ii) linguistic feature matching; (iii)lexical semantic relation inference, and (iv)context assistance. This approach gives improved RC performances for both the Remedia and ChungHwa corpora, attaining HumSent accuracies of 45% and 69% respectively. In particular, performance analysis based on Remedia shows that relative performances of 24.1% is due to the application of Web-derived answer patterns and a further 11.1% is due to linguistic feature matching. Pairwise t-tests are also conducted and the result shows that the performance improvements due to Web-derived answer patterns, linguistic feature matching and lexical semantic relation inference technique are statistically significant.

       

    /

    返回文章
    返回