    Research on Hierarchical Topic Detection in Topic Detection and Tracking

    • Abstract: Topic detection and tracking (TDT) aims to develop a family of event-based techniques for organizing information, and hierarchical topic detection (HTD) is a newly defined task within it. Through a series of large-scale evaluations, TDT has become an active research area internationally in natural language processing and, in particular, information retrieval. This paper combines natural language processing with information retrieval techniques to propose an effective single-granularity topic detection method tailored to the characteristics of events, and proposes MLCS, an algorithm based on multi-layer clustering, to organize topics into a hierarchy. The proposed method performs well: it ranked second in the HTD evaluation of TDT2004.

       
