• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

基于复合结构的知识库分类体系匹配方法

林海伦, 贾岩涛, 王元卓, 靳小龙, 程学旗, 王伟平

林海伦, 贾岩涛, 王元卓, 靳小龙, 程学旗, 王伟平. 基于复合结构的知识库分类体系匹配方法[J]. 计算机研究与发展, 2017, 54(1): 50-62. DOI: 10.7544/issn1000-1239.2017.20150843
引用本文: 林海伦, 贾岩涛, 王元卓, 靳小龙, 程学旗, 王伟平. 基于复合结构的知识库分类体系匹配方法[J]. 计算机研究与发展, 2017, 54(1): 50-62. DOI: 10.7544/issn1000-1239.2017.20150843
Lin Hailun, Jia Yantao, Wang Yuanzhuo, Jin Xiaolong, Cheng Xueqi, Wang Weiping. A Composite Structure Based Method for Knowledge Base Taxonomy Matching[J]. Journal of Computer Research and Development, 2017, 54(1): 50-62. DOI: 10.7544/issn1000-1239.2017.20150843
Citation: Lin Hailun, Jia Yantao, Wang Yuanzhuo, Jin Xiaolong, Cheng Xueqi, Wang Weiping. A Composite Structure Based Method for Knowledge Base Taxonomy Matching[J]. Journal of Computer Research and Development, 2017, 54(1): 50-62. DOI: 10.7544/issn1000-1239.2017.20150843
林海伦, 贾岩涛, 王元卓, 靳小龙, 程学旗, 王伟平. 基于复合结构的知识库分类体系匹配方法[J]. 计算机研究与发展, 2017, 54(1): 50-62. CSTR: 32373.14.issn1000-1239.2017.20150843
引用本文: 林海伦, 贾岩涛, 王元卓, 靳小龙, 程学旗, 王伟平. 基于复合结构的知识库分类体系匹配方法[J]. 计算机研究与发展, 2017, 54(1): 50-62. CSTR: 32373.14.issn1000-1239.2017.20150843
Lin Hailun, Jia Yantao, Wang Yuanzhuo, Jin Xiaolong, Cheng Xueqi, Wang Weiping. A Composite Structure Based Method for Knowledge Base Taxonomy Matching[J]. Journal of Computer Research and Development, 2017, 54(1): 50-62. CSTR: 32373.14.issn1000-1239.2017.20150843
Citation: Lin Hailun, Jia Yantao, Wang Yuanzhuo, Jin Xiaolong, Cheng Xueqi, Wang Weiping. A Composite Structure Based Method for Knowledge Base Taxonomy Matching[J]. Journal of Computer Research and Development, 2017, 54(1): 50-62. CSTR: 32373.14.issn1000-1239.2017.20150843

基于复合结构的知识库分类体系匹配方法

基金项目: 国家“九七三”重点基础研究发展计划基金项目(2012CB316303,2013CB329602);“核高基”国家科技重大专项(2013ZX01039-002-001-001);国家自然科学基金项目(61303056,61402464,61402442,61572469,61502478);北京市自然科学基金项目(4154086) This work was supported by the National Basic Research Program of China (973 Program) (2012CB316303, 2013CB329602), the National Science and Technology Major Projects of Hegaoji (2013ZX01039-002-001-001), the National Natural Science Foundation of China (61303056, 61402464, 61402442, 61572469, 61502478), and the Beijing Natural Science Foundation (4154086).
详细信息
  • 中图分类号: TP391

A Composite Structure Based Method for Knowledge Base Taxonomy Matching

  • 摘要: 近年来,分类体系匹配由于其在知识库构建和融合等方面的广泛应用,已成为国内外工业界和学术界的研究热点.然而,随着网络大数据的不断发展,分类体系变得越来越庞大和复杂,构造一种通用有效的分类体系匹配器以适应大规模、异构分类体系匹配的扩展性仍然面临很大的挑战.为此,提出了一种基于复合结构的分类体系匹配方法BiMWM,该方法利用分类体系中分类的复合结构信息:微观结构和宏观结构,将分类体系匹配问题转化为二部图上的优化问题进行求解.首先,创建赋权的二部图建模分类体系之间候选的匹配类对关系;然后,通过计算二部图上的最大权匹配剪枝选择最优的分类体系的匹配类对.BiMWM方法可以在多项式时间内为2个分类体系产生最优匹配.实验结果表明:与当前先进的基准方法相比,该方法能够有效提升大规模、异构分类体系匹配的性能.
    Abstract: Taxonomy matching, i.e., an operation of taxonomy merging across different knowledge bases, which aims to align common elements between taxonomies, has been extensively studied in recent years due to its wide applications in knowledge base population and proliferation. However, with the continuous development of network big data, taxonomies are becoming larger and more complex, and covering different domains. Therefore, to pose an effective and general matching strategy covering cross-domain or large-scale taxonomies is still a considerable challenge. In this paper, we presents a composite structure based matching method, named BiMWM, which exploits the composite structure information of class in taxonomy, including not only the micro-structure but also the macro-structure. BiMWM models the taxonomy matching problem as an optimization problem on a bipartite graph. It works in two stages: it firstly creates a weighted bipartite graph to model the candidate matched classes pairs between two taxonomies, then performs a maximum weight matching algorithm to generate an optimal matching for two taxonomies in a global manner. BiMWM runs in polynomial time to generate an optimal matching for two taxonomies. Experimental results show that our method outperforms the state-of-the-art baseline methods, and performs good adaptability in different domains and scales of taxonomies.
计量
  • 文章访问数:  1366
  • HTML全文浏览量:  1
  • PDF下载量:  553
  • 被引次数: 0
出版历程
  • 发布日期:  2016-12-31

目录

    /

    返回文章
    返回