• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

古诗词图谱的构建及分析研究

刘昱彤, 吴斌, 白婷

刘昱彤, 吴斌, 白婷. 古诗词图谱的构建及分析研究[J]. 计算机研究与发展, 2020, 57(6): 1252-1268. DOI: 10.7544/issn1000-1239.2020.20190641
引用本文: 刘昱彤, 吴斌, 白婷. 古诗词图谱的构建及分析研究[J]. 计算机研究与发展, 2020, 57(6): 1252-1268. DOI: 10.7544/issn1000-1239.2020.20190641
Liu Yutong, Wu Bin, Bai Ting. The Construction and Analysis of Classical Chinese Poetry Knowledge Graph[J]. Journal of Computer Research and Development, 2020, 57(6): 1252-1268. DOI: 10.7544/issn1000-1239.2020.20190641
Citation: Liu Yutong, Wu Bin, Bai Ting. The Construction and Analysis of Classical Chinese Poetry Knowledge Graph[J]. Journal of Computer Research and Development, 2020, 57(6): 1252-1268. DOI: 10.7544/issn1000-1239.2020.20190641
刘昱彤, 吴斌, 白婷. 古诗词图谱的构建及分析研究[J]. 计算机研究与发展, 2020, 57(6): 1252-1268. CSTR: 32373.14.issn1000-1239.2020.20190641
引用本文: 刘昱彤, 吴斌, 白婷. 古诗词图谱的构建及分析研究[J]. 计算机研究与发展, 2020, 57(6): 1252-1268. CSTR: 32373.14.issn1000-1239.2020.20190641
Liu Yutong, Wu Bin, Bai Ting. The Construction and Analysis of Classical Chinese Poetry Knowledge Graph[J]. Journal of Computer Research and Development, 2020, 57(6): 1252-1268. CSTR: 32373.14.issn1000-1239.2020.20190641
Citation: Liu Yutong, Wu Bin, Bai Ting. The Construction and Analysis of Classical Chinese Poetry Knowledge Graph[J]. Journal of Computer Research and Development, 2020, 57(6): 1252-1268. CSTR: 32373.14.issn1000-1239.2020.20190641

古诗词图谱的构建及分析研究

基金项目: 国家重点研发计划项目(2018YFC0831500);国家自然科学基金项目(U1936220,61972047)
详细信息
  • 中图分类号: TP391

The Construction and Analysis of Classical Chinese Poetry Knowledge Graph

Funds: This work was supported by the National Key Research and Development Program of China (2018YFC0831500) and the National Natural Science Foundation of China (U1936220, 61972047).
  • 摘要: 古诗词是中国宝贵的文化遗产.利用计算机对诗词进行辅助研究,对语言、文学、传承普及中华文化,具有重要意义.然而,关于诗词的知识是高度碎片化的,原因是互联网上的诗词知识,不仅存在于诗词本身,还分布于诗词的各种解读资料,比如诗词的注释、译文、赏析等.若以知识图谱的方式,捕捉古诗词中词语之间潜在的语义联系并将它们以知识的方式关联起来,能够将诗词碎片化的知识有条理地整合在一起,从而更好地对古诗词知识进行推理和分析.基于此,提出了一种古诗词知识图谱的构建方法.构建图谱的节点时,首先利用改进的Apriori算法产生诗词中的候选词,然后检验候选词是否出现在诗词注释和中文词典中,从而判断其是否构成图谱节点.构建图谱的边时,首先利用注释信息在词语之间建立语义联系,然后用人工构建的诗词分类体系在抽象的语义之间建立联系.最终得到一个内容覆盖全面且包含多层词语语义联系的古诗词图谱.古诗词图谱可用于对诗词各种不同维度的分析研究,相比于基于字的数据分析,利用古诗词图谱能够从语义的角度更加深入具体地辅助文学研究.以唐诗为例,说明了古诗词图谱在诗词分析中的必要性.此外,古诗词图谱还适用于各种关于诗词的推理和分析任务,以判定诗词题材和分析诗词情感这2个任务为例,证明了古诗词图谱的有效性和应用价值.
    Abstract: Classical Chinese poetry is a precious cultural heritage. It is significant to use the rich information in classical Chinese poetry to further investigate the language, literature and historical development of Chinese culture. However, the knowledge of classical Chinese poetry is highly fragmented. It not only exists in poetry itself, but also is widely distributed in the materials which are used to explain poetry, such as annotations, translations, appreciations, etc. Our aim is to obtain the potential semantic relationship between words and expressions, and use knowledge graph to link them. By doing this, we could integrate fragmented knowledge in a systematic way, which enables us to achieve better reasoning and analysis of classical Chinese poetry knowledge. In this paper, we propose a method to construct classical Chinese poetry knowledge graph (CCP-KG). About building nodes of CCP-KG, we use the improved Apriori algorithm to generate candidate words, then check if the candidate word appears in the annotations to determine when it can be a node of CCP-KG. About building edges of CCP-KG, the semantic relationship between words is established through the annotations, then we use the artificially constructed classical Chinese poetry hierarchical structure to establish the relationship between abstract semantics. Finally, we obtain CCP-KG, which covers every aspect of classical Chinese poetry and contains multi-layer semantic links between words. Taking Tang poetry as an example, CCP-KG can be used to analysis classical Chinese poems in different dimensions. Compared with character-based data analysis, the use of CCP-KG assists literary research more in-depth from the perspective of semantics. Therefore, CCP-KG is necessary in analyzing classical Chinese poems. In addition, CCP-KG can also be applied to various tasks like reasoning and analysis in classical Chinese poetry. We conduct experiments on the tasks of determining the theme of poetry and analyzing the emotion of poetry respectively, showing the effectiveness and application value of our constructed CCP-KG.
  • 期刊类型引用(10)

    1. 贺岩,潘俊杰. 基于Neo4j的太湖流域诗词知识图谱构建研究. 电脑编程技巧与维护. 2025(02): 145-148 . 百度学术
    2. 张强,高劲松,龙家庆,杨晓燕,夏红玉,蒋智慧. 基于知识重构的词人时空情感轨迹可视化研究——以辛弃疾为例. 情报学报. 2023(06): 729-739 . 百度学术
    3. 王亚楠. 镇江“大运河”主题诗词文化资源的组织性建构. 文化创新比较研究. 2023(18): 1-7 . 百度学术
    4. 宋雪雁,罗慧,杨芳芳. 知识重组视域下《全唐诗》送别诗的时空结构研究. 图书情报工作. 2023(20): 15-24 . 百度学术
    5. 宋雪雁,罗慧,杨芳芳. 《全唐诗》送别诗诗人社交网络分析. 兰台世界. 2023(12): 43-48+52 . 百度学术
    6. 宋雪雁,霍晓楠,刘寅鹏,邓君. 数字人文视角下《全唐诗》贬谪诗人社会关系研究. 现代情报. 2022(02): 14-21 . 百度学术
    7. 欧阳子薇,柳雨欣,于娜. 以弘扬古诗词文化为主题的移动应用设计研究. 包装工程. 2022(04): 197-202 . 百度学术
    8. 司莉,郭财强. 基于内容分析的数字人文领域中知识组织价值体现研究综述. 图书情报工作. 2022(13): 127-137 . 百度学术
    9. 张卫,王昊,李晓敏,Song Min. 数字人文视角下古诗意象知识抽取及其文化图式构建研究. 图书情报工作. 2022(24): 104-117 . 百度学术
    10. 李永卉,周树斌,周宇婷,卢章平. 基于图数据库Neo4j的宋代镇江诗词知识图谱构建研究. 大学图书馆学报. 2021(02): 52-61 . 百度学术

    其他类型引用(23)

计量
  • 文章访问数:  1887
  • HTML全文浏览量:  12
  • PDF下载量:  1066
  • 被引次数: 33
出版历程
  • 发布日期:  2020-05-31

目录

    /

    返回文章
    返回