SemreX中基于语义的文档参考文献元数据信息提取
Semantic Document Reference Metadata Extraction in SemreX
-
摘要: 为了实现科研工作者之间的文献知识的共享,结合语义网技术,提出了一种从文档中提取参考文献元数据信息的方法.该方法采用模式匹配方式,可以从文档中提取作者、标题、出版时间、期刊名等信息,并使用OWL本体描述语言进行形式化,为进一步的语义搜索奠定基础.实验数据证明了该方法的有效性.Abstract: In order to implement knowledge sharing between technical researchers, a new method using techniques of semantic Web is proposed, which can retrieve reference metadata from varied documents. By way of pattern matching, it can get some metadata such as authors, titles, publishing date and journal name, and uses the OWL ontology description language to formulate the metadata, which assists the semantic searching in the next step. Experiments prove its efficiency.