高级检索
    唐明珠, 杨 艳, 郭雪泉, 沈中辉, 钟颖莉. KWSDS:关系数据库中Top-k关键词搜索系统[J]. 计算机研究与发展, 2012, 49(10): 2251-2259.
    引用本文: 唐明珠, 杨 艳, 郭雪泉, 沈中辉, 钟颖莉. KWSDS:关系数据库中Top-k关键词搜索系统[J]. 计算机研究与发展, 2012, 49(10): 2251-2259.
    Tang Mingzhu, Yang Yan, Guo Xuequan, Shen Zhonghui, Zhong Yingli. KWSDS: A Top-k Keyword Search System in Relational Databases[J]. Journal of Computer Research and Development, 2012, 49(10): 2251-2259.
    Citation: Tang Mingzhu, Yang Yan, Guo Xuequan, Shen Zhonghui, Zhong Yingli. KWSDS: A Top-k Keyword Search System in Relational Databases[J]. Journal of Computer Research and Development, 2012, 49(10): 2251-2259.

    KWSDS:关系数据库中Top-k关键词搜索系统

    KWSDS: A Top-k Keyword Search System in Relational Databases

    • 摘要: 关系数据库中的关键词搜索技术已经成为信息检索领域的研究热点,它为没有任何SQL语法知识的用户提供了一个简单友好的接口.但是现存的关键词搜索系统主要依赖于数据图或模式图,而单独使用数据图或模式图的算法搜索效率不高,结果准确率也较低.设计实现了一个Top-k关键词搜索系统(keyword search system based on database graph and schema graph,KWSDS),用户提交关键词后,系统对关键词进行预处理,消除一些脏关键词.首次提出使用数据图与模式图相结合的方法,设计了同表查询算法和异表查询算法,分析了算法的正确性和时间复杂度,并且提出了相关性结果排序方法.KWSDS系统的搜索算法运行时间短,搜索结果准确性高,具有良好的查询性能.最后通过实验验证了KWSDS的效率.

       

      Abstract: Keyword search technology over relational databases has become one of hot topics in the field of information retrieval. It can provide the users with little SQL knowledge a simple and friendly interface. But the algorithms of some existing keyword search systems are mainly based on database graph or schema graph. However, the efficiency of them which use database graph or schema graph separately is low. The accurate rate of results is also not high. This paper devises and implements a top-k keyword search system KWSDS (keyword search system based on database graph and schema graph). After users input the keywords, it can eliminate some dirty keywords through pre-processing. The method of combining database graph and schema graph together to solve the problem of keyword search is proposed for the first time. This paper also devises the search algorithms between same table and different tables, prove the correctness of the algorithms and analyze time complexity of them. A sorting method based on relevance is proposed at the same time. The algorithms of KWSDS system run in shorter time than the existing algorithms, the results are output by KWSDS system with high accuracy. The system has excellent query performance. Finally, KWSDS is verified by abundant experiments.

       

    /

    返回文章
    返回