Li Peng, Wang Bin, Shi Zhiwei, Cui Yachao, and Li Hengxun. Tag-TextRank: A Webpage Keyword Extraction Method Based on Tags[J]. Journal of Computer Research and Development, 2012, 49(11): 2344-2351.

Tag-TextRank: A Webpage Keyword Extraction Method Based on Tags

  • Published Date: November 14, 2012
  • Keyword extraction identifies representative keywords in a text and is widely used in text processing applications. In this paper, we explore the use of tags to improve webpage keyword extraction. Specifically, we first analyze the characteristics of bookmarking behavior and find that people usually use the same tags to label multiple topic-related webpages: over 90% of labeled webpages can find relevant webpages through their tag information. Based on this finding, we propose a method called Tag-TextRank. As an extension of the classic keyword extraction method TextRank, Tag-TextRank computes term importance on a weighted term graph, where the edge weight of a term pair is estimated from statistics of the relevant documents introduced by a given tag of the target webpage. The final importance score of a term combines these tag-dependent importance scores. By drawing on more documents to measure term relations, Tag-TextRank can better estimate term importance. Experimental results on a publicly available corpus show that Tag-TextRank outperforms TextRank on various metrics.

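The Tag-TextRank procedure outlined in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The abstract does not specify which statistics set the edge weights or how the tag-dependent scores are combined, so the sketch assumes windowed co-occurrence counts for edge weights and a weighted average (uniform by default) for the combination. All function and parameter names (`cooccurrence_weights`, `weighted_textrank`, `tag_textrank`, `tag_to_docs`, `tag_weights`, `window`) are hypothetical.

```python
from collections import defaultdict


def cooccurrence_weights(docs, vocab, window=2):
    """Count co-occurrences of vocabulary terms within `window` tokens.

    Assumption: co-occurrence counts stand in for the unspecified
    "statistics of the relevant documents" mentioned in the abstract.
    """
    weights = defaultdict(float)
    for tokens in docs:
        for i, a in enumerate(tokens):
            if a not in vocab:
                continue
            for b in tokens[i + 1 : i + 1 + window]:
                if b in vocab and b != a:
                    weights[frozenset((a, b))] += 1.0
    return weights


def weighted_textrank(vocab, weights, damping=0.85, iterations=50):
    """Run weighted PageRank on an undirected term graph (TextRank style)."""
    neighbors = defaultdict(dict)
    for pair, w in weights.items():
        a, b = tuple(pair)
        neighbors[a][b] = w
        neighbors[b][a] = w
    scores = {t: 1.0 for t in vocab}
    for _ in range(iterations):
        new_scores = {}
        for t in vocab:
            rank = 0.0
            for n, w in neighbors[t].items():
                # Each neighbor passes on a share of its score in
                # proportion to the weight of the connecting edge.
                rank += w / sum(neighbors[n].values()) * scores[n]
            new_scores[t] = (1 - damping) + damping * rank
        scores = new_scores
    return scores


def tag_textrank(target_tokens, tag_to_docs, tag_weights=None, window=2):
    """Combine one TextRank run per tag into a final term ranking.

    target_tokens: tokens of the target webpage.
    tag_to_docs:   maps each tag of the target page to the token lists of
                   other pages carrying the same tag.
    tag_weights:   assumed combination weights; uniform if omitted.
    """
    vocab = set(target_tokens)
    tags = list(tag_to_docs)
    if tag_weights is None:
        tag_weights = {t: 1.0 / len(tags) for t in tags}
    combined = defaultdict(float)
    for tag in tags:
        # Edge weights come from the target page plus the tag's documents.
        docs = [target_tokens] + tag_to_docs[tag]
        weights = cooccurrence_weights(docs, vocab, window)
        for term, score in weighted_textrank(vocab, weights).items():
            combined[term] += tag_weights[tag] * score
    return sorted(combined.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    target = ["python", "tutorial", "recipe"]
    related = {"programming": [["python", "tutorial", "python", "tutorial"],
                               ["python", "tutorial"]]}
    for term, score in tag_textrank(target, related):
        print(f"{term}\t{score:.3f}")
```

In this toy input, the documents sharing the tag reinforce the link between "python" and "tutorial", so both rank above "recipe", which is only weakly connected in the target page itself. Plain TextRank corresponds to the special case where each tag contributes no extra documents.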