• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Zhang Yu, Song Wei, Liu Ting, and Li Sheng. Query Classification Based on URL Topic[J]. Journal of Computer Research and Development, 2012, 49(6): 1298-1305.
Citation: Zhang Yu, Song Wei, Liu Ting, and Li Sheng. Query Classification Based on URL Topic[J]. Journal of Computer Research and Development, 2012, 49(6): 1298-1305.

Query Classification Based on URL Topic

More Information
  • Published Date: June 14, 2012
  • Many online resources contain crowd intelligence. Categorized website directory is one kind of resources constructed and maintained manually. It aims to organize websites according to a topical taxonomy. Based on the URLs with topical labels in website directory, a URL topical classifier could be designed. Together with pseudo relevance feedback technique and search engine query logs, an automatic, fast and efficient query topical classification method is proposed. In detail, the method combines two strategies. Strategy-1 is to predict a query’s topic by computing the topic distribution among the returned URLs of a search system. Strategy-2 is to train a statistical classifier using the automatically labeled queries in query logs based on the topic of clicked URLs. The experimental results show that our method can achieve better precision compared with a state of the art algorithm and is more efficient for online processing. It has good scalability and can construct large scale training data from query logs automatically.
  • Related Articles

    [1]Wei Jinxia, Long Chun, Fu Hao, Gong Liangyi, Zhao Jing, Wan Wei, Huang Pan. Malicious Domain Name Detection Method Based on Enhanced Embedded Feature Hypergraph Learning[J]. Journal of Computer Research and Development, 2024, 61(9): 2334-2346. DOI: 10.7544/issn1000-1239.202330117
    [2]Guo Yingjie, Liu Xiaoyan, Wu Chenxi, Guo Maozu, Li Ao. U-Statistics and Ensemble Learning Based Method for Gene-Gene Interaction Detection[J]. Journal of Computer Research and Development, 2018, 55(8): 1683-1693. DOI: 10.7544/issn1000-1239.2018.20180365
    [3]Liu Qiao, Han Minghao, Yang Xiaohui, Liu Yao, Wu Zufeng. Representation Learning Based Relational Inference Algorithm with Semantical Aspect Awareness[J]. Journal of Computer Research and Development, 2017, 54(8): 1682-1692. DOI: 10.7544/issn1000-1239.2017.20170200
    [4]Wang Youwei, Wang Weiping, Meng Dan. Query Optimization by Statistical Approach for Hive Data Warehouse[J]. Journal of Computer Research and Development, 2015, 52(6): 1452-1462. DOI: 10.7544/issn1000-1239.2015.20140403
    [5]Zhang Yingjie, Gong Zhonghan. Hybrid Differential Evolution Gravitation Search Algorithm Based on Threshold Statistical Learning[J]. Journal of Computer Research and Development, 2014, 51(10): 2187-2194. DOI: 10.7544/issn1000-1239.2014.20130395
    [6]Wu Yan, Zhang Qi, and Huang Xuanjing. Selecting Expansion Terms as a Set Via Integer Linear Programming[J]. Journal of Computer Research and Development, 2013, 50(8): 1737-1743.
    [7]Pu Qiang, He Daqing, Yang Guowei. An Estimation of Query Language Model Based on Statistical Semantic Clustering[J]. Journal of Computer Research and Development, 2011, 48(2): 224-231.
    [8]Liu Dayou, Yu Peng, Gao Ying, Qi Hong, and Sun Shuyang. Research Progress in Statistical Relational Learning[J]. Journal of Computer Research and Development, 2008, 45(12): 2110-2119.
    [9]Zhou Hongwei, Zhang Chengyi, and Zhang Minxuan. A Method of Statistics-Based Cache Leakage Power Estimation[J]. Journal of Computer Research and Development, 2008, 45(2): 367-374.
    [10]Xu Cunlu, Chen Yanqiu, Lu Hanqing. Statistical Landscape Features for Texture Retrieval[J]. Journal of Computer Research and Development, 2006, 43(4): 702-707.

Catalog

    Article views (1070) PDF downloads (646) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return