• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Xia Sibo, Ma Minghua, Jin Pengxiang, Cui Liyue, Zhang Shenglin, Jin Wa, Sun Yongqian, Pei Dan. Response Time Anomaly Diagnosis for Search Service[J]. Journal of Computer Research and Development, 2024, 61(6): 1573-1584. DOI: 10.7544/issn1000-1239.202330054
Citation: Xia Sibo, Ma Minghua, Jin Pengxiang, Cui Liyue, Zhang Shenglin, Jin Wa, Sun Yongqian, Pei Dan. Response Time Anomaly Diagnosis for Search Service[J]. Journal of Computer Research and Development, 2024, 61(6): 1573-1584. DOI: 10.7544/issn1000-1239.202330054

Response Time Anomaly Diagnosis for Search Service

Funds: This work was supported by the National Natural Science Foundation of China for Yong Scientists (61902200, 62072264) and the Natural Science Foundation of Tianjin (21JCQNJC00180).
More Information
  • Author Bio:

    Xia Sibo: born in 2000. Master candidate. His main research interests include knowledge graph, and failure detection and diagnosis

    Ma Minghua: born in 1993. PhD. Researcher at Microsoft Research Lab Asia. His main research interests include cloud intelligence and AIOps

    Jin Pengxiang: born in 1998. Master. His main research interests include failure diagnosis and root cause analysis

    Cui Liyue: born in 1997. Master. Her main research interests include anomaly detection, failure diagnosis, and machine learning

    Zhang Shenglin: born in 1989. PhD, associate professor. Member of CCF, IEEE, and ACM. His main research interests include failure detection, and diagnosis and prediction in data center networks

    Jin Wa: born in 2001. Bachelor. Her main research interests include failure diagnosis and root cause analysis

    Sun Yongqian: born in 1988. PhD, associate professor. Member of CCF, IEEE, and ACM. His main research interest includes anomaly detection, root cause analysis, and failure diagnosis in service management

    Pei Dan: born in 1973. PhD, associate professor. Member of CCF. Senior member of IEEE and ACM. His main research interest includes network and service management

  • Received Date: January 29, 2023
  • Revised Date: July 24, 2023
  • Available Online: March 13, 2024
  • The timely response of network services is crucial to improving user experience. Taking search engine as a typical example of network services, service providers need to ensure that the search response time is within one second. In practice, the search response time can be affected by many service attributes, such as user browsers, ISPs, and page loading methods. To optimize effectively, service providers need to identify the rules that cause high search response time, which are combinations of the above attributes. However, existing work encounters three challenges. First, the amount of search logs is large. Second, the search logs are unevenly distributed. Third, the rules with high generality are needed. Therefore, we propose a framework called Miner (multi-dimensional extraction of rules). Miner takes advantage of self-paced sampling to overcome the first and second challenges. To address the third challenge, Miner employs Corels to generate rules with high generality and recall. Our experiments use search logs provided from two top-tier search engine companies in China. The results show that Miner outperforms the state-of-the-art methods in terms of generality and recall. Operators adopt rules generated by Miner and optimize the performance of the search engine.

  • [1]
    Schurman E, Brutlag J. The user and business impact of server delays, additional bytes, and http chunking in web search[C/OL]//Proc of the Velocity Web Performance and Operations Conf. Sebastopol, CA: O'Reilly Media, 2009[2023-07-19].https://www.researchgate.net/publication/280113406
    [2]
    Sundaresan S, Magharei N, Feamster N, et al. Web performance bottlenecks in broadband access networks[C]//Proc of the 2013 ACM SIGMETRICS/Int Conf on Measurement and Modeling of Computer Systems. New York: ACM, 2013: 383−384
    [3]
    Yoon D Y, Niu N, Mozafari B. DBSherlock: A performance diagnostic tool for transactional databases[C]//Proc of the 2016 Int Conf on Management of Data. New York: ACM, 2016: 1599−1614
    [4]
    Jiang Junchen, Sekar V, Stoica I, et al. Shedding light on the structure of internet video quality problems in the wild[C]//Proc of the 9th ACM Conf on Emerging Networking Experiments and Technologies. New York: ACM, 2013: 357−368
    [5]
    Liu Dapeng, Zhao Youjian, Pei Dan, et al. Narrowing down the debugging space of slow search response time[C/OL]//Proc of the 34th Int Performance Computing and Communications Conf. Piscataway, NJ: IEEE, 2015[2023-07-19].https://ieeexplore.ieee.org/abstract/document/7410318
    [6]
    Liu Dapeng, Zhao Youjian, Sui Kaixin, et al. FOCUS: Shedding light on the high search response time in the wild[C]//Proc of the 35th Annual IEEE Int Conf on Computer Communications. Piscataway, NJ: IEEE, 2016: 1−9
    [7]
    Angelino E, Larus-Stone N, Alabi D, et al. Learning certifiably optimal rule lists for categorical data[J]. arXiv preprint, arXiv: 1704.01701, 2018
    [8]
    Khoussainova N, Balazinska M, Balazinska M. PerfXplain: Debugging mapreduce job performance[J]. arXiv preprint, arXiv: 1203.6400, 2012
    [9]
    Bansal C, Renganathan S, Asudani A, et al. DeCaf: Diagnosing and triaging performance issues in large-scale cloud services[C]//Proc of the 42nd ACM/IEEE Int Conf on Software Engineering: Software Engineering in Practice. New York: ACM, 2020: 201−210
    [10]
    Zhang Xu, Du Chao, Li Yifan, et al. HALO: Hierarchy-aware fault localization for cloud systems[C]//Proc of the 27th ACM SIGKDD Conf on Knowledge Discovery & Data Mining. New York: ACM, 2021: 3948−3958
    [11]
    Castelluccio M, Sansone C, Verdoliva L, et al. Automatically analyzing groups of crashes for finding correlations[C]//Proc of the 11th Joint Meeting on Foundations of Software Engineering. New York: ACM, 2017: 717−726
    [12]
    Lin F, Muzumdar K, Laptev N P, et al. Fast dimensional analysis for root cause investigation in a large-scale service environment[J/OL]. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2020[2023-07-19].https://dl.acm.org/doi/abs/10.1145/3392149
    [13]
    Liu Zhining, Cao Wei, Gao Zhifeng, et al. Self-paced ensemble for highly imbalanced massive data classification[C]//Proc of the 36th IEEE Int Conf on Data Engineering. Piscataway, NJ: IEEE, 2020: 841−852
    [14]
    Cheng Li, Wang Yijie, Liu Xinwang, et al. Outlier detection ensemble with embedded feature selection[C]//Proc of the 34th AAAl Conf on Artificial Intelligence. Palo Alto, CA: AAAI, 2020: 3503−3512
  • Related Articles

    [1]Li Gengsong, Liu Yi, Zheng Qibin, Li Xiang, Liu Kun, Qin Wei, Wang Qiang, Yang Changhong. Algorithm Selection Method Based on Multi-Objective Hybrid Ant Lion Optimizer[J]. Journal of Computer Research and Development, 2023, 60(7): 1533-1550. DOI: 10.7544/issn1000-1239.202220769
    [2]Sun Penghao, Lan Julong, Shen Juan, Hu Yuxiang. Pinning Control-Based Routing Policy Generation Using Deep Reinforcement Learning[J]. Journal of Computer Research and Development, 2021, 58(7): 1563-1572. DOI: 10.7544/issn1000-1239.2021.20200018
    [3]Hu Haiyang, Liu Runhua, Hu Hua. Multi-Objective Optimization for Task Scheduling in Mobile Cloud Computing[J]. Journal of Computer Research and Development, 2017, 54(9): 1909-1919. DOI: 10.7544/issn1000-1239.2017.20160757
    [4]Li Li, Wang Wanliang, Xu Xinli, Li Weikun. Multi-Objective Particle Swarm Optimization Based on Grid Ranking[J]. Journal of Computer Research and Development, 2017, 54(5): 1012-1023. DOI: 10.7544/issn1000-1239.2017.20160074
    [5]Bi Xiaojun, Zhang Lei, Xiao Jing. Constrained Multi-Objective Optimization Algorithm Based on Dual Populations[J]. Journal of Computer Research and Development, 2015, 52(12): 2813-2823. DOI: 10.7544/issn1000-1239.2015.20148025
    [6]Zhang Shiwen, Li Zhiyong, Chen Shaomiao, and Li Renfa. Dynamic Multi-Objective Optimization Algorithm Based on Ecological Strategy[J]. Journal of Computer Research and Development, 2014, 51(6): 1313-1330.
    [7]Wen Renqiang, Zhong Shaobo, Yuan Hongyong, Huang Quanyi. Emergency Resource Multi-Objective Optimization Scheduling Model and Multi-Colony Ant Optimization Algorithm[J]. Journal of Computer Research and Development, 2013, 50(7): 1464-1472.
    [8]Liu Chun'an, Wang Yuping. Dynamic Multi-Objective Optimization Evolutionary Algorithm Based on New Model[J]. Journal of Computer Research and Development, 2008, 45(4): 603-611.
    [9]Chang Yan, Liu Xu, Cheng Wenyuan, Xie Xianghui, Cui Degang. Research and Application of Multi-Objective Aircraft Optimization System Based on Grid[J]. Journal of Computer Research and Development, 2007, 44(1): 44-60.
    [10]Ma Ming, Zhou Chunguang, Zhang Libiao, Ma Jie. Fuzzy Neural Network Optimization by a Multi-Objective Particle Swarm Optimization Algorithm[J]. Journal of Computer Research and Development, 2006, 43(12): 2104-2109.
  • Cited by

    Periodical cited type(16)

    1. 戎珂,施新伟,吕若明. “i7算”赋能AI产业生态可持续发展. 科学学研究. 2025(01): 197-204 .
    2. 张浩严,吕文涛,余润泽,邓志江. 大语言模型研究现状. 无线电工程. 2025(01): 163-174 .
    3. 李东闻,钟震宇,孙羽菲,申峻宇,马子智,于川越,张玉志. 玲珑:一个小规模的高质量中文预训练语言模型. 计算机研究与发展. 2025(03): 682-693 . 本站查看
    4. 陶江垚,奚雪峰,盛胜利,崔志明,左严. 结构化思维提示增强大语言模型推理能力综述. 计算机工程与应用. 2025(06): 64-83 .
    5. 魏楚元,王昕,周小平,赵光哲,黄明. 大型语言模型及其在建筑行业应用研究综述. 北京建筑大学学报. 2024(02): 1-14+121 .
    6. 庞进喜. 大模型在汽车国际化多语言处理中的应用. 中国汽车. 2024(05): 14-20 .
    7. 王晓璐,杨云轩,谢阳斌. 创造人机对话式学习新形态——大语言模型的教育应用现状与展望. 中小学信息技术教育. 2024(05): 15-17 .
    8. 马伟民. 自然语言大模型技术在政务服务智能客服系统建设中的应用. 信息与电脑(理论版). 2024(08): 86-88 .
    9. 曾白凌. “被中介的真理”:Sora对媒介相合性的追问. 现代传播(中国传媒大学学报). 2024(05): 1-10 .
    10. 童俊杰,申佳,赫罡,张奎. 运营商智算中心建设思路及方案. 邮电设计技术. 2024(09): 68-73 .
    11. 刘同军. 生成式人工智能革新数学教学:场景与案例. 中学数学杂志. 2024(10): 1-4 .
    12. 尹为民. 一种基于预训练模型的类增量学习近似重放方法分析. 电子技术. 2024(10): 144-145 .
    13. 崔金满,李冬梅,田萱,孟湘皓,杨宇,崔晓晖. 提示学习研究综述. 计算机工程与应用. 2024(23): 1-27 .
    14. 王珍珍,向巴卓玛,赵岩松,马星光. 以ChatGPT为代表的大型语言模型在医学教学中的应用. 医学教育管理. 2024(06): 692-697 .
    15. 王琳. 大语言模型技术背景下重塑研究生论文评价与指导. 学位与研究生教育. 2024(12): 30-37 .
    16. 朱俊仪,朱尚明. 利用检索增强生成技术开发本地知识库应用. 通信学报. 2024(S2): 242-247 .

    Other cited types(7)

Catalog

    Article views (123) PDF downloads (50) Cited by(23)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return