Citation: | Guo Jing, Hu Cunchen, Bao Yungang. Survey on Guaranteeing the Performance of Co-Located Applications[J]. Journal of Computer Research and Development, 2024, 61(1): 43-65. DOI: 10.7544/issn1000-1239.202220333 |
The huge cost of investment and low resource utilization in the datacenter has long been a great concern to cloud providers. To address this issue, a straightforward way is co-locating more applications on the same hardware to improve resource efficiency. However, the shared resource contention caused by co-located applications leads to performance interference, affecting the application’s performance, quality of service (QoS) and user satisfaction. Therefore, how to guarantee the performance of the co-located application has been a key issue in the colocation scenario. We introduce the researches of guaranteeing the performance of co-located applications, including the background of co-location, challenges, and key technologies. The related work is summarized from four aspects: application and cluster characterization (basic), interference detection (premise), server-level resource allocation (micro-level policy), and cluster-level job scheduling (macro-level policy). In addition, due to the diverse characteristics of co-located applications and clusters, the research of guaranteeing the performance faces different challenges and problem complexity in the different co-located scenarios. For example, the number of co-located applications deployed on a unit resource will directly affect the time cost of searching resource space, and the running mode of applications will affect the competition intensity of shared resources. Therefore, from the perspective of problem complexity, we discuss and analyze the challenges of research work from three dimensions, cluster and application characteristics, resource interference dimension, and the number of co-located applications. At the end of this paper, we discuss the future research directions and the challenges in the high deployment density scenario. We conclude that the software/hardware co-designed full-stack approach is the trend to guarantee the performance in high deployment density clusters, and this approach can help to provide predictable performance and high resource efficiency in the datacenter.
[1] |
Dean J, Barroso L A. The tail at scale[J]. Communications of the ACM, 2013, 56(2): 74−80 doi: 10.1145/2408776.2408794
|
[2] |
Barroso L A, Hölzle U. The datacenter as a computer: An introduction to the design of warehouse-scale machines[J]. Synthesis Lectures on Computer Architecture, 2009, 4(1): 1−108
|
[3] |
Delimitrou C, Kozyrakis C. Quasar: Resource-efficient and QoS-aware cluster management[J]. ACM SIGPLAN Notices, 2014, 49(4): 127−144 doi: 10.1145/2644865.2541941
|
[4] |
Verma A, Pedrosa L, Korupolu M, et al. Large-scale cluster management at Google with Borg [C/OL] //Proc of the 10th European Conf on Computer Systems. New York: ACM, 2015[2023-01-11].https://dl.acm.org/doi/10.1145/2741948.2741964
|
[5] |
Guo Jing, Chang Zihao, Wang Sa, et al. Who limits the resource efficiency of my datacenter: An analysis of Alibaba datacenter traces [C/OL] //Proc of the 27th IEEE/ACM Int Symp on Quality of Service. Piscataway, NJ: IEEE, 2019[2023-01-11].https://ieeexplore.ieee.org/document/9068614
|
[6] |
Reiss C, Tumanov A, Ganger G R, et al. Heterogeneity and dynamicity of clouds at scale: Google trace analysis [C/OL] //Proc of the 3rd ACM Symp on Cloud Computing. New York: ACM, 2012[2023-01-11].https://dl.acm.org/doi/10.1145/2391229.2391236
|
[7] |
Tirmazi M, Barker A, Deng N, et al. Borg: The next generation [C/OL] //Proc of the 15th European Conf on Computer Systems. New York: ACM, 2020[2023-01-11]. doi: 10.1145/3342195.3387517
|
[8] |
Nathuji R, Kansal A, Ghaffarkhah A. Q-clouds: Managing performance interference effects for QoS-aware clouds [C] //Proc of the 5th European Conf on Computer Systems. New York: ACM, 2010: 237−250
|
[9] |
Chen Shuang, Delimitrou C, Martínez J F. PARTIES: QoS-aware resource partitioning for multiple interactive services [C] //Proc of the 24th Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2019: 107−120
|
[10] |
Zhuravlev S, Blagodurov S, Fedorova A. Addressing shared resource contention in multicore processors via scheduling[J]. ACM SIGPLAN Notices, 2010, 45(3): 129−142 doi: 10.1145/1735971.1736036
|
[11] |
徐志伟,李春典. 低熵云计算系统[J]. 中国科学:信息科学,2017,47:1149−1163
Xu Zhiwei, Li Chundian. Low-entropy cloud computing systems[J]. SCIENTIA SINICA Informationis, 2017, 47: 1149−1163 (in Chinese)
|
[12] |
Chandra D, Guo Fei, Kim S, et al. Predicting inter-thread cache contention on a chip multi-processor architecture [C] // Proc of the 11th Int Symp on High-Performance Computer Architecture. Piscataway, NJ: IEEE, 2005: 340−351
|
[13] |
Tang Lingjia, Mars J, Vachharajani N, et al. The impact of memory subsystem resource sharing on datacenter applications [C] // Proc of the 38th Annual Int Symp on Computer Architecture (ISCA). New York: ACM, 2011: 283−294
|
[14] |
Lo D, Cheng Liqun, Govindaraju R, et al. Heracles: Improving resource efficiency at scale [C] //Proc of the 42nd Annual Int Symp on Computer Architecture. New York: ACM, 2015: 450−462
|
[15] |
Linden G. Marissa Mayer at Web 2.0 [EB/OL]. 2006 [2022-03-29]. http://glinden.blogspot.com/2006/11/marissa-mayer-at-web-20.html
|
[16] |
Schurman E, Brutlag J. The user and business impact of server delays, additional bytes, and http chunking in web search [EB/OL]. 2009[2023-01-11]. https://vdocuments.mx/the-user-and-business-impact-of-server-delays-additional-bytes-and-http-chunking.html
|
[17] |
Einav Y. Amazon found every 100ms of latency cost them 1% in sales [EB/OL]. 2019[2022-03-29].https://www.gigaspaces.com/blog/amazon-found-every-100ms-of-latency-cost-them-1-in-sales
|
[18] |
Haque M E, He Yuxiong, Elnikety S, et al. Exploiting heterogeneity for tail latency and energy efficiency [C] //Proc of the 50th Annual IEEE/ACM Int Symp on Microarchitecture. New York: ACM, 2017: 625−638
|
[19] |
张鲁飞,陈左宁. 虚拟集群上面向功耗的形式化的VM调度策略[J]. 计算机科学,2014,41(8):38−41
Zhang Lufei, Chen Zuoning. Power-efficient formal scheduling policy of VMs in virtualized clusters[J]. Computer Science, 2014, 41(8): 38−41 (in Chinese)
|
[20] |
Leverich J, Kozyrakis C. Reconciling high server utilization and sub-millisecond quality-of-service [C/OL] //Proc of the 9th European Conf on Computer Systems. New York: ACM, 2014[2023-01-11].https://dl.acm.org/doi/10.1145/2592798.2592821
|
[21] |
Lin Jiang, Lu Qingda, Ding Xiaoning, et al. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems [C] //Proc of the 14th Int Symp on High Performance Computer Architecture. Piscataway, NJ: IEEE, 2008: 367−378
|
[22] |
Sherwood T, Calder B, Emer J. Reducing cache misses using hardware and software page placement [C] //Proc of the 13th Int Conf on Supercomputing. New York: ACM, 1999: 155−164
|
[23] |
Ye Ying, West R, Cheng Zhuoqun, et al. Coloris: A dynamic cache partitioning system using page coloring [C] // Proc of the 23rd Int Conf on Parallel Architecture and Compilation Techniques (PACT). Piscataway, NJ: IEEE, 2014: 381−392
|
[24] |
邱杰凡,华宗汉,范菁,等. 内存体系划分技术的研究与发展[J]. 软件学报,2022,33(2):751−769 doi: 10.13328/j.cnki.jos.006370
Qiu Jiefan, Hua Zonghan, Fan Jing, et al. Evolution of memory partitioning technologies: Case study through page coloring[J]. Journal of Software, 2022, 33(2): 751−769 (in Chinese) doi: 10.13328/j.cnki.jos.006370
|
[25] |
Albonesi D H. Selective cache ways: On-demand cache resource allocation [C] // Proc of the 32nd Annual ACM/IEEE Int Symp on Microarchitecture. Piscataway, NJ: IEEE, 1999: 248−259
|
[26] |
Balasubramonian R, Albonesi D, Buyuktosunoglu A, et al. Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures [C] //Proc of the 33rd Annual ACM/IEEE Int Symp on Microarchitecture. Piscataway, NJ: IEEE, 2000: 245−257
|
[27] |
Chiou D, Jain P, Rudolph L, et al. Application-specific memory management for embedded systems using software-controlled caches [C] //Proc of the 37th Annual Design Automation Conf. New York: ACM, 2000: 416−419
|
[28] |
Ranganathan P, Adve S, Jouppi N P. Reconfigurable caches and their application to media processing[J]. ACM SIGARCH Computer Architecture News, 2000, 28(2): 214−224 doi: 10.1145/342001.339685
|
[29] |
Liu Fang, Jiang Xiaowei, Solihin Y. Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance [C/OL] //Proc of the 16th Int Symp on High-Performance Computer Architecture. Piscataway, NJ: IEEE, 2010[2023-01-11].https://ieeexplore.ieee.org/document/5416655/
|
[30] |
Yun H, Yao Gang, Pellizzoni R, et al. Memguard: Memory bandwidth reservation system for efficient performance isolation in multi-core platforms [C] //Proc of the 19th Real-Time and Embedded Technology and Applications Symp. Piscataway, NJ: IEEE, 2013: 55−64
|
[31] |
Iyer R, Zhao Li, Guo Fei, et al. QoS policies and architecture for cache/memory in CMP platforms[J]. ACM SIGMETRICS Performance Evaluation Review, 2007, 35(1): 25−36 doi: 10.1145/1269899.1254886
|
[32] |
Herdrich A, Illikkal R, Iyer R, et al. Rate-based QoS techniques for cache/memory in CMP platforms [C] //Proc of the 23rd Int Conf on Supercomputing. New York: ACM, 2009: 479−488
|
[33] |
Shahrad M, Balkind J, Wentzlaff D. Architectural implications of function-as-a-service computing [C] //Proc of the 52nd Annual IEEE/ACM Int Symp on Microarchitecture. New York: ACM, 2019: 1063−1075
|
[34] |
Park J, Park S, Baek W. CoPart: Coordinated partitioning of last-level cache and memory bandwidth for fairness-aware workload consolidation on commodity servers [C/OL] //Proc of the 14th EuroSys Conf. New York: ACM, 2019[2023-01-11].https://dl.acm.org/doi/10.1145/3302424.3303963
|
[35] |
Bashir N, Deng Nan, Rzadca K, et al. Take it to the limit: Peak prediction-driven resource overcommitment in datacenters [C] //Proc of the 16th European Conf on Computer Systems. New York: ACM, 2021: 556−573
|
[36] |
Lagar-Cavilla A, Ahn J, Souhlal S, et al. Software-defined far memory in warehouse-scale computers [C] //Proc of the 24th Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2019: 317−330
|
[37] |
Waldspurger C A. Memory resource management in VMware ESX server[J]. ACM SIGOPS Operating Systems Review, 2002, 36(SI): 181−194 doi: 10.1145/844128.844146
|
[38] |
Gu Juncheng, Lee Y, Zhang Yiwen, et al. Efficient memory disaggregation with infiniswap [C] //Proc of the 14th USENIX Symp on Networked Systems Design and Implementation. Berkeley, CA: USENIX Association, 2017: 649−667
|
[39] |
Liang Shuang, Noronha R, Panda D K. Swapping to remote memory over infiniband: An approach using a high performance network block device [C/OL] //Proc of the 7th IEEE Int Conf on Cluster Computing. Piscataway, NJ: IEEE, 2005[2023-01-11].https://ieeexplore.ieee.org/document/4154093
|
[40] |
Brown M A. Traffic control howto [EB/OL]. 2006 [2022-03-29]. http://linux-ip.net/articles/Traffic-Control-HOWTO/
|
[41] |
Hong Chiyao, Caesar M, Godfrey P B. Finishing flows quickly with preemptive scheduling[J]. ACM SIGCOMM Computer Communication Review, 2012, 42(4): 127−138 doi: 10.1145/2377677.2377710
|
[42] |
Hu Shuihai, Bai Wei, Chen Kai, et al. Providing bandwidth guarantees, work conservation and low latency simultaneously in the cloud[J]. IEEE Transactions on Cloud Computing, 2018, 9(2): 763−776
|
[43] |
Grosvenor M P, Schwarzkopf M, Gog I, et al. Queues don’t matter when you can JUMP Them! [C/OL] //Proc of the 12th USENIX Symp on Networked Systems Design and Implementation. Berkeley, CA: USENIX Association, 2015[2023-01-11].https://dl.acm.org/doi/abs/10.5555/2789770.2789771
|
[44] |
Perry J, Ousterhout A, Balakrishnan H, et al. Fastpass: A centralized"zero-queue" datacenter network [C] //Proc of the 28th ACM Conf on SIGCOMM. New York: ACM, 2014: 307−318
|
[45] |
Nagaraj K, Bharadia D, Mao Hongzi, et al. Numfabric: Fast and flexible bandwidth allocation in datacenters [C] //Proc of the 30th ACM SIGCOMM Conf. New York: ACM, 2016: 188−201
|
[46] |
Wang Shuai, Gao Kaihui, Qian Kun, et al. Predictable vFabric on informative data plane [C] //Proc of the 36th ACM SIGCOMM Conf. New York: ACM, 2022: 615−632
|
[47] |
Ma Jiuyue, Sui Xiufeng, Sun Ninghui, et al. Supporting differentiated services in computers via programmable architecture for resourcing-on-demand (PARD) [C] //Proc of the 20th Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2015: 131−143
|
[48] |
Nanda S, Chiueh T. A survey on virtualization technologies [EB/OL]. 2011 [2023-01-11].https://rtcl.eecs.umich.edu/papers/publications/2011/TR179.pdf
|
[49] |
Jonas E, Schleier-Smith J, Sreekanti V, et al. Cloud programming simplified: A Berkeley view on serverless computing [J]. arXiv preprint, arXiv: 1902. 03383, 2019
|
[50] |
Armbrust M, Fox A, Griffith R, et al. Above the clouds: A Berkeley view of cloud computing, UCB/EECS-2009-28[R]. Berkeley, CA: University of California, 2009
|
[51] |
Google Inc. Borg cluster workload traces [EB/OL]. 2019 [2022-03-20].https://github.com/google/cluster-data.
|
[52] |
Alibaba Group. Alibaba cluster trace program [EB/OL]. 2021 [2022-03-29].https://github.com/alibaba/clusterdata
|
[53] |
Microsoft. Azure public dataset [EB/OL]. 2020 [2022-03-29].https://github.com/Azure/AzurePublicDataset
|
[54] |
Chen Tianshi, Guo Qi, Temam O, et al. Statistical performance comparisons of computers[J]. IEEE Transactions on Computers, 2014, 64(5): 1442−1455
|
[55] |
Krushevskaja D, Sandler M. Understanding latency variations of black box services [C] //Proc of the 22nd Int Conf on World Wide Web. New York: ACM, 2013: 703−714
|
[56] |
Ravindranath L, Padhye J, Mahajan R, et al. Timecard: Controlling user-perceived delays in server-based mobile applications [C] //Proc of the 24th ACM Symp on Operating Systems Principles. New York: ACM, 2013: 85−100
|
[57] |
Ravindranath L, Padhye J, Agarwal S, et al. AppInsight: Mobile App performance monitoring in the wild [C] //Proc of the 10th USENIX Symp on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2012: 107−120
|
[58] |
Amvrosiadis G, Park J W, Ganger G R, et al. On the diversity of cluster workloads and its impact on research results [C] //Proc of the 23rd USENIX Annual Technical Conf. Berkeley, CA: USENIX Association, 2018: 533−546
|
[59] |
Delimitrou C, Kozyrakis C. iBench: Quantifying interference for datacenter applications [C] //Proc of the 9th IEEE Int Symp on Workload Characterization. Piscataway, NJ: IEEE, 2013: 23−33
|
[60] |
Qiu Haoran, Banerjee S S, Jha S, et al. FIRM: An intelligent fine-grained resource management framework for SLO-oriented microservices [C] //Proc of the 14th USENIX Symp on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2020: 805−825
|
[61] |
Patel T, Tiwari D. CLITE: Efficient and QoS-aware co-location of multiple latency-critical jobs for warehouse scale computers [C] // Proc of the 26th IEEE Int Symp on High Performance Computer Architecture. Piscataway, NJ: IEEE, 2020: 193−206
|
[62] |
Mars J, Tang Lingjia, Skadron K, et al. Increasing utilization in modern warehouse-scale computers using bubble-up[J]. IEEE Micro, 2012, 32(3): 88−99 doi: 10.1109/MM.2012.22
|
[63] |
Yang Hailong, Breslow A, Mars J, et al. Bubble-flux: Precise online QoS management for increased utilization in warehouse scale computers[J]. ACM SIGARCH Computer Architecture News, 2013, 41(3): 607−618 doi: 10.1145/2508148.2485974
|
[64] |
Chen Quan, Yang Hailong, Guo Minyi, et al. Prophet: Precise QoS prediction on non-preemptive accelerators to improve utilization in warehouse-scale computers [C] //Proc of the 22nd Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2017: 17−32
|
[65] |
Zhang Yunqi, Prekas G, Fumarola G M, et al. History-based harvesting of spare cycles and storage in large-scale datacenters [C] //Proc of the 12th USENIX Symp on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2016: 755−770
|
[66] |
Yang Yanan, Zhao Laiping, Li Yiming, et al. INFless: A native serverless system for low-latency, high-throughput inference [C] //Proc of the 27th ACM Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2022: 768−781
|
[67] |
Shahrad M, Fonseca R, Goiri Í, et al. Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider [C] //Proc of the 25th USENIX Annual Technical Conf. Berkeley, CA: USENIX Association, 2020: 205−218
|
[68] |
Cortez E, Bonde A, Muzio A, et al. Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms [C] //Proc of the 26th Symp on Operating Systems Principles. New York: ACM, 2017: 153−167
|
[69] |
Iorgulescu C, Azimi R, Kwon Y, et al. PerfIso: Performance isolation for commercial latency-sensitive services [C] //Proc of the 23rd USENIX Annual Technical Conf. Berkeley, CA: USENIX Association, 2018: 519−532
|
[70] |
Luo Shutian, Xu Huanle, Lu Chengzhi, et al. Characterizing microservice dependency and performance: Alibaba trace analysis [C] //Proc of the 12th ACM Symp on Cloud Computing. New York: ACM, 2021: 412−426
|
[71] |
Zhang Yanqi, Hua Weizhe, Zhou Zhuangzhuang, et al. Sinan: ML-based and QoS-aware resource management for cloud microservices [C] //Proc of the 26th ACM Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2021: 167−181
|
[72] |
Gan Yu, Zhang Yanqi, Cheng Dailun, et al. An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems [C] //Proc of the 24th Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2019: 3−18
|
[73] |
Gan Yu, Zhang Yanqi, Hu K, et al. Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices [C] //Proc of the 24th Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2019: 19−33
|
[74] |
Tian Huangshi, Zheng Yunchuan, Wang Wei. Characterizing and synthesizing task dependencies of data-parallel jobs in Alibaba cloud [C] //Proc of the 10th ACM Symp on Cloud Computing. New York: ACM, 2019: 139−151
|
[75] |
Sriraman A, Dhanotia A, Wenisch T F. Softsku: Optimizing server architectures for microservice diversity@ scale [C] //Proc of the 46th Int Symp on Computer Architecture. New York: ACM, 2019: 513−526
|
[76] |
Delimitrou C, Kozyrakis C. Paragon: QoS-aware scheduling for heterogeneous datacenters[J]. ACM SIGPLAN Notices, 2013, 48(4): 77−88 doi: 10.1145/2499368.2451125
|
[77] |
Shan Yizhou, Huang Yutong, Chen Yilun, et al. LegoOS: A disseminated, distributed OS for hardware resource disaggregation [C] //Proc of the 13th USENIX Symp on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2018: 69−87
|
[78] |
Lu Chengzhi, Ye Kejiang, Xu Guoyao, et al. Imbalance in the cloud: An analysis on Alibaba cluster trace [C] //Proc of the 5th IEEE Int Conf on Big Data. Piscataway, NJ: IEEE, 2017: 2884−2892
|
[79] |
王康瑾,贾统,李影. 在离线混部作业调度与资源管理技术研究综述[J]. 软件学报,2020,31(10):3100−3119
Wang Kangjin, Jia Tong, Li Ying. State-of-the-art survey of scheduling and resource management technology for colocation jobs[J]. Journal of Software, 2020, 31(10): 3100−3119 (in Chinese)
|
[80] |
Liu Qixiao, Yu Zhibin. The elasticity and plasticity in semi-containerized co-locating cloud workload: A view from Alibaba trace [C] //Proc of the 9th ACM Symp on Cloud Computing. New York: ACM, 2018: 347−360
|
[81] |
Zhao Laiping, Yang Yanan, Zhang Kaixuan, et al. Rhythm: Component-distinguishable workload deployment in datacenters [C/OL] //Proc of the 15th European Conf on Computer Systems. New York: ACM, 2020[2023-01-11].https://dl.acm.org/doi/abs/10.1145/3342195.3387534
|
[82] |
Wang Liang, Li Mengyuan, Zhang Yinqian, et al. Peeking behind the curtains of serverless platforms [C] //Proc of the 23rd USENIX Annual Technical Conf. Berkeley, CA: USENIX Association, 2018: 133−146
|
[83] |
Delimitrou C, Kozyrakis C. HCloud: Resource-efficient provisioning in shared cloud systems [C] //Proc of the 21st Int Conf on Architectural Support for Programming Languages and Operating Systems. New York: ACM, 2016: 473−488
|
[84] |
Zhang Xiao, Tune E, Hagmann R, et al. CPI2: CPU performance isolation for shared compute clusters [C] //Proc of the 8th ACM European Conf on Computer Systems. New York: ACM, 2013: 379−391
|
[85] |
Ousterhout A, Fried J, Behrens J, et al. Shenango: Achieving high CPU efficiency for latency-sensitive datacenter workloads [C] //Proc of the 16th USENIX Symp on Networked Systems Design and Implementation. Berkeley, CA: USENIX Association, 2019: 361−378
|
[86] |
Fried J, Ruan Zhenyuan, Ousterhout A, et al. Caladan: Mitigating interference at microsecond timescales [C] //Proc of the 14th USENIX Symp on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2020: 281−297
|
[87] |
Chen Quan, Xue Shuai, Zhao Shang, et al. Alita: Comprehensive performance isolation through bias resource management for public clouds [C/OL] //Proc of the 33rd Int Conf for High Performance Computing, Networking, Storage and Analysis. Piscataway, NJ: IEEE, 2020[2023-01-11].https://ieeexplore.ieee.org/document/9355282
|
[88] |
Novaković D, Vasić N, Novaković S, et al. DeepDive: Transparently identifying and managing performance interference in virtualized environments [C] //Proc of the 18th USENIX Annual Technical Conf. Berkeley, CA: USENIX Association, 2013: 219−230
|
[89] |
Li Yusen, Shan Chuxu, Chen Ruobing, et al. GAugur: Quantifying performance interference of colocated games for improving resource utilization in cloud gaming [C] //Proc of the 28th Int Symp on High-Performance Parallel and Distributed Computing. New York: ACM, 2019: 231−242
|
[90] |
Li Zijun, Chen Quan, Xue Shuai, et al. Amoeba: QoS-awareness and reduced resource usage of microservices with serverless computing [C] //Proc of the 34th IEEE Int Parallel and Distributed Processing Symp. Piscataway, NJ: IEEE, 2020: 399−408
|
[91] |
Zhao Jiacheng, Cui Huimin, Xue Jingling, et al. Predicting cross-core performance interference on multicore processors with regression analysis[J]. IEEE Transactions on Parallel and Distributed Systems, 2015, 27(5): 1443−1456
|
[92] |
Wang Sa, Zhu Yanhai, Chen Shanpei, et al. A case for adaptive resource management in Alibaba datacenter using neural networks[J]. Journal of Computer Science and Technology, 2020, 35(1): 209−220 doi: 10.1007/s11390-020-9732-x
|
[93] |
李杰,张静,李伟东,等. 一种基于共享公平和时变资源需求的公平分配策略[J]. 计算机研究与发展,2019,56(7):1534−1544
Li Jie, Zhang Jing, Li Weidong, et al. A fair distribution strategy based on shared fair and time-varying resource demand[J]. Journal of Computer Research and Development, 2019, 56(7): 1534−1544 (in Chinese)
|
[94] |
王金海,黄传河,王晶,等. 异构云计算体系结构及其多资源联合公平分配策略[J]. 计算机研究与发展,2015,52(6):1288−1302
Wang Jinhai, Huang Chuanhe, Wang Jing, et al. A heterogeneous cloud computing architecture and multi-resource-joint fairness allocation strategy[J]. Journal of Computer Research and Development, 2015, 52(6): 1288−1302 (in Chinese)
|
[95] |
Ghodsi A, Zaharia M, Hindman B, et al. Dominant resource fairness: Fair allocation of multiple resource types [C] //Proc of the 8th USENIX Symp on Networked Systems Design and Implementation. Berkeley, CA: USENIX Association, 2011: 323–336
|
[96] |
Mukkara A, Beckmann N, Sanchez D. Whirlpool: Improving dynamic cache management with static data classification[J]. ACM SIGARCH Computer Architecture News, 2016, 44(2): 113−127 doi: 10.1145/2980024.2872363
|
[97] |
Qureshi M K, Patt Y N. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches [C] //Proc of the 39th Annual IEEE/ACM Int Symp on Microarchitecture. Piscataway, NJ: IEEE, 2006: 423−432
|
[98] |
El-Sayed N, Mukkara A, Tsai P A, et al. KPart: A hybrid cache partitioning-sharing technique for commodity multicores [C] //Proc of the 24th IEEE Int Symp on High Performance Computer Architecture. Piscataway, NJ: IEEE, 2018: 104−117
|
[99] |
Chen Quan, Wang Zhenning, Leng Jingwen, et al. Avalon: Towards QoS awareness and improved utilization through multi-resource management in datacenters [C] //Proc of the 33rd ACM Int Conf on Supercomputing. New York: ACM, 2019: 272−283
|
[100] |
Lin Changyuan, Khazaei H. Modeling and optimization of performance and cost of serverless applications[J]. IEEE Transactions on Parallel and Distributed Systems, 2020, 32(3): 615−632
|
[101] |
Roy R B, Patel T, Tiwari D. SATORI: Efficient and fair resource partitioning by sacrificing short-term benefits for long-term gains [C] //Proc of the 48th ACM/IEEE Annual Int Symp on Computer Architecture. Piscataway, NJ: IEEE, 2021: 292−305
|
[102] |
Chen Ruobing, Wu Jinping, Shi Haosen, et al. DRLPart: A deep reinforcement learning framework for optimally efficient and robust resource partitioning on commodity servers [C] //Proc of the 30th Int Symp on High-Performance Parallel and Distributed Computing. New York: ACM, 2021: 175−188
|
[103] |
Nishtala R, Carpenter P, Petrucci V, et al. Hipster: Hybrid task manager for latency-critical cloud workloads [C] //Proc of the 23rd IEEE Int Symp on High Performance Computer Architecture. Piscataway, NJ: IEEE, 2017: 409−420
|
[104] |
Kulkarni N, Gonzalez-Pumariega G, Khurana A, et al. CuttleSys: Data-driven resource management for interactive services on reconfigurable multicores [C] //Proc of the 53rd Annual IEEE/ACM Int Symp on Microarchitecture. Piscataway, NJ: IEEE, 2020: 650−664
|
[105] |
Zhou Hao, Chen Ming, Lin Qian, et al. Overload control for scaling Wechat microservices [C] //Proc of the 9th ACM Symp on Cloud Computing. New York: ACM, 2018: 149−161
|
[106] |
Grandl R, Chowdhury M, Akella A, et al. Altruistic scheduling in multi-resource clusters [C] //Proc of the 12th USENIX Symp on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2016: 65−80
|
[107] |
李青,李勇,涂碧波,等. QoS保证的数据中心动态资源供应方法[J]. 计算机学报,2014,37(12):2395−2407
Li Qing, Li Yong, Tu Bibo, et al. QoS-guaranteed dynamic resource provision in Internet data centers[J]. Chinese Journal of Computers, 2014, 37(12): 2395−2407 (in Chinese)
|
[108] |
Romero F, Delimitrou C. Mage: Online and interference-aware scheduling for multi-scale heterogeneous systems [C/OL] //Proc of the 27th Int Conf on Parallel Architectures and Compilation Techniques. New York: ACM, 2018[2023-01-11].https://dl.acm.org/doi/10.1145/3243176.3243183
|
[109] |
Zhao Laiping, Yang Yanan, Li Yiming, et al. Understanding, predicting and scheduling serverless workloads under partial interference [C/OL] //Proc of the 34th Int Conf for High Performance Computing, Networking, Storage and Analysis. New York: ACM, 2021[2023-01-11].https://ieeexplore.ieee.org/document/9910093
|
[110] |
Xu Ran, Mitra S, Rahman J, et al. Pythia: Improving datacenter utilization via precise contention prediction for multiple co-located workloads [C] //Proc of the 19th Int Middleware Conf. New York: ACM, 2018: 146−160
|
[111] |
Agache A, Brooker M, Iordache A, et al. Firecracker: Lightweight virtualization for serverless applications [C] //Proc of the 17th USENIX Symp on Networked Systems Design and Implementation. Berkeley, CA: USENIX Association, 2020: 419−434
|
[112] |
Ferdman M, Adileh A, Kocberber O, et al. Clearing the clouds: A study of emerging scale-out workloads on modern hardware[J]. ACM SIGPLAN Notices, 2012, 47(4): 37−48 doi: 10.1145/2248487.2150982
|
[113] |
Kaffes K, Yadwadkar N J, Kozyrakis C. Centralized core-granular scheduling for serverless functions [C] // Proc of the 10th ACM Symp on Cloud Computing. New York: ACM, 2019: 158−164
|
[1] | Cao Yiran, Zhu Youwen, He Xingyu, Zhang Yue. Utility-Optimized Local Differential Privacy Set-Valued Data Frequency Estimation Mechanism[J]. Journal of Computer Research and Development, 2022, 59(10): 2261-2274. DOI: 10.7544/issn1000-1239.20220504 |
[2] | Hong Jinxin, Wu Yingjie, Cai Jianping, Sun Lan. Differentially Private High-Dimensional Binary Data Publication via Attribute Segmentation[J]. Journal of Computer Research and Development, 2022, 59(1): 182-196. DOI: 10.7544/issn1000-1239.20200701 |
[3] | Wu Wanqing, Zhao Yongxin, Wang Qiao, Di Chaofan. A Safe Storage and Release Method of Trajectory Data Satisfying Differential Privacy[J]. Journal of Computer Research and Development, 2021, 58(11): 2430-2443. DOI: 10.7544/issn1000-1239.2021.20210589 |
[4] | Zhang Yuxuan, Wei Jianghong, Li Ji, Liu Wenfen, Hu Xuexian. Graph Degree Histogram Publication Method with Node-Differential Privacy[J]. Journal of Computer Research and Development, 2019, 56(3): 508-520. DOI: 10.7544/issn1000-1239.2019.20170886 |
[5] | Zhu Weijun, You Qingguang, Yang Weidong, Zhou Qinglei. Trajectory Privacy Preserving Based on Statistical Differential Privacy[J]. Journal of Computer Research and Development, 2017, 54(12): 2825-2832. DOI: 10.7544/issn1000-1239.2017.20160647 |
[6] | Wu Yingjie, Zhang Liqun, Kang Jian, Wang Yilei. An Algorithm for Differential Privacy Streaming Data Adaptive Publication[J]. Journal of Computer Research and Development, 2017, 54(12): 2805-2817. DOI: 10.7544/issn1000-1239.2017.20160555 |
[7] | Wang Liang, Wang Weiping, Meng Dan. Privacy Preserving Data Publishing via Weighted Bayesian Networks[J]. Journal of Computer Research and Development, 2016, 53(10): 2343-2353. DOI: 10.7544/issn1000-1239.2016.20160465 |
[8] | Lu Guoqing, Zhang Xiaojian, Ding Liping, Li Yanfeng, Liao Xin. Frequent Sequential Pattern Mining under Differential Privacy[J]. Journal of Computer Research and Development, 2015, 52(12): 2789-2801. DOI: 10.7544/issn1000-1239.2015.20140516 |
[9] | Ouyang Jia, Yin Jian, Liu Shaopeng, Liu Yubao. An Effective Differential Privacy Transaction Data Publication Strategy[J]. Journal of Computer Research and Development, 2014, 51(10): 2195-2205. DOI: 10.7544/issn1000-1239.2014.20130824 |
[10] | Ni Weiwei, Chen Geng, Chong Zhihong, Wu Yingjie. Privacy-Preserving Data Publication for Clustering[J]. Journal of Computer Research and Development, 2012, 49(5): 1095-1104. |