高级检索
    李胜梅, 程步奇, 高兴誉, 乔 林, 汤志忠. 主成分线性回归模型分析应用程序性能[J]. 计算机研究与发展, 2009, 46(11): 1949-1955.
    引用本文: 李胜梅, 程步奇, 高兴誉, 乔 林, 汤志忠. 主成分线性回归模型分析应用程序性能[J]. 计算机研究与发展, 2009, 46(11): 1949-1955.
    Li Shengmei, Cheng buqi, Gao Xingyu, Qiao Lin, Tang Zhizhong. Principal Component Linear Regression Analysis on Performance of Applications[J]. Journal of Computer Research and Development, 2009, 46(11): 1949-1955.
    Citation: Li Shengmei, Cheng buqi, Gao Xingyu, Qiao Lin, Tang Zhizhong. Principal Component Linear Regression Analysis on Performance of Applications[J]. Journal of Computer Research and Development, 2009, 46(11): 1949-1955.

    主成分线性回归模型分析应用程序性能

    Principal Component Linear Regression Analysis on Performance of Applications

    • 摘要: 应用程序的性能分析能够给体系架构设计者和性能优化者提供有效的参考和指导.采用主成分线性回归模型分析了SPEC CPU2006的整型程序性能.模型选取性能监测单元采样到的事件为自变量,每条指令的时钟周期数(CPI)作为因变量.模型中采用主成分分析法消除了性能事件之间的相关性.实验结果表明,模型的拟合优度在90%以上,对性能进行预测的平均相对误差为15%.模型从量化上分析了L1,L2高速缓存缺失作为影响性能的关键因素是怎样影响程序性能的.

       

      Abstract: The factors influencing application performance are various and the extents of influence are different. Analyzing and distinguishing the extents of influence caused by various factors can guide the architects in the architecture design and help programmers in the optimization. However, it is not easy to distinguish the extents of influence because the factors may correlate each other themselves. In this paper, a principal component linear regression model aiming at performance of SPEC CPU2006 integer benchmarks is set up. Cycles per instruction(CPI) is used to represent the application performance and the performance events monitored by performance monitor unit (PMU) are used to represent the influencing factors. Principal component analysis is implemented to eliminate the linear correlation among performance events. Then linear regression model is set up which uses CPI as the dependent variable and principal components as the independent variables. This model can analyze the influence on CPI caused by the performance events i.e. L1 data cache miss, L2 cache miss, DTLB miss, branch mis-prediction, micro-fusion, memory disambiguation events quantitatively. The model is validated by the t test and F test with goodness of fit over 90%. The average relative prediction error of the model is 15%. The results show quantitatively how L1 and L2 cache misses dominate the performance of the applications.

       

    /

    返回文章
    返回