高级检索
    吴佳骏, 冯晓兵, 张兆庆. 非线性规律访存操作的数据预取技术[J]. 计算机研究与发展, 2007, 44(2): 355-360.
    引用本文: 吴佳骏, 冯晓兵, 张兆庆. 非线性规律访存操作的数据预取技术[J]. 计算机研究与发展, 2007, 44(2): 355-360.
    Wu Jiajun, Feng Xiaobing, Zhang Zhaoqing. Data Prefetching Technique of Nonlinear Memory Access[J]. Journal of Computer Research and Development, 2007, 44(2): 355-360.
    Citation: Wu Jiajun, Feng Xiaobing, Zhang Zhaoqing. Data Prefetching Technique of Nonlinear Memory Access[J]. Journal of Computer Research and Development, 2007, 44(2): 355-360.

    非线性规律访存操作的数据预取技术

    Data Prefetching Technique of Nonlinear Memory Access

    • 摘要: 编译器在静态分析方式下很难对程序的非线性规律访存操作进行正确的数据预取.但采用profiling技术可以得到程序运行时候的访存规律,利用这些信息可以精确地插入数据预取指令.基于stride profiling技术,提出了新的信息收集类型stride iterative,更精确地反映程序执行时访存指令的实际行为,并结合别名分析的结果调整对同一cache行的数据预取,得到比普通数据预取更好的预取性能.安腾2上运行CPU2000的12个整型测试例子平均有8.54%的性能提升,其中mcf性能提升达到了77.87%.

       

      Abstract: By static analysis, the compiler can hardly correctly prefetch data that are nonlinear accessed. But by using profiling techniques one can get the regulation by which the program accesses memory, and then by using these profiling information, the compiler is guided to accurately insert prefetch instructions. Based on stride profiling technique, a new information type named stride iterative is put forward, which is more accurate than normal profiling. Together with the alias information to adjust the data prefetch for the same cache line, the compiler gets a better performance than the normal data prefetch. The CPU2000 12 INT cases get 8.54% performance improvement on the average, and mcf gets an 77.87% performance increase.

       

    /

    返回文章
    返回