高级检索
    刘 磊, 黄 河, 唐志敏. 支持多核并行程序确定性重放的高效访存冲突记录方法[J]. 计算机研究与发展, 2012, 49(1): 64-75.
    引用本文: 刘 磊, 黄 河, 唐志敏. 支持多核并行程序确定性重放的高效访存冲突记录方法[J]. 计算机研究与发展, 2012, 49(1): 64-75.
    Liu Lei, Huang He, Tang Zhimin. High Efficient Memory Race Recording Scheme for Parallel Program Deterministic Replay Under Multi-Core Architecture[J]. Journal of Computer Research and Development, 2012, 49(1): 64-75.
    Citation: Liu Lei, Huang He, Tang Zhimin. High Efficient Memory Race Recording Scheme for Parallel Program Deterministic Replay Under Multi-Core Architecture[J]. Journal of Computer Research and Development, 2012, 49(1): 64-75.

    支持多核并行程序确定性重放的高效访存冲突记录方法

    High Efficient Memory Race Recording Scheme for Parallel Program Deterministic Replay Under Multi-Core Architecture

    • 摘要: 多核系统中并行程序执行过程的不确定性给程序调试带来了很大的困难.准确记录初始执行中冲突访存的次序是并行程序确定性重放的基础.提出了通过建立精确happens-before关系记录访存冲突的方法.此方法利用简洁高效的地址冲突检测机制确定冲突访存操作在执行中所处happens-before序关系的位置,可以抑制部分记录信息的产生,从而有效减少记录信息.与其他方式方法相比,可以进一步压缩17%的记录条数.采用逻辑向量时钟描述冲突访存操作间的happens-before关系,与采用标量时钟相比,可以避免happens-before关系的误识,降低重放执行时并行度的损失.

       

      Abstract: Current shared memory multi-core and multiprocessor systems are nondeterministic. When these systems execute a multithreaded application, even if supplied with the same input, they could produce a different output each time. It frustrates debugging and limits the ability to properly test multithreaded code, and is becoming a major stumbling block to the much-needed widespread adoption of parallel programming. The support for deterministic replay of multithreaded execution is greatly helpful in finding concurrency bugs. A memory race recording scheme, named Rainbow, is proposed. Its core idea is to make inter-thread communications fully deterministic. The unique feature of Rainbow is that it precisely sets up happens-before relationships between conflicting memory operations among different threads. By using effective, bloom-filter based, coherence history queue, Rainbow removes redundant happens-before relations implied in the already generated log and enables a compact log. Rainbow adds the modest hardware to the base multi-core processors, and the coherence protocol is unmodified. The analysis results show that Rainbow reduces the log size by 17% of a state-of-the-art scheme, and the records execution speed is similar to that of release consistency (RC) execution and replays at about 93% of its speed. The determinism can be provided with little performance cost using our architecture proposals on the state-of-the-art hardware, and the software-only approaches can be utilized on existing systems without problem.

       

    /

    返回文章
    返回