    Citation: Song Wei, Yang Xuejun. Error Detection by Redundant Transaction in Transactional Memory System[J]. Journal of Computer Research and Development, 2013, 50(5): 1085-1099.

    Error Detection by Redundant Transaction in Transactional Memory System

      Abstract: This paper addresses error detection in transactional memory systems and proposes a method called error detection by redundant transaction (EDRT). The method creates a replica of every transaction, runs the original and the replica concurrently on spare processor cores, and detects errors by comparing their execution results. To obtain the data set to compare, EDRT reuses the version-management mechanism of the transactional memory system itself, which yields a near-minimal comparison set online and transparently to the user. The paper applies EDRT to log-based transactional memory and proposes solutions to the design problems of a fault-tolerant log-based transactional memory. EDRT is validated with five test programs, four of which are typical SPLASH-2 benchmarks. The experimental results show an average error-detection overhead of about 3.68% relative to the whole program and about 12.07% relative to the transactional parts of the program. Compared with the dual modular redundancy error detection mechanism (DMR), the average ratio of the EDRT detection overhead to the DMR detection overhead is only about 0.05%.
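      The detection idea in the abstract (run a transaction and its replica, then compare what each would write) can be illustrated with a minimal software sketch. This is not the paper's implementation: the paper targets log-based transactional memory, whereas the sketch assumes a simple per-transaction write buffer, and the names `TxContext`, `run_with_redundancy`, `ErrorDetected`, `transfer`, and the `inject_fault` hook are all hypothetical, introduced only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


class ErrorDetected(Exception):
    """Raised when the original and replica write sets disagree."""


class TxContext:
    """Hypothetical transaction context that buffers writes in a log."""

    def __init__(self, memory):
        self._memory = memory   # committed state, only read during execution
        self.write_log = {}     # address -> new value (the write set)

    def read(self, addr):
        # A transaction sees its own buffered writes before committed state.
        if addr in self.write_log:
            return self.write_log[addr]
        return self._memory.get(addr, 0)

    def write(self, addr, value):
        self.write_log[addr] = value


def run_with_redundancy(memory, tx_body, inject_fault=None):
    """Run tx_body as an original and a replica; commit only if they agree.

    inject_fault is a test-only hook (an assumption of this sketch) that
    corrupts the original's write log to simulate a transient error.
    """
    original = TxContext(memory)
    replica = TxContext(memory)

    # Stand-in for "spare cores": two worker threads run the same body.
    with ThreadPoolExecutor(max_workers=2) as pool:
        futures = [pool.submit(tx_body, original),
                   pool.submit(tx_body, replica)]
        for future in futures:
            future.result()     # re-raise any exception from the body

    if inject_fault is not None:
        inject_fault(original)

    # Only the write sets are compared, not the whole memory image.
    if original.write_log != replica.write_log:
        raise ErrorDetected("original and replica write sets differ")

    memory.update(original.write_log)   # commit
    return dict(original.write_log)


def transfer(tx):
    """Example transaction body: move 10 units from 'a' to 'b'."""
    a = tx.read("a")
    b = tx.read("b")
    tx.write("a", a - 10)
    tx.write("b", b + 10)


if __name__ == "__main__":
    mem = {"a": 100, "b": 0}
    run_with_redundancy(mem, transfer)
    print(mem)
```

      Comparing only the buffered write sets mirrors the abstract's point that the versioning data a transactional memory already keeps can serve as a small comparison set. A mismatch leaves committed memory untouched, so the transaction could simply be retried.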

       
