• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Jia Jia, Yang Xuejun, Li Zhiling. A Redundancy-Multithread-Based Multiple GPU Copies Fault-Tolerance Technique[J]. Journal of Computer Research and Development, 2013, 50(7): 1551-1562.
Citation: Jia Jia, Yang Xuejun, Li Zhiling. A Redundancy-Multithread-Based Multiple GPU Copies Fault-Tolerance Technique[J]. Journal of Computer Research and Development, 2013, 50(7): 1551-1562.

A Redundancy-Multithread-Based Multiple GPU Copies Fault-Tolerance Technique

More Information
  • Published Date: July 14, 2013
  • With the increasing of GPGPU's performance, heterogeneous systems that consist of CPUs and GPUs are becoming attractive research hotspots in high-performance computing fields. However, as higher performance is achieved, lower reliability becomes the bottleneck of parallel computing systems that scales up to large size. Since commercial GPGPUs have low fault-tolerance ability, the reliability problem is very acute and lack of practical fault-tolerance solutions in CPU-GPU heterogeneous systems. To address this problem, this paper proposes a redundancy-multithread-based multiple GPU copies fault-tolerance technique: RB-TMR. Towards the programming model of heterogeneous system and the characterization of application, detailed realization and the compiling framework of this fault-tolerance technique for heterogeneous systems are given. In experiments, 10 cases are performed to evaluate this technique's performance, and the results demonstrated that this technique exhibits a novel direction to study fault-tolerance techniques in heterogeneous systems.
  • Related Articles

    [1]Hu Jun, Chen Yan, Zhang Qinghua, Wang Guoyin. Optimal Scale Selection for Generalized Multi-Scale Set-Valued Decision Systems[J]. Journal of Computer Research and Development, 2022, 59(9): 2027-2038. DOI: 10.7544/issn1000-1239.20210196
    [2]Wang Nian, Peng Zhenghong, Cui Li. EasiFFRA: A Fast Feature Reduction Algorithm Based on Neighborhood Rough Set[J]. Journal of Computer Research and Development, 2019, 56(12): 2578-2588. DOI: 10.7544/issn1000-1239.2019.20180541
    [3]Xie Qin, Zhang Qinghua, Wang Guoyin. An Adaptive Three-way Spam Filter with Similarity Measure[J]. Journal of Computer Research and Development, 2019, 56(11): 2410-2423. DOI: 10.7544/issn1000-1239.2019.20180793
    [4]Wu Weizhi, Yang Li, Tan Anhui, Xu Youhong. Granularity Selections in Generalized Incomplete Multi-Granular Labeled Decision Systems[J]. Journal of Computer Research and Development, 2018, 55(6): 1263-1272. DOI: 10.7544/issn1000-1239.2018.20170233
    [5]Yao Sheng, Xu Feng, Zhao Peng, Ji Xia. Intuitionistic Fuzzy Entropy Feature Selection Algorithm Based on Adaptive Neighborhood Space Rough Set Model[J]. Journal of Computer Research and Development, 2018, 55(4): 802-814. DOI: 10.7544/issn1000-1239.2018.20160919
    [6]Fu Zhiyao, Gao Ling, Sun Qian, Li Yang, Gao Ni. Evaluation of Vulnerability Severity Based on Rough Sets and Attributes Reduction[J]. Journal of Computer Research and Development, 2016, 53(5): 1009-1017. DOI: 10.7544/issn1000-1239.2016.20150065
    [7]Duan Jie, Hu Qinghua, Zhang Lingjun, Qian Yuhua, Li Deyu. Feature Selection for Multi-Label Classification Based on Neighborhood Rough Sets[J]. Journal of Computer Research and Development, 2015, 52(1): 56-65. DOI: 10.7544/issn1000-1239.2015.20140544
    [8]Hu Xiaojian, Yang Shanlin, Hu Xiaoxuan, Fang Fang. Optimal Decomposition of Decision Table Systems Based on Bayesian Networks[J]. Journal of Computer Research and Development, 2007, 44(4): 667-673.
    [9]Wei Lai, Miao Duoqian, Xu Feifei, and Xia Fuchun. Research on a Covering Rough Fuzzy Set Model[J]. Journal of Computer Research and Development, 2006, 43(10): 1719-1723.
    [10]Yi Gaoxiang and Hu Heping. A Web Search Result Clustering Based on Tolerance Rough Set[J]. Journal of Computer Research and Development, 2006, 43(2): 275-280.

Catalog

    Article views (1067) PDF downloads (440) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return