• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Ouyang Yiming, Sun Chenglong, Li Jianhua, Liang Huaguo, Huang Zhengfeng, Du Gaoming. Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method[J]. Journal of Computer Research and Development, 2017, 54(5): 1109-1120. DOI: 10.7544/issn1000-1239.2017.20151017
Citation: Ouyang Yiming, Sun Chenglong, Li Jianhua, Liang Huaguo, Huang Zhengfeng, Du Gaoming. Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method[J]. Journal of Computer Research and Development, 2017, 54(5): 1109-1120. DOI: 10.7544/issn1000-1239.2017.20151017

Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method

More Information
  • Published Date: April 30, 2017
  • As the link is the critical path between routers in NoC,it will seriously affect the network performance when faults occur in the link. For this reason, we propose a high reliable fault-tolerant method addressing transient and intermittent link faults. The method can detect real-time data error occurring in the network, and then define that whether the fault is transient fault or intermittent fault, thereby realizing fault-tolerance. As a result, it not only alleviates the network congestion and decreases the data delay, but also ensures the correct transmission of data, effectively guaranteeing the high reliability of the system. It is well known that when a transient fault occurs in the link, the fault link will result in a data error, which cannot be corrected properly. Therefore, the proposed method set up the retransmission buffer and then the backup data will be retransmitted. If an intermittent fault occurs, the packet transmission is truncated. To solve this problem, the proposed method adds a pseudo head flit and a pseudo tail flit to the truncated data, then re-routing begins and the occupied resource is released. Experimental results show that, in different fault conditions, this method outperforms the comparison objects with significant reduction in average packet latency and obvious improvement in throughput. In a word, this scheme can effectively improve network reliability in addition to ensuring network performance.
  • Related Articles

    [1]Gao Jiangang, Hu Jin, Gong Daoyong, Fang Yanfei, Liu Xiao, He Wangquan, Jin Lifeng, Zheng Fang, Li Hongliang. Design and Analysis of Reliability and Availability on Sunway TaihuLight[J]. Journal of Computer Research and Development, 2021, 58(12): 2696-2707. DOI: 10.7544/issn1000-1239.2021.20200967
    [2]Duan Wenxue, Hu Ming, Zhou Qiong, Wu Tingming, Zhou Junlong, Liu Xiao, Wei Tongquan, Chen Mingsong. Reliability in Cloud Computing System: A Review[J]. Journal of Computer Research and Development, 2020, 57(1): 102-123. DOI: 10.7544/issn1000-1239.2020.20180675
    [3]Li Junfei, Hu Yuxiang, Wu Jiangxing. Research on Improving the Control Plane’s Reliability in SDN Based on Byzantine Fault-Tolerance[J]. Journal of Computer Research and Development, 2017, 54(5): 952-960. DOI: 10.7544/issn1000-1239.2017.20160055
    [4]Lou Jungang, Jiang Jianhui, Shen Zhangguo, Jiang Yunliang. Software Reliability Prediction Modeling with Relevance Vector Machine[J]. Journal of Computer Research and Development, 2013, 50(7): 1542-1550.
    [5]Zhang Hongcan and Xue Wei. Reliability Analysis of Cluster RAID5 Storage System[J]. Journal of Computer Research and Development, 2010, 47(4): 727-735.
    [6]Huang Yongqin, Jin Lifeng, and Liu Yao. Current Situation and Trend of Reliability Technology in High Performance Computers[J]. Journal of Computer Research and Development, 2010, 47(4): 589-594.
    [7]Ren Xiaoxi, Li Renfa, Jin Shengzhen, Zhang Kehuan, Wu Qiang. Research on Reliability of a Reconfigurable Data Processing System Based on JBits[J]. Journal of Computer Research and Development, 2007, 44(4): 722-728.
    [8]Zhou Xuehai, Yu Jie, Li Xi, and Wand Zhigang. Research on Reliability Evaluation of Cache Based on Instruction Behavior[J]. Journal of Computer Research and Development, 2007, 44(4): 553-559.
    [9]Zhao Jing, Liu Hongwei, Cui Gang, and Yang Xiaozong. A Software Reliability Growth Model Considering Testing Environment and Actual Operation Environment[J]. Journal of Computer Research and Development, 2006, 43(5): 881-887.
    [10]Zhao Jing, Liu Hongwei, Cui Gang, and Yang Xiaozong. A Software Reliability Growth Model Considering Differences Between Testing and Operation[J]. Journal of Computer Research and Development, 2006, 43(3): 503-508.

Catalog

    Article views (1647) PDF downloads (648) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return