• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

针对瞬时故障和间歇性故障的NoC链路容错方法

欧阳一鸣, 孙成龙, 李建华, 梁华国, 黄正峰, 杜高明

欧阳一鸣, 孙成龙, 李建华, 梁华国, 黄正峰, 杜高明. 针对瞬时故障和间歇性故障的NoC链路容错方法[J]. 计算机研究与发展, 2017, 54(5): 1109-1120. DOI: 10.7544/issn1000-1239.2017.20151017
引用本文: 欧阳一鸣, 孙成龙, 李建华, 梁华国, 黄正峰, 杜高明. 针对瞬时故障和间歇性故障的NoC链路容错方法[J]. 计算机研究与发展, 2017, 54(5): 1109-1120. DOI: 10.7544/issn1000-1239.2017.20151017
Ouyang Yiming, Sun Chenglong, Li Jianhua, Liang Huaguo, Huang Zhengfeng, Du Gaoming. Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method[J]. Journal of Computer Research and Development, 2017, 54(5): 1109-1120. DOI: 10.7544/issn1000-1239.2017.20151017
Citation: Ouyang Yiming, Sun Chenglong, Li Jianhua, Liang Huaguo, Huang Zhengfeng, Du Gaoming. Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method[J]. Journal of Computer Research and Development, 2017, 54(5): 1109-1120. DOI: 10.7544/issn1000-1239.2017.20151017
欧阳一鸣, 孙成龙, 李建华, 梁华国, 黄正峰, 杜高明. 针对瞬时故障和间歇性故障的NoC链路容错方法[J]. 计算机研究与发展, 2017, 54(5): 1109-1120. CSTR: 32373.14.issn1000-1239.2017.20151017
引用本文: 欧阳一鸣, 孙成龙, 李建华, 梁华国, 黄正峰, 杜高明. 针对瞬时故障和间歇性故障的NoC链路容错方法[J]. 计算机研究与发展, 2017, 54(5): 1109-1120. CSTR: 32373.14.issn1000-1239.2017.20151017
Ouyang Yiming, Sun Chenglong, Li Jianhua, Liang Huaguo, Huang Zhengfeng, Du Gaoming. Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method[J]. Journal of Computer Research and Development, 2017, 54(5): 1109-1120. CSTR: 32373.14.issn1000-1239.2017.20151017
Citation: Ouyang Yiming, Sun Chenglong, Li Jianhua, Liang Huaguo, Huang Zhengfeng, Du Gaoming. Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method[J]. Journal of Computer Research and Development, 2017, 54(5): 1109-1120. CSTR: 32373.14.issn1000-1239.2017.20151017

针对瞬时故障和间歇性故障的NoC链路容错方法

基金项目: 国家自然科学基金项目(61474036,61274036,61371025,61574052);国家自然科学基金青年科学基金项目(61402145);安徽省自然科学基金青年基金项目(1508085QF138);安徽省自然科学基金项目(1508085MF117)
详细信息
  • 中图分类号: TP302

Addressing Transient and Intermittent Link Faults in NoC with Fault-Tolerant Method

  • 摘要: 片上网络中链路是路由器之间连接的关键通路,其发生故障将严重影响网络性能.针对这一问题,提出了一种针对瞬时和间歇性故障的高可靠链路容错方法,该方法可以在网络中实时检测数据是否发生错误,并以此定义瞬时故障和间歇性故障,从而进行容错.在减轻网络拥塞和延时的同时,保证了数据的正确传输,有效保障了系统的高可靠性.当链路中发生瞬时故障导致数据出错且不能正确纠正时,通过设置的重传缓冲区内备份的数据重新进行传输.当链路中发生间歇性故障导致数据出错且不能正确纠正时,数据包传输被截断,对被截断的数据重新添加头微片或尾微片,从而进行重新路由或资源释放.实验结果表明:该容错方法在不同故障情况下较对比对象,均较大地降低了延时,提高了吞吐率,该方法能有效地提高网络的可靠性,保证了系统性能.
    Abstract: As the link is the critical path between routers in NoC,it will seriously affect the network performance when faults occur in the link. For this reason, we propose a high reliable fault-tolerant method addressing transient and intermittent link faults. The method can detect real-time data error occurring in the network, and then define that whether the fault is transient fault or intermittent fault, thereby realizing fault-tolerance. As a result, it not only alleviates the network congestion and decreases the data delay, but also ensures the correct transmission of data, effectively guaranteeing the high reliability of the system. It is well known that when a transient fault occurs in the link, the fault link will result in a data error, which cannot be corrected properly. Therefore, the proposed method set up the retransmission buffer and then the backup data will be retransmitted. If an intermittent fault occurs, the packet transmission is truncated. To solve this problem, the proposed method adds a pseudo head flit and a pseudo tail flit to the truncated data, then re-routing begins and the occupied resource is released. Experimental results show that, in different fault conditions, this method outperforms the comparison objects with significant reduction in average packet latency and obvious improvement in throughput. In a word, this scheme can effectively improve network reliability in addition to ensuring network performance.
  • 期刊类型引用(11)

    1. 李萍,刘金金. 基于改进模糊聚类算法的大数据随机挖掘仿真. 计算机仿真. 2024(02): 496-499+521 . 百度学术
    2. 李来存. 基于物联网技术的信息系统数据存储系统. 信息技术. 2024(05): 120-126+132 . 百度学术
    3. 何芳州,王祉淇. 知识图谱特征重构下无线传感网络数据存储恢复. 传感技术学报. 2024(07): 1265-1270 . 百度学术
    4. 万晓云,张泰,程妍. 基于弹性空间模型的实验室网络数据存储算法. 计算机仿真. 2024(09): 368-371+428 . 百度学术
    5. 梁志宏. 电力异构数据集群存储动态副本选择系统. 电子设计工程. 2024(24): 105-109 . 百度学术
    6. 孙淳晔,庞亚南,邓芳. 分布式存储在运营商中的应用与研究. 广东通信技术. 2023(02): 71-74 . 百度学术
    7. 谢振杰,付伟. 基于可审计多副本的云存储差错副本恢复机制. 计算机应用. 2023(04): 1102-1108 . 百度学术
    8. 姜宇鸣,周益民. 海量机载激光点云数据分布式分片存储方法研究. 电子器件. 2023(04): 978-983 . 百度学术
    9. 辛明勇,祝健杨,徐长宝,姚浩,刘德宏. 基于循环神经网络的多核处理器层次化存储技术. 电子设计工程. 2023(22): 121-124+129 . 百度学术
    10. 梁杨,丁长松,胡志刚. 基于“推荐-学习”的两阶段数据布局策略. 南京师大学报(自然科学版). 2023(04): 80-90 . 百度学术
    11. 白亮,郭新营,潘旭东,叶德力·波拉提,古再奴尔·艾再孜. 基于大数据的信息系统资源利用率人工智能预测方法. 电力大数据. 2022(06): 43-48 . 百度学术

    其他类型引用(8)

计量
  • 文章访问数:  1647
  • HTML全文浏览量:  1
  • PDF下载量:  648
  • 被引次数: 19
出版历程
  • 发布日期:  2017-04-30

目录

    /

    返回文章
    返回