High Performance Artificial Intelligence Data Center: Studies, Challenges and Trends
-
-
Abstract
Artificial intelligence data centers (AIDCs) are the new form of data center nowadays. Compared with traditional cloud data centers that mainly provide virtualization of computing resources, AIDCs mainly deliver services for new types of high-performance computing businesses represented by artificial intelligence. This paper makes an in-depth explanation and analysis of the AIDC network business requirements, topology, communication patterns, and traffic characteristics. This paper analyzes each of the new issues and challenges brought about by those unique characteristics. Then, according to the network layering model, this paper sorts out the key technology framework of high performance AIDC network including collective communication, transmission control, load balancing, data link flow control and incident management with a detailed summary of existing studies. Finally, based on the analysis of the current technology development, this paper proposes the future development trends such as the integration of AIDCs and general computing data centers, the specialization of AIDCs technology, and the multi-tenancy of computing power provided by AIDCs.
-
-