Cross Face-Voice Matching via Double-Stream Networks and Bi-Quintuple Loss
Liu Xin1,2,3, Wang Rui1,3, Zhong Bineng4, Wang Nannan2
1(College of Computer Science and Technology, Huaqiao University, Xiamen, Fujian 361021);2(State Key Laboratory of Integrated Services Networks (Xidian University), Xi’an 710071);3(Xiamen Key Laboratory of Computer Vision and Pattern Recognition (Huaqiao University), Xiamen, Fujian 361021);4(School of Computer Science and Information Engineering, Guangxi Normal University, Guilin, Guangxi 541004)
Online:2022-03-07
Supported by:
This work was supported by the National Natural Science Foundation of China (61673185, 61922066, 61972167), the Project of State Key Laboratory of Integrated Services Networks (ISN20-11), the Natural Science Foundation of Fujian Province (2020J01084), and the Zhejiang Laboratory (2021KH0AB01).
Liu Xin, Wang Rui, Zhong Bineng, Wang Nannan. Cross Face-Voice Matching via Double-Stream Networks and Bi-Quintuple Loss[J]. Journal of Computer Research and Development, 2022, 59(3): 694-705.