Zhang Qiang, Yang Jibin, Zhang Xiongwei, Cao Tieyong, Zheng Changyan. CS-Softmax: A Cosine Similarity-Based Softmax Loss Function[J]. Journal of Computer Research and Development, 2022, 59(4): 936-949. DOI: 10.7544/issn1000-1239.20200879

CS-Softmax: A Cosine Similarity-Based Softmax Loss Function

Funds: This work was supported by the National Natural Science Foundation of China (61602031), the Fundamental Research Funds for the Central Universities (FRF-BD-19-012A, FRF-IDRY-19-023), and the National Key 
  • Published Date: March 31, 2022
  • Classification frameworks based on convolutional neural networks (CNNs) have achieved strong results in pattern classification tasks, where the Softmax function combined with the cross-entropy loss (Softmax loss) lets CNNs learn separable embeddings. However, for some multi-class classification problems, training with the Softmax loss does not explicitly encourage intra-class compactness or inter-class separability, so the learned embeddings are only weakly discriminative and performance is hard to improve further. To enhance the discriminability of the learned embeddings, a cosine similarity-based Softmax (CS-Softmax) loss function is proposed. Without changing the network structure, the CS-Softmax loss extends the Softmax loss with parameters such as a margin factor, a scale factor, and a weight update factor, which are used to compute the positive and negative similarities between embeddings and the different class weights, thereby enhancing intra-class compactness and inter-class separability. The size of the classification decision margin can also be adjusted flexibly. These characteristics further enhance the discriminability of the embeddings learned by CNNs. Classification experiments on typical audio and image datasets show that the CS-Softmax loss effectively improves classification performance without increasing computational complexity. The proposed loss achieves classification accuracies of 99.81%, 95.46%, and 76.46% on the MNIST, CIFAR10, and CIFAR100 classification tasks, respectively.
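The abstract describes a Softmax loss computed on cosine similarities between embeddings and class weights, controlled by a margin factor and a scale factor. The NumPy sketch below illustrates that general idea with a generic additive-margin cosine Softmax; it is not the exact CS-Softmax formulation from the paper, and it omits the weight update factor, which the abstract does not define. The function name `cosine_margin_softmax_loss`, the parameter names `scale` and `margin`, and their default values are assumptions made for illustration.

```python
import numpy as np

def cosine_margin_softmax_loss(embeddings, weights, labels, scale=16.0, margin=0.2):
    """Mean cross-entropy over scaled cosine logits, with an additive margin
    applied to the target class (illustrative sketch, not the paper's formula)."""
    # L2-normalise rows so that dot products become cosine similarities.
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    w = weights / np.linalg.norm(weights, axis=1, keepdims=True)
    cos = e @ w.T  # shape (batch, classes), values in [-1, 1]

    # Subtract the margin from the positive (target-class) similarity only,
    # which forces the target similarity to exceed the others by a gap.
    rows = np.arange(len(labels))
    logits = cos.copy()
    logits[rows, labels] -= margin
    logits *= scale  # the scale factor sharpens the Softmax distribution

    # Numerically stable log-softmax followed by cross-entropy.
    logits -= logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_probs[rows, labels].mean()

# Usage with random data: 8 embeddings of dimension 4, 3 classes.
rng = np.random.default_rng(0)
emb = rng.normal(size=(8, 4))
W = rng.normal(size=(3, 4))
y = rng.integers(0, 3, size=8)
loss = cosine_margin_softmax_loss(emb, W, y)
```

Because both embeddings and class weights are normalised, the loss depends only on angles, and a larger `margin` makes the objective strictly harder for the same inputs, which is one way to push embeddings of a class closer to their own class weight than to any other.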