Processing math: 100%
  • 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Lang Xuancong, Li Chunsheng, Liu Yong, Wang Mei. Regret Bounds for Online Pairwise Learning with Non-Convex Loss Functions Using Stability Analysis[J]. Journal of Computer Research and Development, 2023, 60(12): 2806-2813. DOI: 10.7544/issn1000-1239.202220221
Citation: Lang Xuancong, Li Chunsheng, Liu Yong, Wang Mei. Regret Bounds for Online Pairwise Learning with Non-Convex Loss Functions Using Stability Analysis[J]. Journal of Computer Research and Development, 2023, 60(12): 2806-2813. DOI: 10.7544/issn1000-1239.202220221

Regret Bounds for Online Pairwise Learning with Non-Convex Loss Functions Using Stability Analysis

Funds: This work was supported by the National Natural Science Foundation of China (51774090, 62076234), the Heilongjiang Provincial Postdoctoral Research Startup Fund Project (LBH-Q20080), the Natural Science Foundation of Heilongjiang Province (LH2020F003), and the Basic Research Fund Project of Universities in Heilongjiang Province (KYCXTD201903, YYYZX202105).
More Information
  • Author Bio:

    Lang Xuancong: born in 1998. PhD candidate. Student member of CCF. Her main research interest includes large-scale machine learning

    Li Chunsheng: born in 1960. PhD, professor, PhD supervisor. Member of CCF. His main research interests include data mining and agent systems, multi-agent theory and application, and intelligent architecture

    Liu Yong: born in 1986. PhD, accociate professor, PhD supervisor. His main research interests include machine learning, large-scale machine learning, and statistical learning theory

    Wang Mei: born in 1976. PhD, professor, master supervisor. Member of CCF. Her main research interests include machine learning, kernel methods, and model selection

  • Received Date: March 13, 2022
  • Revised Date: January 29, 2023
  • Available Online: September 19, 2023
  • Pairwise learning refers to a learning task which involves a loss function depending on pairs of instances. Recently, there is a growing interest in studying pairwise learning since it includes many important machine learning tasks as specific examples, e.g., metric learning, AUC maximization and ranking. Regret bounds are particularly important for generalization analysis of online pairwise learning. The existing online pairwise learning analysis provides regret bounds only with convex loss functions. To fill the gap in the theoretical study of online pairwise learning with non-convex loss functions, we present a systematic study on the generalization analysis for online pairwise learning and propose regret bounds for non-convex online pairwise learning in this paper. We consider online learning in an adversarial, non-convex setting under the assumption that the learner has access to an offline optimization oracle and the learner’s prediction with expert advice. We first propose a general online pairwise learning framework and establish the stability of online pairwise learning with non-convex loss functions. Then, the regret bounds can be derived naturally from stability. Finally, we show that the general online pairwise learning framework with non-convex loss functions achieves optimal regret bounds of O(T1/2) when the learner has access to an offline optimization oracle.

  • [1]
    李志杰,李元香,王峰,等. 面向大数据分析的在线学习算法综述[J]. 计算机研究与发展,2015,52(8):1707−1721

    Li Zhijie, Li Yuanxiang, Wang Feng, et al. Online learning algorithms for big data analytics: A survey[J]. Journal of Computer Research and Development, 2015, 52(8): 1707−1721 (in Chinese)
    [2]
    Clémençon S, Lugosi G, Vayatis N. Ranking and empirical minimization of U-statistics[J]. The Annals of Statistics, 2008, 36(2): 844−874
    [3]
    Agarwal S, Niyogi P. Generalization bounds for ranking algorithms via algorithmic stability[J]. Journal of Machine Learning Research, 2009, 10(2): 441−474
    [4]
    Rejchel W. On ranking and generalization bounds[J]. Journal of Machine Learning Research, 2012, 13(5): 1373−1392
    [5]
    Rejchel W. Fast rates for ranking with large families[J]. Neurocomputing, 2015, 168: 1104−1110 doi: 10.1016/j.neucom.2015.05.013
    [6]
    Zhao Peilin, Hoi S, Jin Rong, et al. Online AUC maximization[C] //Proc of the 28th Int Conf on Machine Learning. New York: ACM, 2011: 233−240
    [7]
    Weinberger K Q, Saul L K. Distance metric learning for large margin nearest neighbor classification[J]. Journal of Machine Learning Research, 2009, 10(2): 207−244
    [8]
    Wang Yuyang, Khardon R, Pechyony D, et al. Generalization bounds for online learning algorithms with pairwise loss functions[C] //Proc of the 25th Conf on Learning Theory. Berlin: Springer, 2012: 1−22
    [9]
    Ying Yiming, Zhou Dingxuan. Unregularized online learning algorithms with general loss functions[J]. Applied and Computational Harmonic Analysis, 2017, 42(2): 224−244 doi: 10.1016/j.acha.2015.08.007
    [10]
    Ying Yiming, Zhou Dingxuan. Online pairwise learning algorithms[J]. Neural Computation, 2016, 28(4): 743−777 doi: 10.1162/NECO_a_00817
    [11]
    Chen Xiaming, Lei Yunwen. Refined bounds for online pairwise learning algorithms[J]. Neurocomputing, 2018, 275: 2656−2665 doi: 10.1016/j.neucom.2017.11.049
    [12]
    Guo Zhengchu, Ying Yiming, Zhou Dingxuan. Online regularized learning with pairwise loss functions[J]. Advances in Computational Mathematics, 2017, 43(1): 127−150 doi: 10.1007/s10444-016-9479-7
    [13]
    Wang Cheng, Hu Ting. Online regularized pairwise learning with least squares loss[J]. Analysis and Applications, 2020, 18(1): 49−78 doi: 10.1142/S0219530519410070
    [14]
    Hu Ting, Fan Jun, Xiang Daohong. Convergence analysis of distributed multi-penalty regularized pairwise learning[J]. Analysis and Applications, 2020, 18(1): 109−127 doi: 10.1142/S0219530519410045
    [15]
    Bousquet O, Elisseeff A. Stability and generalization[J]. The Journal of Machine Learning Research, 2002, 2: 499−526
    [16]
    Elisseeff A, Evgeniou T, Pontil M, et al. Stability of randomized learning algorithms[J]. Journal of Machine Learning Research, 2005, 6(1): 55−79
    [17]
    Shen Wei, Yang Zhenhuan, Ying Yiming, et al. Stability and optimization error of stochastic gradient descent for pairwise learning[J]. Analysis and Applications, 2020, 18(5): 887−927 doi: 10.1142/S0219530519400062
    [18]
    Lei Yunwen, Ledent A, Kloft M. Sharper generalization bounds for pairwise learning[J]. Advances in Neural Information Processing Systems, 2020, 33: 21236−21246
    [19]
    McMahan B. Follow-the-regularized-leader and mirror descent: Equivalence theorems and L1 regularization[C] //Proc of the 14th Int Conf on Artificial Intelligence and Statistics. Cambridge, MA: MIT , 2011: 525−533
    [20]
    Agarwal N, Gonen A, Hazan E. Learning in non-convex games with an optimization oracle[C] //Proc of the 32nd Conf on Learning Theory. Berlin: Springer, 2019: 18−29
    [21]
    Fine M. Certifiably accurate private data release with generative adversarial networks[J/OL]. 2020[2021-12-18].https://projects.iq.harvard.edu/files/privacytools/files/michael_fines_thesis_-_dp.pdf
    [22]
    Grnarova P, Levy K Y, Lucchi A, et al. An online learning approach to generative adversarial networks[J]. arXiv preprint, arXiv: 1706.03269, 2017
    [23]
    Srebro N, Sridharan K, Tewari A. On the universality of online mirror descent[J/OL]. Advances in Neural Information Processing Systems, 2011[2021-11-25].https://proceedings.neurips.cc/paper/2011/file/a8f8f60264024dca151f164729b76c0b-Paper.pdf
    [24]
    Ying Yiming, Pontil M. Online gradient descent learning algorithms[J]. Foundations of Computational Mathematics, 2008, 8(5): 561−596 doi: 10.1007/s10208-006-0237-y
    [25]
    Ross S, Bagnell J A. Stability conditions for online learnability[J]. arXiv preprint, arXiv: 1108.3154, 2011
    [26]
    Suggala AS, Netrapalli P. Online non-convex learning: Following the perturbed leader is optimal[C] //Proc of the 31st Algorithmic Learning Theory. Berlin: Springer, 2020: 845−861
  • Related Articles

    [1]Lin Liansheng, Zheng Huanqin, Su Shen, Lei Kai, Chen Xiaofeng, Tian Zhihong. An On-Chain Mechanism Against DeFi Price Manipulation Attacks[J]. Journal of Computer Research and Development, 2025, 62(2): 443-457. DOI: 10.7544/issn1000-1239.202330291
    [2]Song Shuwei, Ni Xiaoze, Chen Ting. Gas Optimization for Smart Contracts: A Survey[J]. Journal of Computer Research and Development, 2023, 60(2): 311-325. DOI: 10.7544/issn1000-1239.202220887
    [3]Ying Chenhao, Xia Fuyuan, Li Jie, Si Xueming, Luo Yuan. Incentive Mechanism Based on Truth Estimation of Private Data for Blockchain-Based Mobile Crowdsensing[J]. Journal of Computer Research and Development, 2022, 59(10): 2212-2232. DOI: 10.7544/issn1000-1239.20220493
    [4]Feng Jingyu, Yang Jinwen, Zhang Ruitong, Zhang Wenbo. A Spectrum Sharing Incentive Scheme Against Location Privacy Leakage in IoT Networks[J]. Journal of Computer Research and Development, 2020, 57(10): 2209-2220. DOI: 10.7544/issn1000-1239.2020.20200453
    [5]Hai Mo, Zhu Jianming. A Propagation Mechanism Combining an Optimal Propagation Path and Incentive in Blockchain Networks[J]. Journal of Computer Research and Development, 2019, 56(6): 1205-1218. DOI: 10.7544/issn1000-1239.2019.20180419
    [6]He Yunhua, Li Mengru, Li Hong, Sun Limin, Xiao Ke, Yang Chao. A Blockchain Based Incentive Mechanism for Crowdsensing Applications[J]. Journal of Computer Research and Development, 2019, 56(3): 544-554. DOI: 10.7544/issn1000-1239.2019.20170670
    [7]He Haiwu, Yan An, Chen Zehua. Survey of Smart Contract Technology and Application Based on Blockchain[J]. Journal of Computer Research and Development, 2018, 55(11): 2452-2466. DOI: 10.7544/issn1000-1239.2018.20170658
    [8]Xiong Jinbo, Ma Rong, Niu Ben, Guo Yunchuan, Lin Li. Privacy Protection Incentive Mechanism Based on User-Union Matching in Mobile Crowdsensing[J]. Journal of Computer Research and Development, 2018, 55(7): 1359-1370. DOI: 10.7544/issn1000-1239.2018.20180080
    [9]Wang Bo, Huang Chuanhe, Yang Wenzhong, Dan Feng, and Xu Liya. An Incentive-Cooperative Forwarding Model Based on Punishment Mechanism in Wireless Ad Hoc Networks[J]. Journal of Computer Research and Development, 2011, 48(3): 398-406.
    [10]Yue Guangxue, Li Renfa, Chen Zhi, Zhou Xu. Analysis of Free-riding Behaviors and Modeling Restrain Mechanisms for Peer-to-Peer Networks[J]. Journal of Computer Research and Development, 2011, 48(3): 382-397.
  • Cited by

    Periodical cited type(2)

    1. 李硕,王馨爽. 多场景融合的码号数据分发架构及关键技术研究. 数据通信. 2024(06): 1-3+11 .
    2. 俞惠芳,李磊. 基于椭圆曲线签密的跨链医疗数据共享方案. 通信学报. 2024(12): 57-66 .

    Other cited types(0)

Catalog

    Article views (101) PDF downloads (65) Cited by(2)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return