Citation: | Lang Xuancong, Li Chunsheng, Liu Yong, Wang Mei. Regret Bounds for Online Pairwise Learning with Non-Convex Loss Functions Using Stability Analysis[J]. Journal of Computer Research and Development, 2023, 60(12): 2806-2813. DOI: 10.7544/issn1000-1239.202220221 |
Pairwise learning refers to a learning task which involves a loss function depending on pairs of instances. Recently, there is a growing interest in studying pairwise learning since it includes many important machine learning tasks as specific examples, e.g., metric learning, AUC maximization and ranking. Regret bounds are particularly important for generalization analysis of online pairwise learning. The existing online pairwise learning analysis provides regret bounds only with convex loss functions. To fill the gap in the theoretical study of online pairwise learning with non-convex loss functions, we present a systematic study on the generalization analysis for online pairwise learning and propose regret bounds for non-convex online pairwise learning in this paper. We consider online learning in an adversarial, non-convex setting under the assumption that the learner has access to an offline optimization oracle and the learner’s prediction with expert advice. We first propose a general online pairwise learning framework and establish the stability of online pairwise learning with non-convex loss functions. Then, the regret bounds can be derived naturally from stability. Finally, we show that the general online pairwise learning framework with non-convex loss functions achieves optimal regret bounds of O(T−1/2) when the learner has access to an offline optimization oracle.
[1] |
李志杰,李元香,王峰,等. 面向大数据分析的在线学习算法综述[J]. 计算机研究与发展,2015,52(8):1707−1721
Li Zhijie, Li Yuanxiang, Wang Feng, et al. Online learning algorithms for big data analytics: A survey[J]. Journal of Computer Research and Development, 2015, 52(8): 1707−1721 (in Chinese)
|
[2] |
Clémençon S, Lugosi G, Vayatis N. Ranking and empirical minimization of U-statistics[J]. The Annals of Statistics, 2008, 36(2): 844−874
|
[3] |
Agarwal S, Niyogi P. Generalization bounds for ranking algorithms via algorithmic stability[J]. Journal of Machine Learning Research, 2009, 10(2): 441−474
|
[4] |
Rejchel W. On ranking and generalization bounds[J]. Journal of Machine Learning Research, 2012, 13(5): 1373−1392
|
[5] |
Rejchel W. Fast rates for ranking with large families[J]. Neurocomputing, 2015, 168: 1104−1110 doi: 10.1016/j.neucom.2015.05.013
|
[6] |
Zhao Peilin, Hoi S, Jin Rong, et al. Online AUC maximization[C] //Proc of the 28th Int Conf on Machine Learning. New York: ACM, 2011: 233−240
|
[7] |
Weinberger K Q, Saul L K. Distance metric learning for large margin nearest neighbor classification[J]. Journal of Machine Learning Research, 2009, 10(2): 207−244
|
[8] |
Wang Yuyang, Khardon R, Pechyony D, et al. Generalization bounds for online learning algorithms with pairwise loss functions[C] //Proc of the 25th Conf on Learning Theory. Berlin: Springer, 2012: 1−22
|
[9] |
Ying Yiming, Zhou Dingxuan. Unregularized online learning algorithms with general loss functions[J]. Applied and Computational Harmonic Analysis, 2017, 42(2): 224−244 doi: 10.1016/j.acha.2015.08.007
|
[10] |
Ying Yiming, Zhou Dingxuan. Online pairwise learning algorithms[J]. Neural Computation, 2016, 28(4): 743−777 doi: 10.1162/NECO_a_00817
|
[11] |
Chen Xiaming, Lei Yunwen. Refined bounds for online pairwise learning algorithms[J]. Neurocomputing, 2018, 275: 2656−2665 doi: 10.1016/j.neucom.2017.11.049
|
[12] |
Guo Zhengchu, Ying Yiming, Zhou Dingxuan. Online regularized learning with pairwise loss functions[J]. Advances in Computational Mathematics, 2017, 43(1): 127−150 doi: 10.1007/s10444-016-9479-7
|
[13] |
Wang Cheng, Hu Ting. Online regularized pairwise learning with least squares loss[J]. Analysis and Applications, 2020, 18(1): 49−78 doi: 10.1142/S0219530519410070
|
[14] |
Hu Ting, Fan Jun, Xiang Daohong. Convergence analysis of distributed multi-penalty regularized pairwise learning[J]. Analysis and Applications, 2020, 18(1): 109−127 doi: 10.1142/S0219530519410045
|
[15] |
Bousquet O, Elisseeff A. Stability and generalization[J]. The Journal of Machine Learning Research, 2002, 2: 499−526
|
[16] |
Elisseeff A, Evgeniou T, Pontil M, et al. Stability of randomized learning algorithms[J]. Journal of Machine Learning Research, 2005, 6(1): 55−79
|
[17] |
Shen Wei, Yang Zhenhuan, Ying Yiming, et al. Stability and optimization error of stochastic gradient descent for pairwise learning[J]. Analysis and Applications, 2020, 18(5): 887−927 doi: 10.1142/S0219530519400062
|
[18] |
Lei Yunwen, Ledent A, Kloft M. Sharper generalization bounds for pairwise learning[J]. Advances in Neural Information Processing Systems, 2020, 33: 21236−21246
|
[19] |
McMahan B. Follow-the-regularized-leader and mirror descent: Equivalence theorems and L1 regularization[C] //Proc of the 14th Int Conf on Artificial Intelligence and Statistics. Cambridge, MA: MIT , 2011: 525−533
|
[20] |
Agarwal N, Gonen A, Hazan E. Learning in non-convex games with an optimization oracle[C] //Proc of the 32nd Conf on Learning Theory. Berlin: Springer, 2019: 18−29
|
[21] |
Fine M. Certifiably accurate private data release with generative adversarial networks[J/OL]. 2020[2021-12-18].https://projects.iq.harvard.edu/files/privacytools/files/michael_fines_thesis_-_dp.pdf
|
[22] |
Grnarova P, Levy K Y, Lucchi A, et al. An online learning approach to generative adversarial networks[J]. arXiv preprint, arXiv: 1706.03269, 2017
|
[23] |
Srebro N, Sridharan K, Tewari A. On the universality of online mirror descent[J/OL]. Advances in Neural Information Processing Systems, 2011[2021-11-25].https://proceedings.neurips.cc/paper/2011/file/a8f8f60264024dca151f164729b76c0b-Paper.pdf
|
[24] |
Ying Yiming, Pontil M. Online gradient descent learning algorithms[J]. Foundations of Computational Mathematics, 2008, 8(5): 561−596 doi: 10.1007/s10208-006-0237-y
|
[25] |
Ross S, Bagnell J A. Stability conditions for online learnability[J]. arXiv preprint, arXiv: 1108.3154, 2011
|
[26] |
Suggala AS, Netrapalli P. Online non-convex learning: Following the perturbed leader is optimal[C] //Proc of the 31st Algorithmic Learning Theory. Berlin: Springer, 2020: 845−861
|
[1] | Lin Liansheng, Zheng Huanqin, Su Shen, Lei Kai, Chen Xiaofeng, Tian Zhihong. An On-Chain Mechanism Against DeFi Price Manipulation Attacks[J]. Journal of Computer Research and Development, 2025, 62(2): 443-457. DOI: 10.7544/issn1000-1239.202330291 |
[2] | Song Shuwei, Ni Xiaoze, Chen Ting. Gas Optimization for Smart Contracts: A Survey[J]. Journal of Computer Research and Development, 2023, 60(2): 311-325. DOI: 10.7544/issn1000-1239.202220887 |
[3] | Ying Chenhao, Xia Fuyuan, Li Jie, Si Xueming, Luo Yuan. Incentive Mechanism Based on Truth Estimation of Private Data for Blockchain-Based Mobile Crowdsensing[J]. Journal of Computer Research and Development, 2022, 59(10): 2212-2232. DOI: 10.7544/issn1000-1239.20220493 |
[4] | Feng Jingyu, Yang Jinwen, Zhang Ruitong, Zhang Wenbo. A Spectrum Sharing Incentive Scheme Against Location Privacy Leakage in IoT Networks[J]. Journal of Computer Research and Development, 2020, 57(10): 2209-2220. DOI: 10.7544/issn1000-1239.2020.20200453 |
[5] | Hai Mo, Zhu Jianming. A Propagation Mechanism Combining an Optimal Propagation Path and Incentive in Blockchain Networks[J]. Journal of Computer Research and Development, 2019, 56(6): 1205-1218. DOI: 10.7544/issn1000-1239.2019.20180419 |
[6] | He Yunhua, Li Mengru, Li Hong, Sun Limin, Xiao Ke, Yang Chao. A Blockchain Based Incentive Mechanism for Crowdsensing Applications[J]. Journal of Computer Research and Development, 2019, 56(3): 544-554. DOI: 10.7544/issn1000-1239.2019.20170670 |
[7] | He Haiwu, Yan An, Chen Zehua. Survey of Smart Contract Technology and Application Based on Blockchain[J]. Journal of Computer Research and Development, 2018, 55(11): 2452-2466. DOI: 10.7544/issn1000-1239.2018.20170658 |
[8] | Xiong Jinbo, Ma Rong, Niu Ben, Guo Yunchuan, Lin Li. Privacy Protection Incentive Mechanism Based on User-Union Matching in Mobile Crowdsensing[J]. Journal of Computer Research and Development, 2018, 55(7): 1359-1370. DOI: 10.7544/issn1000-1239.2018.20180080 |
[9] | Wang Bo, Huang Chuanhe, Yang Wenzhong, Dan Feng, and Xu Liya. An Incentive-Cooperative Forwarding Model Based on Punishment Mechanism in Wireless Ad Hoc Networks[J]. Journal of Computer Research and Development, 2011, 48(3): 398-406. |
[10] | Yue Guangxue, Li Renfa, Chen Zhi, Zhou Xu. Analysis of Free-riding Behaviors and Modeling Restrain Mechanisms for Peer-to-Peer Networks[J]. Journal of Computer Research and Development, 2011, 48(3): 382-397. |
1. |
李硕,王馨爽. 多场景融合的码号数据分发架构及关键技术研究. 数据通信. 2024(06): 1-3+11 .
![]() | |
2. |
俞惠芳,李磊. 基于椭圆曲线签密的跨链医疗数据共享方案. 通信学报. 2024(12): 57-66 .
![]() |