• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

基于增量切空间校准的自适应流式大数据学习算法

谈超, 吉根林, 赵斌

谈超, 吉根林, 赵斌. 基于增量切空间校准的自适应流式大数据学习算法[J]. 计算机研究与发展, 2017, 54(11): 2547-2557. DOI: 10.7544/issn1000-1239.2017.20160712
引用本文: 谈超, 吉根林, 赵斌. 基于增量切空间校准的自适应流式大数据学习算法[J]. 计算机研究与发展, 2017, 54(11): 2547-2557. DOI: 10.7544/issn1000-1239.2017.20160712
Tan Chao, Ji Genlin, Zhao Bin. Self-Adaptive Streaming Big Data Learning Algorithm Based on Incremental Tangent Space Alignment[J]. Journal of Computer Research and Development, 2017, 54(11): 2547-2557. DOI: 10.7544/issn1000-1239.2017.20160712
Citation: Tan Chao, Ji Genlin, Zhao Bin. Self-Adaptive Streaming Big Data Learning Algorithm Based on Incremental Tangent Space Alignment[J]. Journal of Computer Research and Development, 2017, 54(11): 2547-2557. DOI: 10.7544/issn1000-1239.2017.20160712
谈超, 吉根林, 赵斌. 基于增量切空间校准的自适应流式大数据学习算法[J]. 计算机研究与发展, 2017, 54(11): 2547-2557. CSTR: 32373.14.issn1000-1239.2017.20160712
引用本文: 谈超, 吉根林, 赵斌. 基于增量切空间校准的自适应流式大数据学习算法[J]. 计算机研究与发展, 2017, 54(11): 2547-2557. CSTR: 32373.14.issn1000-1239.2017.20160712
Tan Chao, Ji Genlin, Zhao Bin. Self-Adaptive Streaming Big Data Learning Algorithm Based on Incremental Tangent Space Alignment[J]. Journal of Computer Research and Development, 2017, 54(11): 2547-2557. CSTR: 32373.14.issn1000-1239.2017.20160712
Citation: Tan Chao, Ji Genlin, Zhao Bin. Self-Adaptive Streaming Big Data Learning Algorithm Based on Incremental Tangent Space Alignment[J]. Journal of Computer Research and Development, 2017, 54(11): 2547-2557. CSTR: 32373.14.issn1000-1239.2017.20160712

基于增量切空间校准的自适应流式大数据学习算法

基金项目: 国家自然科学基金项目(41471371,61702270);江苏省高校自然科学基金项目(15KJB520022)
详细信息
  • 中图分类号: TP181

Self-Adaptive Streaming Big Data Learning Algorithm Based on Incremental Tangent Space Alignment

  • 摘要: 流形学习是为了寻找高维空间中观测数据的低维嵌入.作为一种有效的非线性维数约减方法,流形学习被广泛应用于数据挖掘、模式识别等机器学习领域.然而,对于样本外点学习、增量学习和在线学习等流形学习方法,面对流式大数据的学习算法时间效率较低.为此提出了一种新的基于增量切空间的自适应流式大数据学习算法(self-adaptive streaming big data learning algorithm based on incremental tangent space alignment, SLITSA),该算法采用增量PCA的思想,增量地构造子空间,能在线或增量地检测数据流中的内在低维流形结构,在迭代过程中构建新的切空间进行调准,保证了算法的收敛性并降低了重构误差.通过人工数据集以及真实数据集上的实验表明:该算法分类精度和时间效率优于其他学习算法,可推广到在线或流式大数据的应用当中.
    Abstract: Manifold learning is developed to find the observed data's low-dimension embeddings in high dimensional data space. As a type of effective nonlinear dimension reduction method, it has been widely applied to the machine learning field, such as data mining and pattern recognition, etc. However, when processing a large scale data stream, the complexity of time is too high for many traditional manifold learning algorithms, including out of sample learning algorithm, incremental learning algorithm, online learning algorithm, and so on. This paper presents a novel self-adaptive learning algorithm based on incremental tangent space alignment (named SLITSA) for big data stream processing. SLITSA adopts the incremental PCA to construct the subspace incrementally, and can detect the intrinsic low dimensional manifold structure of data streams online or incrementally. In order to ensure the convergence of SLITSA and reduce the reconstruction error, it can also construct a new tangent space for adjustment during the iterative process. Experiments on artificial data sets and real data sets show that the classification accuracy and time efficiency of the proposed algorithm are better than other manifold learning algorithms, which can be extended to the application of streaming data and real-time big data analytics.
  • 期刊类型引用(9)

    1. 黄翔东,陈红红,甘霖. 基于频率-时间扩张密集网络的语音增强方法. 计算机研究与发展. 2023(07): 1628-1638 . 本站查看
    2. 许春冬,徐琅,周滨. 结合优化U-Net和残差神经网络的单通道语音增强算法. 现代电子技术. 2022(09): 35-40 . 百度学术
    3. 葛宛营,张天骐,范聪聪,张天. 噪声情况下采用稀疏非负矩阵分解与深度吸引子网络的人声分离算法. 声学学报. 2021(01): 55-66 . 百度学术
    4. GE Wanying,ZHANG Tianqi,FAN Congcong,ZHANG Tian. Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network. Chinese Journal of Acoustics. 2021(02): 266-280 . 必应学术
    5. 王静红,梁丽娜,李昊康,周易. 基于注意力网络特征的社区发现算法. 山东大学学报(理学版). 2021(09): 1-12+20 . 百度学术
    6. 张天骐,柏浩钧,叶绍鹏,刘鉴兴. 基于门控残差卷积编解码网络的单通道语音增强方法. 信号处理. 2021(10): 1986-1995 . 百度学术
    7. 曹丽静. 语音增强技术研究综述. 河北省科学院学报. 2020(02): 30-36 . 百度学术
    8. 张天骐,张晓艳,周琳,胡延平. 基于稀疏性的相位谱补偿语音增强算法. 信号处理. 2020(11): 1867-1876 . 百度学术
    9. 时文华,张雄伟,邹霞,孙蒙. 利用深度全卷积编解码网络的单通道语音增强. 信号处理. 2019(04): 631-640 . 百度学术

    其他类型引用(8)

计量
  • 文章访问数:  1154
  • HTML全文浏览量:  0
  • PDF下载量:  724
  • 被引次数: 17
出版历程
  • 发布日期:  2017-10-31

目录

    /

    返回文章
    返回