高级检索

    基于嵌入式Bootstrap的主动学习示例选择方法

    An Example Selection Method for Active Learning Based on Embedded Bootstrap Algorithm

    • 摘要: 在Bootstrap示例选择算法的基础上提出一种新的嵌入式Bootstrap算法.该算法适用于一大类主动机器学习中训练示例的选择问题.新算法在保持和原Bootstrap算法相当的训练时间的前提下可得到更典型的训练示例集,从而解决了计算条件对训练集规模的限制,使训练所得预测器具有更高的性能.从理论上分析了新算法的有效性,然后将其与原Bootstrap算法分别应用到基于AdaBoost的正面人脸检测任务中进行对比实验,实验结果与理论分析一致.

       

      Abstract: In order to reduce the redundancy of the original Bootstrap example selection algorithm, an embedded Bootstrap (E-Bootstrap) strategy is proposed, which sieves elaborately through extremely large training data sets for more typical examples relevant to the learning problem. Through formulating, the E-Bootstrap and Bootstrap algorithms are compared from two aspects, which indicate that the E-Bootstrap algorithm with almost the same training time selects more utility examples to represent a potentially overwhelming quantity of training data. Thus computational resource constraints to the size of training example set can be handled to some degree, and more effective predictor can be trained. Furthermore, both the above algorithms are applied to the negative example selection for AdaBoost based face detection system. Two experiments are implemented according to each aspect of theoretical analysis. And the results show that the E-Bootstrap outperforms the Bootstrap in getting rid of the redundant examples in Bootsrap sampling to obtain more representative training example set. Moreover, the E-Bootstrap algorithm is applicable to many other example-based active learning methods.

       

    /

    返回文章
    返回