高级检索

    局部线性与One-Class结合的科技文本分类方法

    Journal Text Categorization with the Combination of Local Linearity and One-Class

    • 摘要: 结合了局部线性和One-Class的思想对科技文本分类问题进行了研究,利用局部线性的思想寻找文本样本的内在支撑流形,利用One-Class的思想确定正负样本的分界面.与K近邻算法、线性SVM算法和One-Class问题的SVM算法相比,给出的科技文本分类方法具有分类精度高、参数估计简便、正负样本分类精度可控制等优点,为解决科技文献的分类问题提供了一条有效的途径.

       

      Abstract: A research is proposed on journal text categorization with the combination of local linearity and one-class. Local linearity is introduced to determine the samples' low-dimensional manifold, which could be regarded as the distribution of the samples in low-dimensional mapping spaces. At the same time, the border of positive and negative samples is determined by one-class. Compared with Knearest algorithm, linear SVM and one-class SVM, the new algorithm of journal text categorization gives better results in high precision, simple parameter estimation and easy control of risks, which gives an effective approach for the solution of text categorization.

       

    /

    返回文章
    返回