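• China's Outstanding Science and Technology Journal
  • CCF-recommended Class A Chinese-language journal
  • T1-class high-quality science and technology journal in the computing field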
Wang Jianhui, Wang Hongwei, Shen Zhan, Hu Yunfa. A Simple and Efficient Algorithm to Classify a Large Scale of Texts[J]. Journal of Computer Research and Development, 2005, 42(1): 85-93.

A Simple and Efficient Algorithm to Classify a Large Scale of Texts

  • Published Date: January 14, 2005
  • Most current text classification methods are based on the vector space model (VSM), and the most widely used of them is k-nearest neighbors (kNN). However, most of these methods are computationally expensive and cannot be applied when a large number of samples must be classified. Moreover, the classifier must be rebuilt whenever the training corpus is extended, so they scale poorly. This paper introduces two new concepts, mutual dependence (MD) and equivalent radius (ER), and proposes a new classification method, SECTILE. SECTILE can classify a large number of samples and scales well. It is then applied to the classification of Chinese documents and compared with kNN and the CCC method. The results show that SECTILE outperforms both kNN and CCC, and that it can be used online to classify a large number of samples while maintaining classification precision and recall.
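To make the cost argument concrete, the following is a minimal sketch of the baseline kNN classifier over a vector space model, not of SECTILE, whose MD and ER definitions are not given in the abstract. The function names, the whitespace tokenization, and the toy English corpus are assumptions made for illustration; Chinese text would first need word segmentation. Each query is compared against every training vector, so the work per query grows with the size of the training set.

```python
import math
from collections import Counter


def tf_vector(text):
    """Map a whitespace-tokenized text to a term-frequency vector (illustrative VSM)."""
    return Counter(text.split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(query, training, k=3):
    """Label a query by majority vote among its k most similar training documents.

    `training` is a list of (vector, label) pairs. Every pair is scored,
    which is the per-query cost that grows with the training set.
    """
    q = tf_vector(query)
    scored = sorted(
        ((cosine(q, vec), label) for vec, label in training),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy corpus, for illustration only.
training = [
    (tf_vector(text), label)
    for text, label in [
        ("stock market shares fall", "finance"),
        ("bank raises interest rates", "finance"),
        ("team wins football match", "sports"),
        ("player scores in final match", "sports"),
    ]
]

label = knn_classify("football player scores", training, k=3)
```

In this sketch, adding training documents only means appending to `training`, but every later query then scans the longer list, which is one way to read the abstract's concern about classifying large collections with kNN.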