• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Jiang Yuan and Zhou Zhihua. A Text Classification Method Based on Term Frequency Classifier Ensemble[J]. Journal of Computer Research and Development, 2006, 43(10): 1681-1687.
Citation: Jiang Yuan and Zhou Zhihua. A Text Classification Method Based on Term Frequency Classifier Ensemble[J]. Journal of Computer Research and Development, 2006, 43(10): 1681-1687.

A Text Classification Method Based on Term Frequency Classifier Ensemble

More Information
  • Published Date: October 14, 2006
  • In this paper, a method of text classification based on term frequency classifier ensemble is proposed. Term frequency classifier is a kind of simple classifier obtained after calculating terms' frequency of texts in the corpus. Though the generalization ability of term frequency classifier is not strong enough, it is a qualified base learner for ensemble because of its low computational cost, flexibility in updating with new samples and classes, and the feasibility of improving generalization with the help of ensemble paradigms. An improved AdaBoost algorithm is used to build the ensemble, which employs a scheme of compulsive weights updating to avoid early stop. Therefore it is more suitable for text classification. Experimental results on the corpus of Reuters-21578 show that the proposed method can achieve good performance in text classification tasks.

Catalog

    Article views (807) PDF downloads (1091) Cited by()
    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return