An Efficient XML Documents Classification Method Based on Structure and Keywords Frequency

Yuan Jiazheng; Xu De; Bao Hong

Yuan Jiazheng, Xu De, Bao Hong. An Efficient XML Documents Classification Method Based on Structure and Keywords FrequencyJ. Journal of Computer Research and Development, 2006, 43(8): 1361-1367.

Citation:

Yuan Jiazheng, Xu De, Bao Hong. An Efficient XML Documents Classification Method Based on Structure and Keywords FrequencyJ. Journal of Computer Research and Development, 2006, 43(8): 1361-1367.

Citation:

Yuan Jiazheng, Xu De, Bao Hong. An Efficient XML Documents Classification Method Based on Structure and Keywords FrequencyJ. Journal of Computer Research and Development, 2006, 43(8): 1361-1367.

An Efficient XML Documents Classification Method Based on Structure and Keywords Frequency

Graphical Abstract

Abstract

Abstract

According to the XML Web page character, an efficient method for computing XML document similarity, position weight and frequency of keywords in documents is presented. Then some features are selected from XML documents based on the method and a multi-classification algorithm of XML Web page is proposed using support vector machines. In this algorithm, a CFK(classifier feature kernel) of common similarity features is created from each sample set of XML documents class. The class label of an XML document is determined by computing similar distance between a test XML document and each CFK. Experimental results prove the effectiveness of the classification algorithm and good performance for multi-classification of XML documents.

FullText(HTML)

References (0)

Cited By

Turn off MathJax

Article Contents

An Efficient XML Documents Classification Method Based on Structure and Keywords Frequency

Abstract

Catalog

Export File

Citation

Format

Content