Wang Junhua, Zuo Wanli, Yan Zhao. Word Semantic Similarity Measurement Based on Naïve Bayes Model[J]. Journal of Computer Research and Development, 2015, 52(7): 1499-1509. DOI: 10.7544/issn1000-1239.2015.20140383

Word Semantic Similarity Measurement Based on Naïve Bayes Model

  • Published Date: June 30, 2015
  • Measuring the semantic similarity between words is a classical and active problem in natural language processing, and progress on it has a great impact on applications such as word sense disambiguation, machine translation, ontology mapping, and computational linguistics. This paper proposes a novel approach that measures word semantic similarity by combining a Naïve Bayes model with a knowledge base. First, attribute variables are extracted from WordNet; then, conditional probability distributions are generated by statistics and piecewise linear interpolation; next, the posterior probability is obtained through Bayesian inference; finally, word semantic similarity is quantified. The main contributions are (1) definitions of distance and depth between word pairs that require little computation and discriminate well between the characteristics of word senses, and (2) a word semantic similarity measure based on the Naïve Bayes model. The experiment is conducted on the R&G (65) benchmark data set using 5-fold cross-validation. The sample Pearson correlation between the test results and human judgments is 0.912, a 0.4% improvement over the best existing method and a 7%~13% improvement over classical methods. The Spearman correlation between the test results and human judgments is 0.873, a 10%~20% improvement over classical methods. The computational complexity of the method is comparable to that of the classical methods, which indicates that integrating a Naïve Bayes model with a knowledge base to measure word semantic similarity is reasonable and effective.
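The four steps named in the abstract (attribute extraction, conditional probabilities by piecewise linear interpolation, Bayesian inference, similarity quantification) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the definitions of distance and depth, the class variable, or any probability values, so the two-class setup ("similar" vs. "dissimilar"), the knot positions, the likelihood values, the prior, and the names `interp` and `similarity` are all assumptions made for illustration. The attribute values are passed in directly rather than computed from WordNet.

```python
from bisect import bisect_right


def interp(x, xs, ys):
    """Piecewise linear interpolation through the knots (xs, ys).

    xs must be strictly increasing; values outside the range are clamped.
    """
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    i = bisect_right(xs, x)  # index of the first knot strictly greater than x
    x0, x1 = xs[i - 1], xs[i]
    y0, y1 = ys[i - 1], ys[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


# Hypothetical likelihood tables (NOT taken from the paper). In practice they
# would be estimated from labelled word pairs and then interpolated between
# the observed attribute values.
DIST_KNOTS = [0.0, 2.0, 5.0, 10.0, 20.0]
LIK_DIST = {
    "similar": [0.50, 0.30, 0.12, 0.05, 0.01],
    "dissimilar": [0.01, 0.05, 0.15, 0.30, 0.40],
}
DEPTH_KNOTS = [0.0, 3.0, 6.0, 10.0]
LIK_DEPTH = {
    "similar": [0.05, 0.20, 0.35, 0.40],
    "dissimilar": [0.40, 0.30, 0.20, 0.10],
}
PRIOR = {"similar": 0.5, "dissimilar": 0.5}  # assumed uniform prior


def similarity(distance, depth):
    """Posterior P(similar | distance, depth) under the Naive Bayes assumption.

    The two attributes are treated as conditionally independent given the
    class, so the joint likelihood is the product of the per-attribute terms.
    """
    joint = {}
    for label in PRIOR:
        joint[label] = (
            PRIOR[label]
            * interp(distance, DIST_KNOTS, LIK_DIST[label])
            * interp(depth, DEPTH_KNOTS, LIK_DEPTH[label])
        )
    evidence = sum(joint.values())
    return joint["similar"] / evidence


if __name__ == "__main__":
    # A close, deep pair should score higher than a distant, shallow one.
    print(similarity(distance=1.0, depth=8.0))
    print(similarity(distance=15.0, depth=1.0))
```

Under these assumptions the posterior itself serves as the similarity score in [0, 1]. Scores for a benchmark such as R&G (65) could then be compared with human ratings using Pearson and Spearman correlation, as the abstract reports.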