• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Wang Junhua, Zuo Wanli, Yan Zhao. Word Semantic Similarity Measurement Based on Nave Bayes Model[J]. Journal of Computer Research and Development, 2015, 52(7): 1499-1509. DOI: 10.7544/issn1000-1239.2015.20140383
Citation: Wang Junhua, Zuo Wanli, Yan Zhao. Word Semantic Similarity Measurement Based on Nave Bayes Model[J]. Journal of Computer Research and Development, 2015, 52(7): 1499-1509. DOI: 10.7544/issn1000-1239.2015.20140383

Word Semantic Similarity Measurement Based on Nave Bayes Model

More Information
  • Published Date: June 30, 2015
  • Measuring semantic similarity between words is a classical and hot problem in nature language processing, the achievement of which has great impact on many applications such as word sense disambiguation, machine translation, ontology mapping, computational linguistics, etc. A novel approach is proposed to measure words semantic similarity by combining Nave Bayes model with knowledge base. To start, extract attribute variables based on WordNet; then, generate conditional probability distribution by statistics and piecewise linear interpolation technique; after that, obtain posteriori through Bayesian inference; at last, quantify word semantic similarity. The main contributions are definition of distance and depth between word pairs with small amount of computation and high degree of distinguishing the characteristics from words’ sense, and word semantic similarity measurement based on nave Bayesian model. On benchmark data set R&G(65), the experiment is conducted through 5-fold cross validation. The sample Pearson correlation between test results and human judgments is 0.912, with 0.4% improvement over existing best practice, and 7%~13% improvement over classical methods. Spearman correlation between test results and human judgments is 0.873, with 10%~20% improvement over classical methods. And the computational complexity of the method is as efficient as the classical methods, which indicates that integrating Nave Bayes model with knowledge base to measure word semantic similarity is reasonable and effective.
  • Related Articles

    [1]Zhang Wenjun, Jiang Liangxiao, Zhang Huan, Chen Long. A Two-Layer Bayes Model: Random Forest Naive Bayes[J]. Journal of Computer Research and Development, 2021, 58(9): 2040-2051. DOI: 10.7544/issn1000-1239.2021.20200521
    [2]Wang Fei, Yue Kun, Sun Zhengbao, Wu Hao, Feng Hui. Analyzing Rating Data and Modeling Dynamic Behaviors of Users Based on the Bayesian Network[J]. Journal of Computer Research and Development, 2017, 54(7): 1488-1499. DOI: 10.7544/issn1000-1239.2017.20160556
    [3]Zhu Kenan, Yin Baolin, Mao Yaming, Hu Yingnan. Malware Classification Approach Based on Valid Window and Naive Bayes[J]. Journal of Computer Research and Development, 2014, 51(2): 373-381.
    [4]Wang Mei, Liao Shizhong. Three-Step Bayesian Combination of SVM on Regularization Path[J]. Journal of Computer Research and Development, 2013, 50(9): 1855-1864.
    [5]Si Guannan, Ren Yuhan, Xu Jing, and Yang Jufeng. A Dependability Evaluation Model for Internetware Based on Bayesian Network[J]. Journal of Computer Research and Development, 2012, 49(5): 1028-1038.
    [6]Tian Junfeng and Tian Rui. A Fine-Grain Trust Model Based on Domain and Bayesian Network for P2P E-Commerce System[J]. Journal of Computer Research and Development, 2011, 48(6): 974-982.
    [7]Zhu Mingfang, Tang Changjie, Dai Shucheng, Chen Yu, Qiao Shaojie, Xiang Yong. Nave Gene Expression Programming Based on Genetic Neutrality[J]. Journal of Computer Research and Development, 2010, 47(2): 292-299.
    [8]Qian Ning, Wu Guoxin, and Zhao Shenghui. A Bayesian Network-Based Search Method in Unstructured Peer-to-Peer Networks[J]. Journal of Computer Research and Development, 2009, 46(6): 889-897.
    [9]Miao Duoqian, Wang Ruizhi, and Ran Wei. A Dynamic Bayesian Network Based Framework for Continuous Speech Recognition and its Token Passing Model[J]. Journal of Computer Research and Development, 2008, 45(11): 1882-1891.
    [10]Xu Junming, Jiang Yuan, and Zhou Zhihua. Bayesian Classifier Based on Frequent Item Sets Mining[J]. Journal of Computer Research and Development, 2007, 44(8): 1293-1300.
  • Cited by

    Periodical cited type(5)

    1. 郭萌,徐胜,陈锦龙,李玉靖,杜江,杨帅. 电力调度系统人员声纹特征AI回溯提取方法. 微型电脑应用. 2025(01): 134-137 .
    2. 朱东骏,汪如毅,江婷,朱雨薪,熊剑峰. 基于AI识别技术的拆回计量器具自动化分拣方法. 机械制造与自动化. 2023(03): 152-155 .
    3. 江楠,陈洁,肖潘,唐文强,林志泉. 基于声纹识别的电力会议多角色语音的分离和识别研究. 高电压技术. 2023(S1): 40-46 .
    4. 张晶晶,李晨鸣. 智慧家居混合声纹识别系统的应用及研究. 自动化与仪器仪表. 2023(12): 139-143 .
    5. 张波,佟玉强. 基于双随机相位编码的多特征人脸模板保护方法. 激光与光电子学进展. 2022(18): 215-222 .

    Other cited types(5)

Catalog

    Article views (1582) PDF downloads (806) Cited by(10)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return