Sentiment Uncertainty Measure and Classification of Negative Sentences
-
摘要: 情感分类是社交媒体大数据分析的有力手段之一.否定句作为一种普遍且特殊的句子现象,其情感分类的研究具有重要的意义.否定词语和情感词语在否定句情感分类中同样重要,已有方法仅仅考虑否定词语修饰情感词语的情况,忽视否定词语本身反映情感的作用.为了统一解决否定词语修饰和不修饰情感词语情况下的分类问题,提出了基于决策粗糙集的否定句情感分类模型.构造词典并结合句际关系计算子句情感值,根据子句情感值提出基于KL散度的句子情感不确定性度量方法;然后融合多个特征,特别是与否定相关的独立否定特征和显著副词特征,用于否定句的特征表示;最后提出基于决策相关程度的决策正域约简算法,生成否定句情感分类决策规则.实验结果验证了该模型的有效性以及情感不确定性度量对于情感分类的作用.Abstract: Sentiment classification is a powerful technology for social media big data analysis. It is of great importance to predict the sentiment polarity of a sentence, especially a negative sentence that is often used. The negation words and sentiment words play equally important roles in the sentiment classification of negative sentences. A negation word is important when it modifies a sentiment word; but it can also have sentimental implication on its own. The existing methods only consider the negation words when they modify sentiment words. In this paper, a unified classification model based on decision-theoretic rough sets is proposed to deal with the sentiment classification of negative sentences. First, the sentiment value of each clause in a sentence is calculated by several lexicons and the inter-sentence relations. A novel measure of sentiment uncertainty for a sentence is given based on Kullback-Leibler divergence. Then, the negative sentences are represented in terms of four features (initial polarity, sentiment uncertainty, successive punctuations, and sentence type) and especially two negation-related features: single negation and salient adverb. Finally, a novel attribute reduction algorithm based on the decision correlation degree is used to generate the decision rules for sentiment classification of negative sentences. The experimental results show that this model is effective and the sentiment uncertainty measure is helpful to sentiment classification.
-
-
期刊类型引用(10)
1. 徐怡,陶强. 划分序乘积空间约简算法研究. 系统工程理论与实践. 2025(02): 554-570 . 百度学术
2. 刘长顺,刘炎,宋晶晶,徐泰华. 基于论域离散度的属性约简算法. 山东大学学报(理学版). 2023(05): 26-35+52 . 百度学术
3. 张清华,艾志华,张金镇. 融合密度与邻域覆盖约简的分类方法. 陕西师范大学学报(自然科学版). 2022(03): 33-42 . 百度学术
4. 张雨新,孙达明,李飞. 基于粒化单调的不完备混合型数据增量式属性约简算法. 计算机应用与软件. 2021(03): 279-286 . 百度学术
5. 邹丽,任思远,杨光,杨鑫华. 基于改进条件邻域熵的接头疲劳寿命影响因素分析. 焊接学报. 2021(11): 43-50+99-100 . 百度学术
6. 刘正,陈雪勤,张书锋. 基于最小化邻域互信息的邻域熵属性约简算法. 微电子学与计算机. 2020(03): 26-32 . 百度学术
7. 陈帅,张贤勇,唐玲玉,姚岳松. 邻域互补信息度量及其启发式属性约简. 数据采集与处理. 2020(04): 630-641 . 百度学术
8. 周艳红,张强. 基于三层粒结构的三支邻域熵. 数学的实践与认识. 2020(14): 83-93 . 百度学术
9. 亓慧,史颖. 不同度量下集成属性选择器的对比研究. 山西大学学报(自然科学版). 2019(04): 848-853 . 百度学术
10. 周艳红,张迪,张强. 基于单调信息度量的特定类属性约简. 内江师范学院学报. 2019(12): 35-39 . 百度学术
其他类型引用(11)
计量
- 文章访问数: 1565
- HTML全文浏览量: 0
- PDF下载量: 754
- 被引次数: 21