基于GCN-Stacking协同增强的多模态金融舆情负向情感强度识别

李婷婷; 孙守强; 李青青; 万玲

doi:10.7544/issn1000-1239.202550930

基于GCN-Stacking协同增强的多模态金融舆情负向情感强度识别

GCN-Stacking Collaborative Enhancement-Based Multimodal Financial Public Opinion Negative Sentiment Intensity Recognition

摘要

摘要: 社交媒体中金融舆情的负向情感强度识别对金融风险防控与市场稳定具有重要意义。针对图文多模态信息在金融情感表达中的普遍性，以及模态间结构关联有待深入挖掘的问题，提出一种基于图卷积网络与Stacking集成学习协同增强的多模态负向情感强度识别模型。该模型首先利用CLIP模型提取文本与图像的语义特征；随后，通过构建多模态图结构并引入图卷积网络，实现特征的结构化关联嵌入，以增强对模态间潜在关联的捕捉能力；最后，构建双层Stacking集成学习框架，通过融合多个差异化基分类器的预测结果，优化情感强度识别的决策性能。实验结果表明，在最优配置下，模型的F1分数达83.57%，AUC达95.05%，相较于单一文本最优模型、单一图像最优模型及原始CLIP多模态模型，F1分数分别提升2.48%，5.27%，1.87%。实验结果验证了所提方法在挖掘模态关联与提升识别性能方面的有效性，可为金融舆情智能化分析与金融风险监测提供技术支撑。

Abstract: The recognition of negative sentiment intensity in financial public opinion on social media is of great significance for financial risk prevention and market stability. Given the prevalence of multimodal information combining text and images in financial sentiment expression, and the need for further exploration of structural correlations between modalities, we propose a multimodal negative sentiment intensity recognition model based on the collaborative enhancement of graph convolutional networks and stacking ensemble learning. The model first employs the CLIP model to extract semantic features from text and images. Subsequently, a multimodal graph structure is constructed and graph convolutional networks are introduced to achieve structural association embedding of features, thereby enhancing the ability to capture potential inter-modal correlations. Finally, a two-layer stacking ensemble learning framework is built to optimize the decision performance of sentiment intensity recognition by integrating the prediction results of multiple diverse base classifiers. Experimental results show that under the optimal configuration, the model achieves an F1-score of 83.57% and an AUC of 95.05%, outperforming the optimal single-text model, the optimal single-image model, and the original CLIP multimodal model by 2.48%, 5.27%, and 1.87% in F1-score, respectively. These results verify the effectiveness of the proposed method in mining modal correlations and improving recognition performance, providing technical support for intelligent financial public opinion analysis and financial risk monitoring.

HTML全文

参考文献(35)

施引文献

资源附件(0)