高级检索

    基于GCN-Stacking协同增强的多模态金融舆情负向情感强度识别

    GCN-Stacking collaborative enhancement-based multimodal financial public opinion negative sentiment intensity recognition

    • 摘要: 社交媒体中的金融舆情负向情感强度识别,对防控金融风险、维护市场稳定至关重要。随着网络用户表达形式的多样化,图文并茂的多模态网络信息已成为公众传递金融情感与观点的主要方式。为充分利用不同模态间的关联性与互补性,提升识别效果,本文提出一种基于GCN-Stacking协同增强的多模态金融舆情负向情感强度识别模型。该模型首先通过多模型一致性标注策略获取高置信度情感标签,利用CLIP模型提取文本与图像的基础多模态特征;随后构建多模态图结构,通过GCN实现特征的结构关联嵌入与增强,弥补传统多模态特征忽视模态间潜在关联的缺陷;最后通过基础分类器筛选与双层集成建模构建Stacking框架,优化情感强度识别性能。实验结果表明,模型最优配置下F1分数达 83.57%、AUC达95.05%,相较于单一文本最优模型、单一图像最优模型及原始CLIP多模态模型,F1分数分别提升2.48%、5.27%、1.87%,充分验证了方法的有效性。本文所提模型能够有效提升金融舆情负向情感强度识别性能,为金融舆情智能化分析提供决策参考。

       

      Abstract: The recognition of negative sentiment intensity in financial public opinion on social media is crucial for pre-venting financial risks and maintaining market stability. With the diversification of online expression formats, multimodal information combining text and images has become the primary way for the public to convey financial sentiments and opinions. To fully leverage the correlation and complementarity among different modalities and improve recognition performance, this study proposes a multimodal financial public opinion negative sentiment intensity recognition model based on GCN-Stacking collaborative enhancement. First, the model adopts a multi-model consistency annotation strategy to obtain high-confidence sentiment la-bels and uses the CLIP model to extract basic multimodal features of text and images. Subsequently, a multi-modal graph structure is constructed, and Graph Convolutional Network (GCN) is employed to realize structural association embedding and enhancement of features, making up for the defect that traditional multimodal features ignore potential inter-modal correlations. Finally, a Stacking framework is built through base classifier selection and two-layer ensemble modeling to optimize the performance of sentiment intensity recognition. Experimental results show that under the optimal model configuration, the F1-score reaches 83.57% and the AUC reaches 95.05%. Compared with the optimal single-text model, optimal single-image model, and original CLIP multimodal model, the F1-score is improved by 2.48%, 5.27%, and 1.87% respectively. These results fully verify the effectiveness of the proposed method, which can provide technical support and d

       

    /

    返回文章
    返回