多示例多标记学习综述

王梅霞; 王慧颖; 黄苗苗; 赵宇海

doi:10.7544/issn1000-1239.202440643

摘要: 多示例多标记学习（multi-instance multi-label learning，MIML）是一种重要的机器学习框架，用于处理每个对象被表示为多个示例构成的多示例包，并且是一种能够同时与多个标记相关联的复杂学习任务，被广泛应用于药物活性预测、医学图像分析等领域。与传统的多示例学习和多标记学习不同，MIML同时考虑了输入空间（多示例）和输出空间（多标记）的多样性，使得模型能更全面地描述和理解现实世界的复杂对象，但同样也面临着计算复杂性高、模型难以优化和泛化能力受限等挑战。因此，MIML问题受到了研究者们的广泛关注。然而，目前尚缺乏对MIML研究的完整综述。首先给出与MIML相关的问题和符号定义；其次从数据复杂性的角度将MIML划分为标准MIML、多元MIML和非精确标记的MIML三类，并且分别从问题求解策略、示例来源和标记完整性的角度，将3类MIML细化为9个小类进行介绍和分析；然后给出了MIML方法的常用数据集和实验对比结果；最后介绍了5种常见的MIML实际应用场景，展望了MIML领域的4个未来研究方向并对全文进行总结。

Abstract: Multi-instance multi-label learning (MIML) is an important machine learning framework designed to address complex tasks where each object is represented as a bag of multiple instances and can be associated with multiple labels simultaneously. It has been widely applied in domains such as drug activity prediction and medical image analysis. Different from traditional multi-instance learning and multi-label learning, MIML simultaneously captures the diversity in both the input space (multiple instances) and the output space (multiple labels), enabling more comprehensive modeling and understanding of real-world complex objects. However, this expressiveness also brings significant challenges, including high computational complexity, difficulties in model optimization, and limited generalization capabilities. Therefore, MIML has attracted increasing attention from the research community. Despite this growing interest, a comprehensive survey of MIML is still lacking. We first introduce the formal problem definition and notations related to MIML. Then, from the perspective of data complexity, we categorize MIML problems into three main types: standard MIML, multi-source MIML, and MIML with imprecise labeling. These are further refined into nine subcategories based on solution strategies, instance sources, and label completeness, which are systematically discussed and analyzed. We also summarize commonly used datasets and benchmark comparisons for MIML methods. Finally, we present five representative application scenarios of MIML in real-world tasks, outline four promising future research directions, and conclude the paper.

多示例多标记学习综述

Survey on Multi-Instance Multi-Label Learning