• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

一种层次化的联合识别模型

陈耀东, 李仁发

陈耀东, 李仁发. 一种层次化的联合识别模型[J]. 计算机研究与发展, 2015, 52(11): 2431-2440. DOI: 10.7544/issn1000-1239.2015.20140492
引用本文: 陈耀东, 李仁发. 一种层次化的联合识别模型[J]. 计算机研究与发展, 2015, 52(11): 2431-2440. DOI: 10.7544/issn1000-1239.2015.20140492
Chen Yaodong, Li Renfa. A Hierarchical Model for Joint Object Detection and Pose Estimation[J]. Journal of Computer Research and Development, 2015, 52(11): 2431-2440. DOI: 10.7544/issn1000-1239.2015.20140492
Citation: Chen Yaodong, Li Renfa. A Hierarchical Model for Joint Object Detection and Pose Estimation[J]. Journal of Computer Research and Development, 2015, 52(11): 2431-2440. DOI: 10.7544/issn1000-1239.2015.20140492
陈耀东, 李仁发. 一种层次化的联合识别模型[J]. 计算机研究与发展, 2015, 52(11): 2431-2440. CSTR: 32373.14.issn1000-1239.2015.20140492
引用本文: 陈耀东, 李仁发. 一种层次化的联合识别模型[J]. 计算机研究与发展, 2015, 52(11): 2431-2440. CSTR: 32373.14.issn1000-1239.2015.20140492
Chen Yaodong, Li Renfa. A Hierarchical Model for Joint Object Detection and Pose Estimation[J]. Journal of Computer Research and Development, 2015, 52(11): 2431-2440. CSTR: 32373.14.issn1000-1239.2015.20140492
Citation: Chen Yaodong, Li Renfa. A Hierarchical Model for Joint Object Detection and Pose Estimation[J]. Journal of Computer Research and Development, 2015, 52(11): 2431-2440. CSTR: 32373.14.issn1000-1239.2015.20140492

一种层次化的联合识别模型

基金项目: 国家自然科学基金项目(60873047,61173036)
详细信息
  • 中图分类号: TP391.4

A Hierarchical Model for Joint Object Detection and Pose Estimation

  • 摘要: 目标检测与姿态估计在当前视觉研究中分属不同的任务,但两者在研究方法和现实应用上具有较强的互补性.提出了一种混合的层次树模型,该模型包含3类结点,分别描述整体目标、判别部件和组件(即语义部件).中间层的判别部件兼顾承上(目标)与启下(组件)的功能,一方面刻画整体目标的局部特征,另一方面隐含多组件的共现信息.相比当前最新的联合模型,层次树模型能够并行化处理检测与估计,避免串联化联合引发的错误传播.采用基于隐变量的结构化支持向量机训练模型,同时提出了一种新的部件学习方法以自动地初始化和优化判别部件.实验设计了多任务识别和单任务识别2种评估场景,对比了本文模型与当前主流的联合识别模型,实验结果说明层次化模型具有更强的识别性能以及更高的时效性.
    Abstract: Object detection and pose estimation belong to different tasks in computer vision. Viewed from research methods and practical application, there is great complementarity between these two tasks. This paper presents a mixture of hierarchical tree models that consists of three types of nodes, representing the whole object, discriminative parts and components (i.e. semantic parts) respectively. A key point of the model is that the discriminative parts in the middle level characterize not only object features but also mutual information among components. The proposed model can detect articulated objects and estimate their poses in parallel so as to address the error propagation problem that exists in previous joint models. For training the model, we use a latent structured SVM method where the discriminative nodes are viewed as latent variables. A novel learning method is introduced to initialize and optimize the parameters of the discriminative parts automatically. In experiments we design two evaluation scenarios (i.e. multi-task recognition and single-task recognition) to compare the proposed model and obtain the performance with the state-of-the-art joint methods on PASCAL VOC datasets. The results show that the hierarchical model not only outperforms other joint models in both recognition rate, but also has higher time-effectiveness.
计量
  • 文章访问数:  1392
  • HTML全文浏览量:  0
  • PDF下载量:  928
  • 被引次数: 0
出版历程
  • 发布日期:  2015-10-31

目录

    /

    返回文章
    返回