• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Chen Yaodong, Li Renfa. A Hierarchical Model for Joint Object Detection and Pose Estimation[J]. Journal of Computer Research and Development, 2015, 52(11): 2431-2440. DOI: 10.7544/issn1000-1239.2015.20140492
Citation: Chen Yaodong, Li Renfa. A Hierarchical Model for Joint Object Detection and Pose Estimation[J]. Journal of Computer Research and Development, 2015, 52(11): 2431-2440. DOI: 10.7544/issn1000-1239.2015.20140492

A Hierarchical Model for Joint Object Detection and Pose Estimation

More Information
  • Published Date: October 31, 2015
  • Object detection and pose estimation belong to different tasks in computer vision. Viewed from research methods and practical application, there is great complementarity between these two tasks. This paper presents a mixture of hierarchical tree models that consists of three types of nodes, representing the whole object, discriminative parts and components (i.e. semantic parts) respectively. A key point of the model is that the discriminative parts in the middle level characterize not only object features but also mutual information among components. The proposed model can detect articulated objects and estimate their poses in parallel so as to address the error propagation problem that exists in previous joint models. For training the model, we use a latent structured SVM method where the discriminative nodes are viewed as latent variables. A novel learning method is introduced to initialize and optimize the parameters of the discriminative parts automatically. In experiments we design two evaluation scenarios (i.e. multi-task recognition and single-task recognition) to compare the proposed model and obtain the performance with the state-of-the-art joint methods on PASCAL VOC datasets. The results show that the hierarchical model not only outperforms other joint models in both recognition rate, but also has higher time-effectiveness.
  • Related Articles

    [1]Xiao Mengnan, He Ruifang, Ma Jinsong. Event Detection Based on Hierarchical Latent Semantic-Driven Network[J]. Journal of Computer Research and Development, 2024, 61(1): 184-195. DOI: 10.7544/issn1000-1239.202220447
    [2]Wang Fei, Yue Kun, Sun Zhengbao, Wu Hao, Feng Hui. Analyzing Rating Data and Modeling Dynamic Behaviors of Users Based on the Bayesian Network[J]. Journal of Computer Research and Development, 2017, 54(7): 1488-1499. DOI: 10.7544/issn1000-1239.2017.20160556
    [3]Feng Ling, Peng Zhiyong, Liu Bin, Che Dunren. A Latent-Citation-Network Based Patent Value Evaluation Method[J]. Journal of Computer Research and Development, 2015, 52(3): 649-660. DOI: 10.7544/issn1000-1239.2015.20131424
    [4]Wu Lei, Zhang Wensheng, Wang Jue. Hidden Topic Variable Graphical Model Based on Deep Learning Framework[J]. Journal of Computer Research and Development, 2015, 52(1): 191-199. DOI: 10.7544/issn1000-1239.2015.20131113
    [5]Hu Yan, Peng Qimin, Hu Xiaohui. A Personalized Web Service Recommendation Method Based on Latent Semantic Probabilistic Model[J]. Journal of Computer Research and Development, 2014, 51(8): 1781-1793. DOI: 10.7544/issn1000-1239.2014.20130024
    [6]Wang Li, Cheng Suqi, Shen Huawei, Cheng Xueqi. Structure Inference and Prediction in the Co-Evolution of Social Networks[J]. Journal of Computer Research and Development, 2013, 50(12): 2492-2503.
    [7]Ye Xiaoping. Model and Algebra of Object-Relation Bitemporal Data Based on Temporal Variables[J]. Journal of Computer Research and Development, 2007, 44(11): 1971-1979.
    [8]Ye Ning, Sun Ruixiang, Dong Yisheng. SVM Fast Training Algorithm Research Based on Multi-Lagrange Multiplier[J]. Journal of Computer Research and Development, 2006, 43(3): 442-448.
    [9]Wang Jian, Lin Fuzong. Digital Audio Watermarking Based on Support Vector Machine (SVM)[J]. Journal of Computer Research and Development, 2005, 42(9): 1605-1611.
    [10]Ye Ning, Sun Ruixiang, Dong Yisheng. MLSVM4—An SVM Fast Training Algorithm Based on Multi-Lagrange Multiplier[J]. Journal of Computer Research and Development, 2005, 42(9): 1467-1471.

Catalog

    Article views (1391) PDF downloads (928) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return