
Caption Generation from Product Image Based on Tag Refinement and Syntactic Tree

Zhang Hongbin, Ji Donghong, Yin Lan, Ren Yafeng, Niu Zhengyu

Citation: Zhang Hongbin, Ji Donghong, Yin Lan, Ren Yafeng, Niu Zhengyu. Caption Generation from Product Image Based on Tag Refinement and Syntactic Tree[J]. Journal of Computer Research and Development, 2016, 53(11): 2542-2555. DOI: 10.7544/issn1000-1239.2016.20150906. CSTR: 32373.14.issn1000-1239.2016.20150906


Funding: This work was supported by the National Natural Science Foundation of China (61133012), the National Social Science Major Tender Project (11&ZD189), the Humanity and Social Science Foundation of Ministry of Education (16YJAZH029), the Science and Technology Research Project of Jiangxi Provincial Department of Science and Technology (20121BBG70050, 20142BBG70011), the Humanity and Social Science Foundation of Jiangxi Provincial Universities (XW1502, TQ1503), the Visiting Scholar Special Fund for the Development Plan of Young and Middle-Aged Teachers of General Universities in Jiangxi Province, and the Social Science Planning Project of Jiangxi Province (16TQ02).
  • Chinese Library Classification (CLC) number: TP391


  • Abstract: Automatic caption generation from product images is an interesting and challenging task in image annotation. Interference from noisy words and inaccurate syntactic structures are the key problems that limit this research. For the first problem, the idea of tag refinement (TR) is presented. In the first tag refinement, the absolute rank (AR) feature is applied to strengthen the weights of key words. In the second tag refinement, the semantic correlation score of each word is calculated, and the words that most accurately describe the image content are selected for caption generation. A natural language generation (NLG) algorithm named word sequence blocks building (WSBB) is then designed to assemble the key words into N-gram word sequences. For the second problem, the idea of the syntactic tree (ST) is presented: a complete syntactic tree is constructed recursively from the N-gram word sequences and predefined syntactic subtrees, and the sentence is generated by traversing the leaf nodes of the tree. Experimental results show that both tag refinement and the syntactic tree help to improve annotation performance. Both the semantic information compatibility and the syntactic mode compatibility of the generated sentences are retained, and the sentences are more coherent and fluent.
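The last step named in the abstract, reading a sentence off the leaf nodes of a syntactic tree assembled from word sequences and subtrees, can be illustrated with a minimal sketch. This is not the authors' implementation: the `Node` class, the `phrase` helper, the phrase labels, and the example words are all hypothetical, and the subtree templates and scoring used in the paper are not given on this page.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    """A syntactic-tree node: internal nodes carry a label, leaves also carry a word."""
    label: str
    word: Optional[str] = None
    children: List["Node"] = field(default_factory=list)


def leaf(label: str, word: str) -> Node:
    """Create a leaf node holding a single word."""
    return Node(label=label, word=word)


def phrase(label: str, tagged_words: List[Tuple[str, str]]) -> Node:
    """Wrap a word sequence as a flat subtree (hypothetical helper, not from the paper)."""
    return Node(label=label, children=[leaf(tag, w) for w, tag in tagged_words])


def leaves(node: Node) -> List[str]:
    """Collect leaf words left to right with a recursive depth-first traversal."""
    if not node.children:
        return [node.word] if node.word is not None else []
    words: List[str] = []
    for child in node.children:
        words.extend(leaves(child))
    return words


def sentence(root: Node) -> str:
    """Join the leaf words of the tree into a sentence string."""
    return " ".join(leaves(root))


# Hypothetical example: two word sequences attached under a sentence-level subtree.
subject = phrase("NP", [("this", "DT"), ("leather", "NN"), ("bag", "NN")])
predicate = Node("VP", children=[
    leaf("VBZ", "has"),
    phrase("NP", [("a", "DT"), ("zip", "NN"), ("closure", "NN")]),
])
root = Node("S", children=[subject, predicate])
caption = sentence(root)
```

Because the word order comes only from the left-to-right order of the leaves, the shape of the tree fixes the word order of the output, which is one way to read the abstract's claim that the tree keeps the syntactic structure of the sentence coherent.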

Publication history
  • Publication date: 2016-10-31
