Video Texts Tracking and Segmentation Based on Multiple Frames

Mi Congjie, Liu Yang, and Xue Xiangyang

Journal of Computer Research and Development > 2006 > 43(9): 1523-1529.

Mi Congjie, Liu Yang, and Xue Xiangyang. Video Texts Tracking and Segmentation Based on Multiple Frames[J]. Journal of Computer Research and Development, 2006, 43(9): 1523-1529.

Citation:

Mi Congjie, Liu Yang, and Xue Xiangyang. Video Texts Tracking and Segmentation Based on Multiple Frames[J]. Journal of Computer Research and Development, 2006, 43(9): 1523-1529.

Citation:

Mi Congjie, Liu Yang, and Xue Xiangyang. Video Texts Tracking and Segmentation Based on Multiple Frames[J]. Journal of Computer Research and Development, 2006, 43(9): 1523-1529.

PDF (522 KB)

Video Texts Tracking and Segmentation Based on Multiple Frames

Mi Congjie, Liu Yang, and Xue Xiangyang

(Department of Computer Science and Engineering, Fudan University, Shanghai 200433)

More Information

Published Date: September 14, 2006

Graphical Abstract

Abstract

Abstract

Superimposed texts bring important semantic clues for video indexing and retrieval. Texts in videos often span tens or even hundreds of frames and many researchers have exploited the temporal redundancy of video text to improve the text detection accuracy and the text region quality. Described in this paper is a novel approach to track and segment static superimposed texts by utilizing multiple video frame information. For text detection, multiple frames are used to verify the appearance of the text regions which have been detected on a single frame. A binary-search based text tracking method is proposed, which can track the static text object efficiently by utilizing the features of the edge bit map. In order to refine the text regions, text detection is performed again on a synthesized image, which is produced by minimum/maximum pixel search on consecutive tracked frames. In text segmentation, edge features are exploited to further remove complex background in addition to traditional gray-value integration. Experimental results show the effectiveness of the proposed method.
- video text tracking,
- video text segmentation,
- multimedia information retrieval

FullText(HTML)

References (0)

[1]	Li Ang, Du Junping, Kou Feifei, Xue Zhe, Xu Xin, Xu Mingying, Jiang Yang. Scientific and Technological Information Oriented Semantics-Adversarial and Media-Adversarial Based Cross-Media Retrieval Method[J]. Journal of Computer Research and Development, 2023, 60(11): 2660-2670. DOI: 10.7544/issn1000-1239.202220430
[2]	Tu Rongcheng, Mao Xianling, Kong Weijie, Cai Chengfei, Zhao Wenzhe, Wang Hongfa, Huang Heyan. CLIP Based Multi-Event Representation Generation for Video-Text Retrieval[J]. Journal of Computer Research and Development, 2023, 60(9): 2169-2179. DOI: 10.7544/issn1000-1239.202220440
[3]	Wu Famin, Lü Guangyi, Liu Qi, He Ming, Chang Biao, He Weidong, Zhong Hui, Zhang Le. Deep Semantic Representation of Time-Sync Comments for Videos[J]. Journal of Computer Research and Development, 2019, 56(2): 293-305. DOI: 10.7544/issn1000-1239.2019.20170752
[4]	Peng Yuxin, Qi Jinwei, Huang Xin. Current Research Status and Prospects on Multimedia Content Understanding[J]. Journal of Computer Research and Development, 2019, 56(1): 183-208. DOI: 10.7544/issn1000-1239.2019.20180770
[5]	Zha Zhengjun, Zheng Xiaoju. Query and Feedback Technologies in Multimedia Information Retrieval[J]. Journal of Computer Research and Development, 2017, 54(6): 1267-1280. DOI: 10.7544/issn1000-1239.2017.20170004
[6]	Liu Ming, Liu Bingquan, and Liu Yuanchao. A Fast Clustering Algorithm for Information Retrieval[J]. Journal of Computer Research and Development, 2013, 50(7): 1452-1463.
[7]	Zhang Jing, Lu Hong, and Xue Xiangyang. Efficient Sports Video Retrieval Based on Index Structure[J]. Journal of Computer Research and Development, 2006, 43(11): 1953-1958.
[8]	Huo Longshe, Gao Wen, Huang Qingming, Xie Jianguo. Error Protection Algorithms for Scalable Multimedia Transmission: A Survey[J]. Journal of Computer Research and Development, 2005, 42(11): 1954-1961.
[9]	Liu Jun, Yang Xuejun, Tang Yuhua, and Wang Junwei. Track Replica—The Strategy for Disk Seeking Optimization in Retrieving Multimedia Data[J]. Journal of Computer Research and Development, 2005, 42(8): 1452-1459.
[10]	Wang Rongrong, Jin Wanjun, and Wu Lide. A Novel Video Caption Detection Approach Using Multi-Frame Integration[J]. Journal of Computer Research and Development, 2005, 42(7): 1191-1197.