ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2019, Vol. 56 ›› Issue (7): 1525-1533.doi: 10.7544/issn1000-1239.2019.20180543

Previous Articles     Next Articles

Chinese Text Extraction Method of Natural Scene Images Based on City Monitoring

Xiao Ke1, Dai Shun1, He Yunhua1, Sun Limin2   

  1. 1(School of Information Science and Technology, North China University of Technology, Beijing 100144);2(Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093)
  • Online:2019-07-01

Abstract: Efficient environment monitoring and information analysis in urban scenes has become one of primary tasks of smart cities. In smart cities, the recognition of text information in scene images, especially the extraction of Chinese text in scene images, is an intuitive and efficient method for analyzing scene information. However, the Chinese text extraction of the current scene images fails to achieve good results because of the uneven illumination and blurred images. In addition, the complexity of Chinese character structure is also an important factor affecting the Chinese text extraction. In order to solve this problem, this paper proposes an edge enhanced maximally stable extremal regions (MSER) detection method, which can extract the MSER under the conditions of illumination and blurring influence, and the non-MSER can be efficiently filtered by geometric feature constraints to obtain high quality candidate MSER. Then the proposed central aggregation is used to aggregate the candidate Chinese text field that has been divided into multiple MSER, so that the candidate region becomes a single candidate Chinese text component, and then these components are analyzed, and finally the correct Chinese text is selected by machine learning. Experiments show that the algorithm can extract Chinese text in natural scene images more effectively.

Key words: text extraction, maximally stable extremal regions (MSER), Chinese aggregation, support vector machine (SVM), Internet of things (IoT)

CLC Number: