A novel scene text detection algorithm based on convolutional neural network

Xiaohang Ren, Kai Chen, Xiaokang Yang, Yi Zhou, Jianhua He, Jun Sun

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Candidate text region extraction plays a critical role in convolutional neural network (CNN) based text detection from natural images. In this paper, we propose a CNN based scene text detection algorithm with a new text region extractor. The so called candidate text region extractor I-MSER is based on Maximally Stable Extremal Region (MSER), which can improve the independency and completeness of the extracted candidate text regions. Design of I-MSER is motivated by the observation that text MSERs have high similarity and are close to each other. The independency of candidate text regions obtained by I-MSER is guaranteed by selecting the most representative regions from a MSER tree which is generated according to the spatial overlapping relationship among the MSERs. A multi-layer CNN model is trained to score the confidence value of the extracted regions extracted by the I-MSER for text detection. The new text detection algorithm based on I-MSER is evaluated with wide-used ICDAR 2011 and 2013 datasets and shows improved detection performance compared to the existing algorithms.

Original languageEnglish
Title of host publication2016 Visual Communication and Image Processing (VCIP)
PublisherIEEE
Pages1-4
Number of pages4
ISBN (Electronic)9781509053162
ISBN (Print)9781509053179
DOIs
Publication statusPublished - 5 Jan 2017
Event2016 IEEE Visual Communication and Image Processing, VCIP 2016 - Chengdu, China
Duration: 27 Nov 201630 Nov 2016

Conference

Conference2016 IEEE Visual Communication and Image Processing, VCIP 2016
CountryChina
CityChengdu
Period27/11/1630/11/16

    Fingerprint

Keywords

  • CNN
  • Deep learning model
  • I-MSER
  • Scene Text Detection
  • Text region extraction

Cite this

Ren, X., Chen, K., Yang, X., Zhou, Y., He, J., & Sun, J. (2017). A novel scene text detection algorithm based on convolutional neural network. In 2016 Visual Communication and Image Processing (VCIP) (pp. 1-4). [7805444] IEEE. https://doi.org/10.1109/VCIP.2016.7805444