
<?xml version="1.0" encoding="UTF-8"?>
<record>
  <title>Scene Text Detection based on Enhanced Mlti-channels MSER and a Fast Text Grouping Process</title>
  <journal>International Journal of Computational Linguistics Research</journal>
  <author>Jin Dai, Zu Wang, Xianjing Zhao, Shuai Shao</author>
  <volume>9</volume>
  <issue>2</issue>
  <year>2018</year>
  <doi>https://doi.org/10.6025/jcl/2018/9/2/47-59</doi>
  <url>http://www.dline.info/jcl/fulltext/v9n2/jclv9n2_1.pdf</url>
  <abstract>Scene text detection has become a popular research in computer vision and pattern recognition field in recent
years because of the accurate and rich information carried by scene text. Now component-based methods have become the
trend, and the detection result is largely determined by the success of filtering text-like non-text regions. The main task of this
paper is to reduce the time complexity without a big fall in recall. First an enhanced multi-channels MSER model is introduced.
Before extracting MSER, the image is sharpened by using the Laplacian and Gaussian blur and multi-channel is
utilized, then the step of the threshold used in MSER algorithm is set to the minimum in order to get add the more refined
MSERs. Second, two novel scene text features local contrast and boundary key points are proposed to better distinguish text
regions from non-text regions. Finally, a fast text grouping algorithm is achieved which reduces the time complexity from O
(n2) to O(nlog2n). Experiments on both ICDAR 2011 and ICDAR 2013 show that the recall of the proposed method is improved
by 3%.</abstract>
</record>
