TY - GEN
T1 - Text detection in natural and computer-generated images
AU - Özgen, Azmi Can
AU - Fasounaki, Mandana
AU - Ekenel, Hazim Kemal
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/5
Y1 - 2018/7/5
N2 - Text detection is one of the most challenging and commonly dealt applications in computer vision. Detecting text regions is the first step of the text recognition systems called Optical Character Recognition. This process requires the separation of text region from non-text region. In this paper, we utilize Maximally Stable Extremal Regions to acquire very first text region candidates. Then these possible regions are reduced in quantity by using geometric and stroke width properties. Candidate regions are joined to obtain text groups. Finally, Tesseract Optical Character Recognition engine is utilized as the last step to eliminate non-text groups. We evaluated the proposed system on KAIST and ICDAR datasets for both natural images and computer-generated images. For natural images 82.7% precision and 52.0% f-accuracy; for computer-generated images 64.0% precision and 65.2% f-accuracy is achieved.
AB - Text detection is one of the most challenging and commonly dealt applications in computer vision. Detecting text regions is the first step of the text recognition systems called Optical Character Recognition. This process requires the separation of text region from non-text region. In this paper, we utilize Maximally Stable Extremal Regions to acquire very first text region candidates. Then these possible regions are reduced in quantity by using geometric and stroke width properties. Candidate regions are joined to obtain text groups. Finally, Tesseract Optical Character Recognition engine is utilized as the last step to eliminate non-text groups. We evaluated the proposed system on KAIST and ICDAR datasets for both natural images and computer-generated images. For natural images 82.7% precision and 52.0% f-accuracy; for computer-generated images 64.0% precision and 65.2% f-accuracy is achieved.
KW - Geometric and stroke width properties
KW - Maximally stable extremal regions
KW - Non-text region elimination
KW - Text detection
UR - http://www.scopus.com/inward/record.url?scp=85050822432&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85050822432&partnerID=8YFLogxK
U2 - 10.1109/SIU.2018.8404600
DO - 10.1109/SIU.2018.8404600
M3 - Conference contribution
AN - SCOPUS:85050822432
T3 - 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018
SP - 1
EP - 4
BT - 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018
Y2 - 2 May 2018 through 5 May 2018
ER -