Automatic character detection and segmentation in natural scene images

Abstract

We present a robust connected-component (CC) based method for automatic detection and segmentation of text in real-scene images. The technique can be applied to robot vision, sign recognition, meeting processing and video indexing. First, a Non-Linear Niblack method (NLNiblack) is proposed to decompose the image into candidate CCs. These CCs are then fed into a cascade of classifiers trained with the AdaBoost algorithm, in which each classifier responds to one feature of the CC. We propose 12 novel features that are insensitive to noise, scale, text orientation and text language. The classifier cascade allows non-text CCs to be discarded rapidly, so that more computation is spent on promising text-like CCs. The CCs that pass through the whole cascade are treated as text components and are used to form the segmentation result. A prototype system was built, and experimental results demonstrate the effectiveness and efficiency of the proposed method.
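The two stages described above can be illustrated with a minimal sketch. The abstract does not specify the NLNiblack variant or the 12 features, so the code below is not the authors' method: it substitutes the classical Niblack threshold T = m + k*s (local mean m, local standard deviation s) for NLNiblack, and it uses two hypothetical features, aspect_ratio and fill_ratio, with hand-picked bounds in place of AdaBoost-trained classifiers. All function names, parameters and threshold values are assumptions chosen for illustration.

```python
import numpy as np


def niblack_binarize(gray, window=15, k=-0.2):
    """Classical Niblack thresholding (stand-in for the paper's NLNiblack).

    A pixel is foreground when it exceeds T = mean + k * std, where mean
    and std are taken over a window x window neighbourhood.
    """
    if window % 2 == 0:
        raise ValueError("window must be odd")
    gray = np.asarray(gray, dtype=np.float64)
    h, w = gray.shape
    padded = np.pad(gray, window // 2, mode="reflect")

    def window_sum(a):
        # Integral image: each window sum costs four lookups.
        ii = np.pad(a, ((1, 0), (1, 0))).cumsum(0).cumsum(1)
        return (ii[window:window + h, window:window + w]
                - ii[:h, window:window + w]
                - ii[window:window + h, :w]
                + ii[:h, :w])

    n = window * window
    mean = window_sum(padded) / n
    var = np.maximum(window_sum(padded ** 2) / n - mean ** 2, 0.0)
    return gray > mean + k * np.sqrt(var)


def aspect_ratio(cc_mask):
    """Hypothetical cheap feature: bounding-box width over height."""
    height, width = cc_mask.shape
    return width / height


def fill_ratio(cc_mask):
    """Hypothetical feature: fraction of the bounding box that is foreground."""
    return float(cc_mask.mean())


# One feature per stage, cheapest first. The bounds are illustrative
# assumptions, not trained values.
STAGES = [
    (aspect_ratio, 0.1, 10.0),
    (fill_ratio, 0.1, 0.9),
]


def passes_cascade(cc_mask, stages=STAGES):
    """Return False at the first failing stage; later features are not computed."""
    for feature, low, high in stages:
        if not (low <= feature(cc_mask) <= high):
            return False
    return True
```

Here cc_mask is assumed to be the boolean bounding-box crop of one connected component. Because a component is rejected at the first failing stage, most non-text components cost only one or two feature evaluations, which is the efficiency argument the abstract makes for the cascade.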

Author information

Authors and Affiliations

Department of Computer Science and Technology, Shanghai Jiao Tong University, Shanghai, 200030, China
Zhu Kai-hua, Qi Fei-hu, Jiang Ren-jie & Xu Li

Additional information

Project supported by OMRON under the PVS project

About this article

Cite this article

Zhu, Kh., Qi, Fh., Jiang, Rj. et al. Automatic character detection and segmentation in natural scene images. J. Zhejiang Univ. - Sci. A 8, 63–71 (2007). https://doi.org/10.1631/jzus.2007.A0063

Received: 18 October 2005
Accepted: 22 February 2006
Published: 01 January 2007
Issue Date: January 2007
DOI: https://doi.org/10.1631/jzus.2007.A0063

CLC number

TP391.41
