Fuzzy-based segmentation for variable font-sized text extraction from images/videos. (English) Zbl 1407.94026

Summary: Textual information embedded in multimedia can provide a vital tool for indexing and retrieval. Much work has been done in the field of text localization and detection because of its fundamental importance. One of the biggest challenges of text detection is dealing with variation in font sizes and image resolution. This problem is aggravated by undersegmentation or oversegmentation of the regions in an image. The paper addresses this problem by proposing a novel fuzzy-based postprocessing segmentation method that can handle variation in text sizes and image resolution. The methodology is tested on the ICDAR 2011 Robust Reading Challenge dataset, which demonstrates the strength of the recommended method.

MSC:

94A08 Image processing (compression, reconstruction, etc.) in information and communication theory
94D05 Fuzzy sets and logic (in connection with information, communication, or circuits theory)

References:

[1] Li, H.; Doermann, D.; Kia, O., Automatic text detection and tracking in digital video, IEEE Transactions on Image Processing, 9, 1, 147-156 (2000) · doi:10.1109/83.817607
[2] Kim, K. I.; Jung, K.; Kim, J. H., Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm, IEEE Transactions on Pattern Analysis and Machine Intelligence, 25, 12, 1631-1639 (2003) · doi:10.1109/TPAMI.2003.1251157
[3] Zhao, M.; Li, S.; Kwok, J., Text detection in images using sparse representation with discriminative dictionaries, Image and Vision Computing, 28, 12, 1590-1599 (2010) · doi:10.1016/j.imavis.2010.04.002
[4] Wang, K.; Belongie, S., Word spotting in the wild, Proceedings of the European Conference on Computer Vision (ECCV ’10), Springer
[5] Neumann, L.; Matas, J., A method for text localization and recognition in real-world images, Proceedings of the Asian Conference on Computer Vision (ACCV ’11), Springer
[6] Shivakumara, P.; Phan, T. Q.; Tan, C. L., A Laplacian approach to multi-oriented text detection in video, IEEE Transactions on Pattern Analysis and Machine Intelligence, 33, 2, 412-419 (2011) · doi:10.1109/TPAMI.2010.166
[7] Jung, K.; Kim, K. I.; Jain, A. K., Text information extraction in images and video: a survey, Pattern Recognition, 37, 5, 977-997 (2004) · doi:10.1016/j.patcog.2003.10.012
[8] Liang, J.; Doermann, D.; Li, H., Camera-based analysis of text and documents: a survey, International Journal on Document Analysis and Recognition, 7, 2-3, 84-104 (2005) · doi:10.1007/s10032-004-0138-z
[9] Sumathi, C. P.; Santhanam, T.; Gayathri, G., A Survey on various approaches of text extraction in images, International Journal of Computer Science & Engineering Survey, 3, 4 (2012)
[10] Lienhart, R., Video OCR: A Survey and Practitioner’s Guide, Video mining (2003), Burlingame, Calif, USA: Springer
[11] Li, C.; Ding, X. G.; Wu, Y. S., An algorithm for text location in images based on histogram features and Ada-boost, Journal of Image and Graphics, 3, article 003 (2006)
[12] Kim, K. I.; Jung, K.; Kim, J. H., Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm, IEEE Transactions on Pattern Analysis and Machine Intelligence, 25, 12, 1631-1639 (2003) · doi:10.1109/TPAMI.2003.1251157
[13] Lienhart, R.; Wernicke, A., Localizing and segmenting text in images and videos, IEEE Transactions on Circuits and Systems for Video Technology, 12, 4, 256-268 (2002) · doi:10.1109/76.999203
[14] Gllavata, J.; Qeli, E.; Freisleben, B., Detecting text in videos using fuzzy clustering ensembles, Proceedings of the 8th IEEE International Symposium on Multimedia (ISM ’06), IEEE · doi:10.1109/ISM.2006.60
[15] Chen, D.; Odobez, J.-M.; Bourlard, H., Text detection and recognition in images and video frames, Pattern Recognition, 37, 3, 595-608 (2004) · doi:10.1016/j.patcog.2003.06.001
[16] Fabrizio, J.; Cord, M.; Marcotegui, B., Text Extraction from Street Level Images, City Models, Roads and Traffic (CMRT), 3 (2009)
[17] León Cristóbal, M.; Vilaplana Besler, V.; Gasull Llampallas, A.; Marqués Acosta, F., Region-Based Caption Text Extraction (2012)
[18] Epshtein, B.; Ofek, E.; Wexler, Y., Detecting text in natural scenes with stroke width transform, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR ’10), IEEE · doi:10.1109/CVPR.2010.5540041
[19] Anthimopoulos, M.; Gatos, B.; Pratikakis, I., A two-stage scheme for text detection in video images, Image and Vision Computing, 28, 9, 1413-1426 (2010) · doi:10.1016/j.imavis.2010.03.004
[20] Neumann, L.; Matas, J., A method for text localization and recognition in real-world images, Proceedings of the Asian Conference on Computer Vision (ACCV ’10), Springer
[21] Deepa, S. T.; Victor, S. P., A novel method for text extraction, International Journal of Engineering Science & Advanced Technology, 4, 961-964 (2013)
[22] Farhoodi, R.; Kasaei, S., Text segmentation from images with textured and colored background, Proceedings of 13th Iranian Conference on Electrical Engineering
[23] Das, M. S.; Bindhu, B. H.; Govardhan, A., Evaluation of text detection and localization methods in natural images, International Journal of Emerging Technology and Advanced Engineering, 2, 6, 277-282 (2012)
[24] Deepa, S. T.; Victor, S. P., A novel method for text extraction, International Journal of Engineering Science & Advanced Technology, 2, 4, 961-964 (2013)
[25] Li, S.; Kwok, J. T., Text extraction using edge detection and morphological dilation, Proceedings of the International Symposium on Intelligent Multimedia, Video and Speech Processing (ISIMP ’04), IEEE
[26] Poignant, J.; Besacier, L.; Quenot, G.; Thollard, F., From text detection in videos to person identification, Proceedings of the IEEE International Conference on Multimedia and Expo (ICME ’12), IEEE
[27] Minetto, R.; Thome, N.; Cord, M.; Fabrizio, J.; Marcotegui, B., Snoopertext: a multiresolution system for text detection in complex visual scenes, Proceedings of the 17th IEEE International Conference on Image Processing (ICIP ’10), IEEE · doi:10.1109/ICIP.2010.5651761
[28] Anthimopoulos, M.; Gatos, B.; Pratikakis, I., Multiresolution text detection in video frames, Proceedings of the 2nd International Conference on Computer Vision Theory and Applications (VISAPP ’07)
[29] Wolf, C.; Jolion, J.-M., Extraction and recognition of artificial text in multimedia documents, Pattern Analysis and Applications, 6, 4, 309-326 (2004)
[30] Pan, Y.-F.; Hou, X.; Liu, C.-L., A hybrid approach to detect and localize texts in natural scene images, IEEE Transactions on Image Processing, 20, 3, 800-813 (2011) · Zbl 1372.94199 · doi:10.1109/TIP.2010.2070803
[31] Gonzalez, A.; Bergasa, L. M., A text reading algorithm for natural images, Image and Vision Computing, 31, 255-274 (2013) · doi:10.1016/j.imavis.2013.01.003
[32] Shi, C.; Wang, C.; Xiao, B.; Zhang, Y.; Gao, S., Scene text detection using graph model built upon maximally stable extremal regions, Pattern Recognition Letters, 34, 107-116 (2012)
[33] Yao, C.; Bai, X.; Liu, W.; Ma, Y.; Tu, Z., Detecting texts of arbitrary orientations in natural images, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR ’12), IEEE
[34] Tobias, O. J.; Seara, R., Image segmentation by histogram thresholding using fuzzy sets, IEEE Transactions on Image Processing, 11, 12, 1457-1465 (2002) · doi:10.1109/TIP.2002.806231
[35] Senthilkumaran, N.; Rajesh, R., Edge detection techniques for image segmentation-a survey of soft computing approaches, International Journal of Recent Trends in Engineering, 1, 2, 250-254 (2009)
[36] Wang, L. X., A Course in Fuzzy Systems (1999), Upper Saddle River, NJ, USA: Prentice-Hall Press
[37] Tehsin, S.; Masood, A.; Kausar, S.; Javed, Y., Text localization and detection method for born-digital images, IETE Journal of Research, 59, 4, 343-349 (2013) · doi:10.4103/0377-2063.118025
[38] Karatzas, D.; Mestre, S. R.; Mas, J.; Nourbakhsh, F.; Roy, P. P., ICDAR 2011 robust reading competition—challenge 1: reading text in born-digital images (web and email), Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR ’11), IEEE · doi:10.1109/ICDAR.2011.295
[39] Wolf, C.; Jolion, J.-M., Object count/area graphs for the evaluation of object detection and segmentation algorithms, International Journal on Document Analysis and Recognition, 8, 4, 280-296 (2006) · doi:10.1007/s10032-006-0014-0