Evaluation of Commercial OCR: A New Goal Directed Methodology for Video Documents

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3686))

Included in the following conference series:

International Conference on Pattern Recognition and Image Analysis

1878 Accesses
1 Citations

Abstract

Texts embedded in video streams convey crucial information for documentation. Many text detection and recognition systems have been designed to automatically extract such documentary data from video streams. Most of the research teams involved argue that commercial OCR do not work properly on images extracted from a video stream. They thus concieve their own detection systems. Nevertheless, commercial OCR have never been evaluated on such corpora. This article details a new methodology to evaluate a commercial OCR on a video document. This methodology is goal directed: the system is penalized proportionally to TFIDF (Term Frequency Inverse Document Frequency) scores of texts [1]. We experiment our methodology on Abbyy FineReader 6.0.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Extracting Multi-Language Text from Video into Editable Form

Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown

Article 26 January 2022

ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition

References

Jones, K.S., Walker, S., Robertson, S.: A probabilistic model of information retrieval: development and status. Technical Report Technical Report 446, University of Cambridge Computer Laboratory (1998)
Google Scholar
Wu, V., Manmatha, R., Riseman, E.: Textfinder: An automatic system to detect and recognize text in images. IEEE Transactions on Pattern Analysis and Machine Intelligence 21, 1224–1229 (1999)
Article Google Scholar
Wolf, C., Jolion, J.-M.: Extraction and recognition of artificial text in multimedia documents. Pattern Analysis and Applications 6, 309–326 (2003)
MathSciNet Google Scholar
Li, H., Doermann, D.: Automatic text detection and tracking in digital video. IEEE Transactions on Image Processing 9, 147–156 (2000)
Article Google Scholar
Chen, D., Odobez, J.M., Boulard, H.: Text detection and recognition in images and video frames. Pattern Recognition 37, 595–608 (2004)
Article Google Scholar
Wolf, C.: Détection de textes dans des images issues d’un flux vidéo pour l’indexation sémantique. PhD thesis, Institut National de Sciences Appliquées de Lyon, France (2003)
Google Scholar
Li, H.: Automatic processing and analysis of text in digital video. PhD thesis, University of Maryland, College Park (2000)
Google Scholar
Hua, X.S., Wenyin, L., Zhang, H.J.: An automatic performance evaluation protocol for video text detection algorithms. IEEE Trans. on Circuits and Systems for Video Technology 14, 498–507 (2004)
Article Google Scholar
Doermann, D., Mihalcik, D.: Tools and techniques for video performance evaluation. In: Proceedings of the ICPR 2000, vol. 4, pp. 4167–4170. IEEE Computer Society, Los Alamitos (2000)
Google Scholar
Yanikoglu, B., Vincent, L.: Pink panther: A complete environment for ground-truthing and benchmarking document page segmentation. Pattern Recognition 31, 1191–1204 (1998)
Article Google Scholar
Lee, C.H., Kanungo, T.: The architecture of trueviz: a groundtruth/metadata editing and visualizing toolkit. Pattern Recognition 36, 811–825 (2003)
Article Google Scholar
Fruchterman, T.: Dafs: A standard for document and image understanding. In: Proceedings of the Symposium on Document Image Understanding Technology, pp. 94–100 (1995)
Google Scholar
Liang, J., Philips, I., Haralick, R.: Performance evaluation of document layout analysis algorithms on the uw data set. In: Document Recognition IV, Proceedings of the SPIE, pp. 149–160 (1996)
Google Scholar
Lienhart, R., Wernike, A.: Localizing and segmenting text in images, videos and web pages. IEEE Transactions on Circuits and Systems for Video Technology 12, 256–268 (2002)
Article Google Scholar
Mariano, V.Y., Min, J., Park, J.H., Kasturi, R., Mihalcik, D., Li, H., Doermann, D.: Performance evaluation of object detection algorithms. In: International Conference on Pattern Recognition (2002)
Google Scholar
Mao, S., Kanungo, T.: Empirical performance evaluation of page segmentation algorithms. In: Proceedingsof SPIE Conference on Document Recognition, San Jose CA (2000)
Google Scholar
Kanai, J., Rice, S., Natker, T., Nagy, G.: Automated evaluation of ocr zoning. IEEE Transactions on Pattern Analysis and Machine Intelligence 17, 86–90 (1995)
Article Google Scholar
Wagner, R., Fisher, M.J.: The string to string correction problem. Journal of Assoc. Comp. Mach. 21, 168–173 (1974)
MATH Google Scholar
Van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Dept. of Computer Science. University of Glasgow (1979)
Google Scholar
Jolion, J.: The deviation of a set strings. Pattern Analysis And Application 6, 224–231 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institut National de l’Audiovisuel, Direction de la recherche et de l’expérimentation, 4, Av. de l’Europe, 94366 cedex, Bry-sur-Marne, France
Rémi Landais & Laurent Vinet
Lyon Research Center for Images and Intelligent Information Systems (LIRIS), INSA de Lyon, Bât. J. Verne, 20, rue Albert Einstein, F-69621 cedex, Villeurbanne, France
Rémi Landais & Jean-Michel Jolion

Authors

Rémi Landais
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Vinet
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Michel Jolion
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Research School of Infomatics, Loughborough, UK
Sameer Singh
ATR Lab, Research School of Informatics, University of Loughborough, Loughborough, UK
Maneesha Singh
IBM Corporation, 1133 Wetchester Avenue, White Plains, 10604, New York, United States
Chid Apte
Institute of Computer Vision and applied Computer Sciences, IBaI, Germany
Petra Perner

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Landais, R., Vinet, L., Jolion, JM. (2005). Evaluation of Commercial OCR: A New Goal Directed Methodology for Video Documents. In: Singh, S., Singh, M., Apte, C., Perner, P. (eds) Pattern Recognition and Data Mining. ICAPR 2005. Lecture Notes in Computer Science, vol 3686. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551188_74

Download citation

DOI: https://doi.org/10.1007/11551188_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28757-5
Online ISBN: 978-3-540-28758-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Evaluation of Commercial OCR: A New Goal Directed Methodology for Video Documents

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Extracting Multi-Language Text from Video into Editable Form

Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown

ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Evaluation of Commercial OCR: A New Goal Directed Methodology for Video Documents

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Extracting Multi-Language Text from Video into Editable Form

Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown

ICDAR 2023 Competition on RoadText Video Text Detection, Tracking and Recognition

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation