×

Hierarchical classification and feature reduction for fast face detection with support vector machines. (English) Zbl 1045.68116

Summary: We present a two-step method to speed-up object detection systems in computer vision that use support vector machines as classifiers. In the first step we build a hierarchy of classifiers. On the bottom level, a simple and fast linear classifier analyzes the whole image and rejects large parts of the background. On the top level, a slower but more accurate classifier performs the final detection. We propose a new method for automatically building and training a hierarchy of classifiers. In the second step we apply feature reduction to the top level classifier by choosing relevant image features according to a measure derived from statistical learning theory. Experiments with a face detection system show that combining feature reduction with hierarchical classification leads to a speed-up by a factor of 335 with similar classification performance.

MSC:

68T10 Pattern recognition, speech recognition
Full Text: DOI

References:

[1] B. Heisele, T. Poggio, M. Pontil, Face detection in still gray images, A.I. Memo 1687, Center for Biological and Computational Learning, MIT, Cambridge, MA, 2000.; B. Heisele, T. Poggio, M. Pontil, Face detection in still gray images, A.I. Memo 1687, Center for Biological and Computational Learning, MIT, Cambridge, MA, 2000.
[2] Rosenfeld, A.; Vanderbrug, G. J., Coarse-fine template matching, IEEE Trans. Syst. Man Cybernet., 2, 104-107 (1977)
[3] Burt, P. J., Smart sensing within a pyramid vision machine, Proc. IEEE, 76, 8, 1006-1015 (1988)
[4] J. Edwards, H. Murase, Appearance matching of occluded objects using coarse-to-fine adaptive masks, Proc. IEEE Comput. Vision Pattern Recognition, Rierto Rico, 1997, pp. 533-539.; J. Edwards, H. Murase, Appearance matching of occluded objects using coarse-to-fine adaptive masks, Proc. IEEE Comput. Vision Pattern Recognition, Rierto Rico, 1997, pp. 533-539.
[5] H.A. Rowley, Neural network-based face detection, Ph.D. Thesis, CMU, School of Computer Science, Pittsburgh, 1999.; H.A. Rowley, Neural network-based face detection, Ph.D. Thesis, CMU, School of Computer Science, Pittsburgh, 1999.
[6] Blum, A.; Langley, P., Selection of relevant features and examples in machine learning, Artif. Intell., 10, 245-271 (1997) · Zbl 0904.68142
[7] Kohavi, R., Wrappers for feature subset selection, Artificial Intelligence (special issue on relevance), 97, 273-324 (1995) · Zbl 0904.68143
[8] P. Viola, M. Jones, Robust real-time face detection, in: Proceedings of Eighth International Conference on Computer Vision, Vancouver, Vol. 20(11), 2001, pp. 1254-1259.; P. Viola, M. Jones, Robust real-time face detection, in: Proceedings of Eighth International Conference on Computer Vision, Vancouver, Vol. 20(11), 2001, pp. 1254-1259.
[9] Vapnik, V., Statistical Learning Theory (1998), Wiley: Wiley New York · Zbl 0935.62007
[10] M. Oren, C. Papageorgiou, P. Sinha, E. Osuna, T. Poggio, Pedestrian detection using wavelet templates, in: IEEE Conference on Computer Vision and Pattern Recognition, San Juan, 1997, pp. 193-199.; M. Oren, C. Papageorgiou, P. Sinha, E. Osuna, T. Poggio, Pedestrian detection using wavelet templates, in: IEEE Conference on Computer Vision and Pattern Recognition, San Juan, 1997, pp. 193-199.
[11] K.-K. Sung, Learning and example selection for object and pattern recognition, Ph.D. Thesis, MIT, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Cambridge, MA, 1996.; K.-K. Sung, Learning and example selection for object and pattern recognition, Ph.D. Thesis, MIT, Artificial Intelligence Laboratory and Center for Biological and Computational Learning, Cambridge, MA, 1996.
[12] H.A. Rowley, S. Baluja, T. Kanade, Rotation invariant neural network-based face detection, Computer Science Technical Report CMU-CS-97-201, CMU, Pittsburgh, 1997.; H.A. Rowley, S. Baluja, T. Kanade, Rotation invariant neural network-based face detection, Computer Science Technical Report CMU-CS-97-201, CMU, Pittsburgh, 1997.
[13] J. Weston, S. Mukherjee, O. Chapelle, M. Pontil, T. Poggio, V. Vapnik, Feature selection for support vector machines, in: T.K. Leen, T.G. Diettrich, V. Tresp (Eds.), Advances in Neural Information Processing Systems, Vol. 13, MIT Press, Cambridge, MA, 2001, pp. 668-674.; J. Weston, S. Mukherjee, O. Chapelle, M. Pontil, T. Poggio, V. Vapnik, Feature selection for support vector machines, in: T.K. Leen, T.G. Diettrich, V. Tresp (Eds.), Advances in Neural Information Processing Systems, Vol. 13, MIT Press, Cambridge, MA, 2001, pp. 668-674.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.