A classification learning algorithm robust to irrelevant features

H. Altay Güvenir¹

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1480)

Included in the following conference series:

International Conference on Artificial Intelligence: Methodology, Systems, and Applications

Abstract

The presence of irrelevant features is a fact of life in many real-world applications of classification learning. Although nearest-neighbor classification algorithms have emerged as a promising approach to machine learning tasks because of their high predictive accuracy, they are adversely affected by such irrelevant features. In this paper, we describe a recently proposed classification algorithm called VFI5, which achieves accuracy comparable to that of nearest-neighbor classifiers while remaining robust to irrelevant features. The paper compares the nearest-neighbor classifier and the VFI5 algorithm in the presence of irrelevant features, on both artificially generated data sets and real-world data sets selected from the UCI repository.

Author information

Authors and Affiliations

Department of Computer Engineering and Information Science, Bilkent University, 06533, Ankara, Turkey
H. Altay Güvenir

Editor information

Fausto Giunchiglia

About this paper

Cite this paper

Güvenir, H.A. (1998). A classification learning algorithm robust to irrelevant features. In: Giunchiglia, F. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 1998. Lecture Notes in Computer Science, vol 1480. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0057452

DOI: https://doi.org/10.1007/BFb0057452
Published: 27 June 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64993-9
Online ISBN: 978-3-540-49793-6
eBook Packages: Springer Book Archive
