×

Data mining with a simulated annealing based fuzzy classification system. (English) Zbl 1131.68514

Summary: The use of Simulated Annealing (SA) metaheuristic for constructing a fuzzy classification system is presented. In several previous investigations, the capability of fuzzy systems to solve different kinds of problems has been demonstrated. Simulated Annealing Based Fuzzy Classification System (SAFCS), hybridizes the learning capability of SA metaheuristic with the approximate reasoning method of fuzzy systems. The objective of this paper is to illustrate the ability of SA to develop an accurate fuzzy classifier. The use of SA in classification is an attempt to effectively explore and exploit the large search space usually associated with classification problems, and find the optimum set of fuzzy if-then rules. The SAFCS would be capable to extract accurate fuzzy classification rules from input data sets, and applies them to classify new data instances in different predefined groups or classes. Experiments are performed with eight UCI data sets. The results indicate that the proposed SAFCS achieves competitive results in comparison with several well-known classification algorithms.

MSC:

68T10 Pattern recognition, speech recognition
68T05 Learning and adaptive systems in artificial intelligence

Software:

UCI-ml; LIBSVM
Full Text: DOI

References:

[1] Han, J.; Kamber, M., Data Mining: Concepts and Techniques (2000), Morgan Kaufmann: Morgan Kaufmann Los Altos, CA
[2] Michalski, R. S.; Bratko, I.; Kubat, M., Machine Learning and Data Mining: Methods and Applications (1998), Wiley: Wiley New York
[3] Hu, Y. C.; Chen, R. S.; Tzeng, G. H., Finding fuzzy classification rules using data mining techniques, Pattern Recognition Lett., 24, 509-519 (2003) · Zbl 1053.68084
[4] Shen, Q.; Chouchoulas, A., A rough-fuzzy approach for generating classification rules, Pattern Recognition, 35, 2425-2438 (2002) · Zbl 1006.68902
[5] Lin, C. T.; Yeh, C. M.; Liang, S. F.; Chung, J. F.; Kumar, N., Support-vector-based fuzzy neural network for pattern classification, IEEE Trans. Fuzzy Syst., 14, 1, 31-41 (2006)
[6] Sexton, R. S.; McMurtrey, S.; Cleavenger, D., Knowledge discovery using a neural network simultaneous optimization algorithm on a real world classification problem, Eur. J. Oper. Res., 168, 3, 1009-1018 (2006) · Zbl 1101.90404
[7] Zhang, G. P., Neural networks for classification: a survey, IEEE Trans. Syst. Man Cybern. C—Appl. Rev., 30, 4, 451-461 (2006)
[8] Cordón, O.; Gomide, F.; Herrera, F.; Hoffmann, F.; Magdalena, L., Ten years of genetic fuzzy systems: current framework and new trends, Fuzzy Sets Syst., 141, 1, 5-31 (2004) · Zbl 1050.93513
[9] Ishibuchi, H.; Nojima, Y., Analysis of interpretability-accuracy tradeoff of fuzzy systems by multiobjective fuzzy genetics-based machine learning, Int. J. Approximate Reasoning, 44, 1, 4-31 (2007) · Zbl 1109.68091
[10] Ishibuchi, H.; Yamamoto, T., Rule weight specification in fuzzy rule-based classification systems, IEEE Trans. Fuzzy Syst., 13, 4, 428-435 (2005)
[11] Holte, R., Very simple classification rules perform well in most common used data sets, Mach. Learn., 11, 63-91 (1993) · Zbl 0850.68278
[12] Sugeno, M., An introductory survey of fuzzy control, Inf. Sci., 36, 1/2, 59-83 (1985) · Zbl 0586.93053
[13] Lee, C. C., Fuzzy logic in control systems: fuzzy logic controller, part I and part II, IEEE Trans. Syst. Man Cybern., 20, 2, 404-435 (1990) · Zbl 0707.93036
[14] Wangm, L. X.; Mendel, J. M., Generating fuzzy rules by learning from examples, IEEE Trans. Syst. Man Cybern., 22, 6, 1414-1427 (1992)
[15] Ishibuchi, H.; Nozaki, K.; Tanaka, H., Distributed representation of fuzzy rules and its application to pattern classification, Fuzzy Sets Syst., 52, 1, 21-32 (1992)
[16] Abe, S.; Lan, M. S., A method for fuzzy rules extraction directly from numerical data and its application to pattern classification, IEEE Trans. Fuzzy Syst., 3, 1, 18-28 (1995)
[17] Mitra, S.; Pal, S. K., Self-organizing neural network as a fuzzy classifier, IEEE Trans. Syst. Man Cybern., 24, 3, 385-399 (1994)
[18] Metropolis, N.; Rosenbluth, A. W.; Rosenbluth, M. N.; Teller, A. H.; Teller, E., Equation of state calculation by fast computing machines, J. Chem. Phys., 21, 1087-1092 (1953) · Zbl 1431.65006
[19] Kirkpatrick, S.; Gelatt, C. D.; Vecchi, M. P., Optimization by simulated annealing, Science, 220, 671-680 (1983) · Zbl 1225.90162
[20] Abbass, H. A.; Sarker, R. A.; Newton, C. S., Data Mining: A Heuristic Approach (2002), Idea Group Publishing
[21] Debuse, J. C.; Rayward-Smith, V. J., Feature subset selection within a simulated annealing data mining algorithm, J. Intell. Inf. Syst., 9, 57-81 (1997)
[22] Bandyopadhyay, S.; Pal, S. K.; Murthy, C. A., Simulated annealing based pattern classification, Inf. Sci., 109, 1-4, 165-184 (1998)
[23] Bandyopadhyay, S., Simulated annealing using a reversible jump Markov chain Monte Carlo algorithm for fuzzy clustering, IEEE Trans. Knowl. Data Eng., 17, 4, 479-490 (2005)
[24] X.Y. Wang, J. Garibaldi, Simulated annealing fuzzy clustering in cancer diagnosis, Eur. J. Inf. (2005) 61-70.; X.Y. Wang, J. Garibaldi, Simulated annealing fuzzy clustering in cancer diagnosis, Eur. J. Inf. (2005) 61-70.
[25] UCI machine learning repository: \( \langle;\) http://www.ics.uci.edu/\( \sim;\rangle;\); UCI machine learning repository: \( \langle;\) http://www.ics.uci.edu/\( \sim;\rangle;\)
[26] D.F. Wong, C.L. Liu, A new algorithm for floorplan design, in: Proceedings of the 23rd DAC, 1986, pp. 101-107.; D.F. Wong, C.L. Liu, A new algorithm for floorplan design, in: Proceedings of the 23rd DAC, 1986, pp. 101-107.
[27] Youssef, H.; Sait, S. M.; Adiche, H., Evolutionary algorithms, simulated annealing and tabu search: a comparative study, Eng. Appl. Artif. Intell., 14, 2, 167-181 (2001)
[28] Weiss, S. M.; Kulikowski, C. A., Computer Systems that Learn (1991), Morgan Kaufmann Publishers: Morgan Kaufmann Publishers San Mateo
[29] Bacardit, J., Pittsburgh Genetics-Based Machine Learning in the Data Mining era: Representation, Generalization, and Run-time (2004), Ramon Llull University: Ramon Llull University Barcelona, Catalonia, Spain
[30] Quinlan, J. R., C4:5: Programs for Machine Learning (1993), Morgan Kaufmann: Morgan Kaufmann San Mateo, CA
[31] Aha, D. W.; Kibler, D. F.; Albert, M. K., Instance-based learning algorithms, Mach. Learn., 6, 1, 37-66 (1991)
[32] John, G. H.; Langley, P., Estimating continuous distributions in Bayesian classifiers, (Proceedings of the 11th Conference on Uncertainty in Artificial Intelligence (1995), Morgan Kaufmann Publishers: Morgan Kaufmann Publishers San Mateo), 338-345
[33] C.C. Chang, C.J. Lin, LIBSVM: A Library for Support Vector Machines, Department of Computer Science and Information Engineering, National Taiwan University, 2001.; C.C. Chang, C.J. Lin, LIBSVM: A Library for Support Vector Machines, Department of Computer Science and Information Engineering, National Taiwan University, 2001.
[34] Platt, J. C., Fast training of support vector machines using sequential minimal optimization, (Advances in Kernel Methods Support Vector Learning (1999), MIT Press: MIT Press Cambridge, MA), 185-208
[35] Joachims, T., Learning to Classify Text using Support Vector Machines: Methods, Theory and Algorithms (2002), Kluwer Academic Publishers: Kluwer Academic Publishers Dordrecht
[36] Wilson, S. W., Classifier fitness based on accuracy, Evol. Comput., 3, 2, 149-175 (1995)
[37] Butz, M. V.; Sastry, K.; Goldberg, D. E., Tournament selection in XCS, (Proceedings of the Genetic and Evolutionary Computation Conference—GECCO2003, Lecture Notes in Computer Science, vol. 2724 (2003), Springer: Springer Berlin), 1857-1869 · Zbl 1038.68651
[38] Bacardit, J.; Garrell, J. M., Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system, (Proceedings of the 6th International Workshop on Learning Classifier Systems, Lecture Notes in Artificial Intelligence (2003), Springer: Springer Berlin)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.