×

Machine learning models, epistemic set-valued data and generalized loss functions: an encompassing approach. (English) Zbl 1427.68263

Summary: We study those problems where the goal is to find “optimal” models with respect to some specific criterion, in regression and supervised classification problems. Alternatives to the usual expected loss minimization criterion are proposed, and a general framework where this criterion can be seen as a particular instance of a general family of criteria is provided.
In the new setting, each model is formally identified with a random variable that associates a loss value to each individual in the population. Based on this identification, different stochastic orderings between random variables lead to different criteria to compare pairs of models. Our general setting encompasses the classical criterion based on the minimization of the expected loss, but also other criteria where a numerical loss function is not available, and therefore the computation of its expectation does not make sense.
The presentation of the new framework is divided into two stages. First, we consider the new framework under standard situations about the sample information, where both the collection of attributes and the response variables are observed with precision. Then, we assume that just incomplete information about them (expressed in terms of set-valued data sets) is provided. We cast some comparison criteria from the recent literature on learning methods from low-quality data as particular instances of our general approach.

MSC:

68T05 Learning and adaptive systems in artificial intelligence
62H30 Classification and discrimination; cluster analysis (statistical aspects)
Full Text: DOI

References:

[1] Bonissone, P. P.; Subbu, R.; Lizzi, J., Multicriteria decision making (mcdm): a framework for research and applications, IEEE Comput. Intell. Mag., 4, 3, 48-61 (2009)
[2] Couso, I., Preference relations and families of probabilities: different sides of the same coin, Proceedings of the International Conference on Information Processing and Management of Uncertainty in Communications in Computer and Information Science. Proceedings of the International Conference on Information Processing and Management of Uncertainty in Communications in Computer and Information Science, IPMU 2014, 442, 1-9 (2014), Springer · Zbl 1422.91189
[3] Couso, I.; Dubois, D., An imprecise probability approach to joint extensions of stochastic and interval orderings, Advances in Computational Intelligence, IPMU 2012, 299, 388-399 (2012), Springer · Zbl 1252.60006
[4] Couso, I.; Dubois, D., Statistical reasoning with set-valued information: Ontic vs. epistemic views, Int. J. Approx. Reason., 55, 1502-1518 (2014) · Zbl 1407.62032
[5] Couso, I.; Dubois, D.; Sánchez, L., Random Sets and Random Fuzzy Sets as Ill-Perceived Random Variables (2014), Springer · Zbl 1355.60005
[6] Couso, I.; Moral, S.; Sánchez, L., The behavioral meaning of the median, Inf. Sci., 294, 127-138 (2015) · Zbl 1414.91103
[7] David, H., The method of paired comparisons, Griffin’s Statistical Monographs and Courses, 12 (1963), Charles Griffin & D. Ltd., London
[8] Denœux, T., Maximum likelihood estimation from fuzzy data using the EM algorithm, Fuzzy Sets Syst., 183, 72-91 (2011) · Zbl 1239.62017
[9] Denœux, T., Maximum likelihood estimation from uncertain data in the belief function framework, IEEE Trans. Knowl. Data Eng., 25, 119-130 (2013)
[10] Destercke, S.; Couso, I., Ranking of fuzzy intervals seen through the imprecise probabilistic lens, Fuzzy Sets Syst., 278, 20-39 (2015) · Zbl 1375.91051
[11] Dubois, D., Ontic vs. epistemic fuzzy sets in modeling and data processing tasks, (Madani, K.; Kacprzyk, J.; Filipe, J., Proceedings of the International Joint Conference on Computational Intelligence and International Conference on Neural Computation Theory and Applications IJCCI (NCTA) (2011), Paris)
[12] Dubois, D.; Prade, H., Gradualness, uncertainty and bipolarity: Making sense of fuzzy sets, Fuzzy Sets Syst., 192, 3-24 (2012) · Zbl 1238.03044
[13] S. Ferson, V. Kreinovich, J. Hajagos, W. Oberkampf, L. Ginzburg, Experimental Uncertainty Estimation and Statistics for Data Having Interval Uncertainty, Report SAND2007-0939, Sandia National Laboratories, 2007.; S. Ferson, V. Kreinovich, J. Hajagos, W. Oberkampf, L. Ginzburg, Experimental Uncertainty Estimation and Statistics for Data Having Interval Uncertainty, Report SAND2007-0939, Sandia National Laboratories, 2007.
[14] Fishburn, P. C., Interval Orderings and Interval Graphs (1985), Wiley: Wiley New-York · Zbl 0568.05047
[15] Hadar, J.; Russell, W., Rules for ordering uncertain prospects, Am. Econ. Rev., 59, 25-34 (1969)
[16] Haykin, S., Neural Networks: A Comprehensive Foundation (1998), Prentice Hall · Zbl 0828.68103
[17] Hüllermeier, E., Learning from imprecise and fuzzy observations: Data disambiguation through generalized loss minimization, Int. J. Approx. Reason., 55, 1519-1534 (2014) · Zbl 1407.68396
[18] Hurwicz, L., A Class of Criteria for Decision-Making Under Ignorance. A Class of Criteria for Decision-Making Under Ignorance, Cowles Commission Paper, 356 (1951), Cowles Commission
[19] Palacios, A. M.; Sánchez, L.; Couso, I., Diagnosis of dyslexia with low quality data with genetic fuzzy systems, Int. J. Approx. Reason., 51, 993-1009 (2010)
[20] Palacios, A. M.; Sánchez, L.; Couso, I., Linguistic cost-sensitive learning of genetic fuzzy classifiers for imprecise data, Int. J. Approx. Reason., 52, 841-862 (2011)
[21] Palacios, A. M.; Sánchez, L.; Couso, I., Future performance modeling in athletism with low quality data-based genetic fuzzy systems, J. Mult. Valued Logic Soft Comput., 17, 207-228 (2011)
[22] Palacios, A. M.; Sánchez, L.; Couso, I., Boosting of fuzzy rules with low quality data, J. Mult. Valued Logic Soft Comput., 19, 591-619 (2012)
[23] Raja, R.; Raja, U. K.; Samidurai, R.; Leelamani, A., Improved stochastic dissipativity of uncertain discrete-time neural networks with multiple delays and impulses, Int. J. Mach. Learn. Cybern., 6, 289-305 (2015)
[24] Savage, L. J., The Foundations of Statistics (1954), Wiley: Wiley New York · Zbl 0055.12604
[25] Sánchez, L., Interval-valued GA-p algorithms, IEEE Trans. Evol. Comput., 4, 64-72 (2000)
[26] Sánchez, L.; Couso, I., Learning from imprecise examples with GA-p algorithms, Mathw. Soft Comput., 5, 305-319 (1998) · Zbl 0957.68100
[27] Sánchez, L.; Couso, I., Advocating the use of imprecisely observed data in genetic fuzzy systems, IEEE Trans. Fuzzy Syst., 15, 4, 551-562 (2007)
[28] Sánchez, L.; Couso, I., Fuzzy random variables-based modeling with GA-p algorithms, Information, Uncertainty and Fusion, 245-256 (2000), Kluwer (Dordrecht)
[29] Sánchez, L.; Couso, I., Exploiting low quality information for fuzzy modelling: New challenges, Proceedings of the IEEE Transactions on Fuzzy Systems, Special Issue on Fuzzy Systems in Data Science (2016)
[30] Sánchez, L.; Couso, I.; Casillas, J., Genetic learning of fuzzy rules based on low quality data, Fuzzy Sets Syst., 160, 2524-2552 (2009) · Zbl 1191.68526
[31] Sánchez, L.; Otero, J.; Couso, I., Obtaining linguistic fuzzy rule-based regression models from imprecise data with multiobjective genetic algorithms, Soft Comput., 13, 467-479 (2009) · Zbl 1181.68215
[32] Satia, L. E.; Roy, J., Markovian decision processes with uncertain transition probabilities, Oper. Res., 21, 728-740 (1973) · Zbl 0286.60038
[33] Serrurier, M.; Prade, H., A general framework for imprecise regression, Proceedings of the IEEE International Conference on Fuzzy Systems, Fuzz-IEEE, 1597-1602 (2007), London
[34] Sniedovich, M., Wald’s maximin model: a treasure in disguise!, J. Risk Financ., 9, 287-291 (2008)
[35] Teich, J., Pareto-front exploration with uncertain objectives, (Zitzler, E.; etal., Evolutionary Multi-Criterion Optimization. Evolutionary Multi-Criterion Optimization, LNCS, 1993 (2001), Springer-Verlag: Springer-Verlag Berlin), 314-328
[36] Utkin, L.; Coolen, F. P.A., Interval-valued regression and classification models in the framework of machine learning, Proceedings of the Seventh International Symposium on Imprecise Probability: Theories and Applications (2011), Innsbruck, Austria
[37] Wald, A., Statistical decision functions which minimize the maximum risk, Ann. Math., 46, 265-280 (1945) · Zbl 0063.08126
[38] Walley, P., Statistical Reasoning with Imprecise Probabilities (1991), Chapman and Hall · Zbl 0732.62004
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.