×

Two-sample Kolmogorov-Smirnov fuzzy test for fuzzy random variables. (English) Zbl 1307.62137

Summary: In this paper, a new method is proposed for developing two-sample Kolmogorov-Smirnov test for the case when the data are observations of fuzzy random variables, and the hypotheses are imprecise rather than crisp. In this approach, first a new notion of fuzzy random variables is introduced. Then, the \(\alpha \)-pessimistic values of the imprecise observations are transacted to extend the usual method of two-sample Kolmogorov-Smirnov test. To do this, the concepts of fuzzy cumulative distribution function and fuzzy empirical cumulative distribution function are defined. We also develop a well-known large sample property of the classical empirical cumulative distribution function for fuzzy empirical cumulative distribution function. In addition, the Kolmogorov-Smirnov two-sample test statistic is extended for fuzzy random variables. After that, the method of computing the so-called fuzzy \(p\) value is introduced to evaluate the imprecise hypotheses of interest. In this regard, applying an index called credibility degree, the obtained fuzzy \(p\) value and the crisp significance level are compared. The result provides a fuzzy test function which leads to some degrees to accept or to reject the null hypothesis. Some numerical examples are provided throughout the paper clarifying the discussions made in this paper.

MSC:

62G86 Nonparametric inference and fuzziness
62G10 Nonparametric hypothesis testing
Full Text: DOI

References:

[1] Adriaenssens V, De Baets B, Goethals PLM, De Pauw N (2004) Fuzzy rule-based models for decision support in ecosystem management. Sci Total Environ 319:1-12 · doi:10.1016/S0048-9697(03)00433-9
[2] Bandemer H, Näther W (1992) Fuzzy data analysis. Kluwer Academic Publisher, Dordrecht · Zbl 0758.62003 · doi:10.1007/978-94-011-2506-2
[3] Bertoluzza C, Gil MÁ, Ralescu DA (2002) Statistical modeling. Analysis and management of fuzzy data. Academic Press, New York · Zbl 0980.00008 · doi:10.1007/978-3-7908-1800-0
[4] Bethea RM, Rhinehart RR (1991) Applied engineering statistics. Marcel Dekker, New York · Zbl 0797.62085
[5] Buckley JJ (2006) Fuzzy statistics. Studies in fuzziness and soft computing. Springer, Berlin · Zbl 1095.62002
[6] Chachi J, Taheri SM (2011) Fuzzy confidence intervals for mean of Gaussian fuzzy random variables. Expert Syst Appl 38:5240-5244 · doi:10.1016/j.eswa.2010.10.034
[7] Chachi J, Taheri SM, Viertl R (2012) Testing statistical hypotheses based on fuzzy confidence intervals. Austrian J Stat 41:267-286
[8] Colubi A (2009) Statistical inference about the means of fuzzy random variables: applications to the analysis of fuzzy- and real-valued data. Fuzzy Sets Syst 160:344-356 · Zbl 1175.62021 · doi:10.1016/j.fss.2007.12.019
[9] Colubi A, Coppi R, D’Urso P, Gil MÁ (2007) Statistics with fuzzy random variables. METRON Int J Stat LXV:277-303 · Zbl 1147.62003
[10] Colubi A, Domínguez-Menchero JS, López-Díaz M, Ralescu DA (2001) On the formalization of fuzzy random variables. Inform Sci 133:3-6 · Zbl 0988.28008 · doi:10.1016/S0020-0255(01)00073-1
[11] Colubi A, González-Rodríguez G (2007) Triangular fuzzification of random variables and power of distribution tests: empirical discussion. Comput Stat Data Anal 51:4742-4750 · Zbl 1162.62342 · doi:10.1016/j.csda.2006.11.014
[12] Colubi, A.; González-Rodríguez, G.; Lubiano, MA; Montenegro, M.; Lawry, J. (ed.); Miranda, E. (ed.); Bugarin, A. (ed.); Li, S. (ed.); Gil, MA (ed.); Grzegorzewski, P. (ed.); Hryniewicz, O. (ed.), Exploratory analysis of random variables based on fuzzification, 95-102 (2006), Berlin · Zbl 1103.62018
[13] Coppi R, D’Urso P, Giordani P, Santoro A (2006a) Least squares estimation of a linear regression model with \[LR\] LR fuzzy response. Comput Stat Data Anal 51:267-286 · Zbl 1157.62460 · doi:10.1016/j.csda.2006.04.036
[14] Coppi R, Gil MÁ, Kiers HAL (2006b) The fuzzy approach to statistical analysis. Comput Stat Data Anal 51:1-14 · Zbl 1157.62308 · doi:10.1016/j.csda.2006.05.012
[15] Denœux T, Masson MH, Herbert PH (2005) Non-parametric rank-based statistics and significance tests for fuzzy data. Fuzzy Sets Syst 153:1-28 · Zbl 1062.62075 · doi:10.1016/j.fss.2005.01.008
[16] Dubois D, Prade H (1983) Ranking of fuzzy numbers in the setting of possibility theory. Inform Sci 30:183-224 · Zbl 0569.94031 · doi:10.1016/0020-0255(83)90025-7
[17] Feng Y (2000) Gaussian fuzzy random variables. Fuzzy Sets Syst 111:325-330 · Zbl 0966.60017 · doi:10.1016/S0165-0114(98)00033-5
[18] Ferraro MB, Colubi A, González-Rodríguez G, Coppi R (2011) A determination coefficient for a linear regression model with imprecise response. Environmetrics 22:516-529 · doi:10.1002/env.1056
[19] Gebhardt J, Gil MÁ, Kruse R (1998) Concepts of fuzzy-valued statistics. In: R Slowinski, (ed) Fuzzy sets in decision analysis, operations research and statistics. Kluwer, NewYork, pp 311-347 · Zbl 0946.62006
[20] Gibbons JD, Chakraborti S (2003) Non-parametric statistical inference, 4th edn. Marcel Dekker, New York · Zbl 1115.62004
[21] Gil MÁ, López-Díaz M, Ralescu DA (2006) Overview on the development of fuzzy random variables. Fuzzy Sets Syst 157:2546-2557 · Zbl 1108.60006 · doi:10.1016/j.fss.2006.05.002
[22] González-Rodríguez G, Colubi A, Gil MÁ (2006a) A fuzzy representation of random variables: an operational tool in exploratory analysis and hypothesis testing. Comput Stat Data Anal 51:163-176 · Zbl 1157.62303 · doi:10.1016/j.csda.2006.04.006
[23] González-Rodríguez G, Montenegro M, Colubi A, Gil MÁ (2006b) Bootstrap techniques and fuzzy random variables: synergy in hypothesis testing with fuzzy data. Fuzzy Sets Syst 157:2608-2613 · Zbl 1119.62037 · doi:10.1016/j.fss.2003.11.021
[24] Govindarajulu Z (2003) Non-parametric inference. World Scientific, Hackensack
[25] Grzegorzewski P (1998) Statistical inference about the median from vague data. Control Cybern 27:447-464 · Zbl 0945.62038
[26] Grzegorzewski P (2000) Testing statistical hypotheses with vague data. Fuzzy Sets Syst 112:501-510 · Zbl 0948.62010 · doi:10.1016/S0165-0114(98)00061-X
[27] Grzegorzewski P (2004) Distribution-free tests for vague data. In: Lopez-Diaz M et al (eds) Soft methodology and random information systems. Springer, Heidelberg, pp 495-502 · Zbl 1064.62052
[28] Grzegorzewski P (2005) Two-sample median test for vague data. In: Proceedings of the 4th Conference European Society for Fuzzy Logic and Technology-Eusflat, Barcelona, pp 621-626 · Zbl 0948.62010
[29] Grzegorzewski P (2008) A bi-robust test for vague data. In: Magdalena L, Ojeda-Aciego M, Verdegay JL (eds) Proceedings of the 12th international conference on information processing and management of uncertainty in knowledge-based systems, IPMU2008. Spain, Torremolinos (Malaga), June 22-27, pp 138-144 · Zbl 0966.60017
[30] Grzegorzewski P (2009) K-sample median test for vague data. Int J Intell Syst 24:529-539 · Zbl 1160.62039 · doi:10.1002/int.20345
[31] Hesamian G, Taheri SM (2013) Linear rank tests for two-sample fuzzy data: a p-value approach. Journal of Uncertainty Systems 7:129-137
[32] Hryniewicz O (2006a) Possibilistic decisions and fuzzy statistical tests. Fuzzy Sets Syst 157:2665-2673 · Zbl 1099.62008 · doi:10.1016/j.fss.2003.08.009
[33] Hryniewicz O (2006b) Goodman-Kruskal \[\gamma\] γ measure of dependence for fuzzy ordered categorical data. Comput Stat Data Anal 51:323-334 · Zbl 1157.62424 · doi:10.1016/j.csda.2006.04.014
[34] Kahraman C, Bozdag CF, Ruan D (2004) Fuzzy sets approaches to statistical parametric and non-parametric tests. Int J Intell Syst 19:1069-1078 · Zbl 1074.62029 · doi:10.1002/int.20037
[35] Krätschmer V (2001) A unified approach to fuzzy random variables. Fuzzy Sets Syst 123:1-9 · Zbl 1004.60003 · doi:10.1016/S0165-0114(00)00038-5
[36] Kruse R, Meyer KD (1987) Statistics with vague data. Reidel Publishing Company, Dordrecht · Zbl 0663.62010 · doi:10.1007/978-94-009-3943-1
[37] Kvam PH, Vidakovic B (2007) Non-parametric statistics with application to science and engineering. Wiley, New York · Zbl 1183.62207 · doi:10.1002/9780470168707
[38] Kwakernaak H (1978) Fuzzy random variables, part I: definitions and theorems. Inform Sci 19:1-15 · Zbl 0438.60004 · doi:10.1016/0020-0255(78)90019-1
[39] Kwakernaak H (1979) Fuzzy random variables, part II: Algorithms and examples for the discrete case. Inform Sci 17:253-278 · Zbl 0438.60005 · doi:10.1016/0020-0255(79)90020-3
[40] Lee KH (2005) First course on fuzzy theory and applications. Springer, Berlin · Zbl 1063.94129
[41] Liu B (2013) Uncertainty theory, 4th edn. Springer, Berlin
[42] Lin P, Wu B, Watada J (2010) Kolmogorov-Smirnov two sample test with continuous fuzzy data. Advances in intelligent and soft computing 68:175-186 · Zbl 1201.62057 · doi:10.1007/978-3-642-11960-6_17
[43] Peng J, Liu B (2004) Some properties of optimistic and pessimistic values of fuzzy. IEEE Int Conf Fuzzy Syst 2:745-750
[44] Puri ML, Ralescu DA (1985) The concept of normality for fuzzy random variables. Annals Probab 13:1373-1379 · Zbl 0583.60011 · doi:10.1214/aop/1176992822
[45] Puri ML, Ralescu DA (1986) Fuzzy random variables. J Math Anal Appl 114:409-422 · Zbl 0592.60004 · doi:10.1016/0022-247X(86)90093-4
[46] Salski A (2007) Fuzzy clustering of fuzzy ecological data. Ecol Inform 2:262-269 · doi:10.1016/j.ecoinf.2007.07.002
[47] Shapiro AF (2009) Fuzzy random variables. Insur Math Econ 44:307-314 · Zbl 1166.91018 · doi:10.1016/j.insmatheco.2008.05.008
[48] Simeonov V, Puxbaum H, Tsakowski S, Sarbu C, Kalina M (1999) Classification and receptor modeling of wet precipitation data from central Austria (1984-1993). Environmetrics 10:137-152 · doi:10.1002/(SICI)1099-095X(199903/04)10:2<137::AID-ENV343>3.0.CO;2-I
[49] Taheri SM, Hesamian G (2011) Goodman-Kruskal measure of association for fuzzy-categorized variables. Kybernetika 47:110-122 · Zbl 1213.93199
[50] Taheri SM, Hesamian G (2013) A generalization of the Wilcoxon signed-rank test and its applications. Sta Pap 54:457-470 · Zbl 1364.62114 · doi:10.1007/s00362-012-0443-4
[51] Tscherko D, Kandeler E, Bárdossy A (2007) Fuzzy classification of microbial biomass and enzyme activities in grassland soils. Soil Biol Biochem 39:1799-1808 · doi:10.1016/j.soilbio.2007.02.010
[52] Van Broekhoven E, Adriaenssens V, De Baets B, Verdonschot PFM (2006) Fuzzy rule-based macroinvertebrate habitat suitability models for running waters. Ecol Model 198:71-84 · doi:10.1016/j.ecolmodel.2006.04.006
[53] Van Broekhoven E, Adriaenssens V, De Baets B (2007) Interpretability-preserving genetic optimization of linguistic terms in fuzzy models for fuzzy ordered classification: an ecological case study. Int J Approx Reason 44:65-90 · Zbl 1107.68450 · doi:10.1016/j.ijar.2006.03.003
[54] Viertl R (1990) Statistical inference for fuzzy data in environmetrics. Environmetrics 1:37-42 · doi:10.1002/env.3170010105
[55] Viertl R (1997) On statistical inference for non-precise data. Environmetrics 8:541-568 · doi:10.1002/(SICI)1099-095X(199709/10)8:5<541::AID-ENV269>3.0.CO;2-U
[56] Viertl R (2011) Statistical methods for fuzzy data. Wiley, Chichester · Zbl 1278.62014 · doi:10.1002/9780470974414
[57] Wu HC (2005) Statistical hypotheses testing for fuzzy data. Fuzzy Sets Syst 175:30-56 · Zbl 1081.62012
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.