×

A unified approach to constructing correlation coefficients between random variables. (English) Zbl 1444.62063

Measuring the strength of correlation between two variables or two sets of variables is of vital importance in machine learning. This paper proposes the unified index of correlation between two continuous random variables \(X\) and \(Y\) which leads to new measures of correlations and subsumes some of the existing measures such as the Pearson correlation coefficient and Gini correlation coefficient and its extensions widely employed in random forest theory. The proposed measure is validated on three examples.

MSC:

62H05 Characterization and structure theory for multivariate probability distributions; copulas
62H20 Measures of association (correlation, canonical correlation, etc.)
62H30 Classification and discrimination; cluster analysis (statistical aspects)
68T05 Learning and adaptive systems in artificial intelligence

References:

[1] Aly, EEA; Bleuer, S., Confidence bands for quantile-quantile plots, Stat Risk Model, 4, 2-3, 205-226 (1986) · Zbl 0591.62040
[2] Arnold, BC; Balakrishnan, N.; Nagaraja, HN, Records (1998), New York: Wiley, New York
[3] Asadi, M., A new measure of association between random variables, Metrika, 80, 6-8, 649-661 (2017) · Zbl 1382.62032
[4] Asadi, M.; Zohrevand, Y., On the dynamic cumulative residual entropy, J Stat Plan Inference, 137, 6, 1931-1941 (2007) · Zbl 1118.62006
[5] Asadi, M.; Ebrahimi, N.; Soofi, ES, Connections of Gini, Fisher, and Shannon by Bayes risk under proportional hazards, J Appl Probab, 54, 4, 1027-1050 (2017) · Zbl 1401.62193
[6] Balanda, KP; MacGillivray, HL, Kurtosis and spread, Can J Stat, 18, 1, 17-30 (1990) · Zbl 0692.62010
[7] Barlow, RE; Proschan, F., Statistical theory of reliability and life testing: probability models. To begin with (1981), Berlin: Springer, Berlin
[8] Barnett, V., Some bivariate uniform distributions, Commun Stat Theory Methods, 9, 453-461 (1980) · Zbl 0449.62011
[9] Cambanis, S.; Salinetti, G.; Kotz, S., On Eyraud-Farlie-Gumbel-Morgenstern random processes, Advances in probability distributions with given marginals, 207-222 (1991), Netherlands: Springer, Netherlands · Zbl 0744.60037
[10] Cuadras, CM, On the covariance between functions, J Multivar Anal, 81, 19-27 (2002) · Zbl 1011.62062
[11] Diaz, W.; Cuadras, CM, On a multivariate generalization of the covariance, Commun Stat Theory Methods, 46, 9, 4660-4669 (2017) · Zbl 1368.62155
[12] Doksum, KA, Measures of location and asymmetry, Scand J Stat, 12, 11-22 (1975) · Zbl 0311.62002
[13] Doksum, KA; Fenstad, G.; Aaberge, R., Plots and tests for symmetry, Biometrika, 64, 3, 473-487 (1977) · Zbl 0381.62034
[14] Furman, E.; Zitikis, R., Beyond the Pearson correlation: heavytailed risks, weighted Gini correlations, and a Gini-type weighted insurance pricing model, ASTIN Bull J IAA, 47, 3, 919-942 (2017) · Zbl 1390.91183
[15] Gilchrist, W., Statistical modelling with quantile functions (2000), Boca Raton: CRC Press Inc, Boca Raton
[16] Groeneveld, RA, A class of quantile measures for kurtosis, Am Stat, 52, 4, 325-329 (1998)
[17] Grothe, O.; Schnieders, J.; Segers, J., Measuring association and dependence between random vectors, J Multivar Anal, 123, 96-110 (2014) · Zbl 1278.62090
[18] Gurrera MDC (2005) Construction of bivariate distributions and statistical dependence operations. Ph.D. dissertation, University of Barcelona, Spain
[19] Hutchinson, TP; Lai, CD, Continuous bivariate distributions, emphasising applications (1990), Adelaide: Rumsby Scientific Publishing, Adelaide · Zbl 1170.62330
[20] Johnson, NL; Kotz, S., On some generalized Farlie-Gumbel-Morgenstern distributions-II regression, correlation and further generalizations, Commun Stat Theory Methods, 6, 6, 485-496 (1977) · Zbl 0382.62040
[21] Nelsen, RB, Concordance and Gini’s measure of association, Nonparametric Stat, 9, 227-238 (1998) · Zbl 0919.62057
[22] Nolde, N., Geometric interpretation of the residual dependence coefficient, J Multivar Anal, 123, 85-95 (2014) · Zbl 1360.60100
[23] Psarrakos, G.; Navarro, J., Generalized cumulative residual entropy and record values, Metrika, 27, 623-640 (2013) · Zbl 1307.62011
[24] Rao, M.; Chen, Y.; Vemuri, BC; Wang, F., Cumulative residual entropy: a new measure of information, IEEE Trans Inf Theory, 50, 6, 1220-1228 (2004) · Zbl 1302.94025
[25] Schechtman, E.; Yitzhaki, S., On the proper bounds of the Gini correlation, Econ lett, 63, 2, 133-138 (1999) · Zbl 0924.90043
[26] Schezhtman, E.; Yitzhaki, S., A measure of association based on Gini’s mean difference, Commun Stat Theory Methods, 16, 1, 207-231 (1987) · Zbl 0617.62061
[27] Shaked, M.; Shanthikumar, JG, Stochastic orders (2007), Berlin: Springer, Berlin · Zbl 1111.62016
[28] Shaw WT, Buckley IR (2009) The alchemy of probability distributions: beyond Gram-Charlier expansions, and a skew-kurtotic-normal distribution from a rank transmutation map. arXiv preprint arXiv:0901.0434
[29] van Zwet, WR, Convex T ransformations of random variables (1964), Amsterdam: Mathematisch Centrum, Amsterdam · Zbl 0125.37102
[30] Wang, Y.; Hossain, AM; Zimmer, WJ, Monotone log-odds rate reliability analysis, Commun Stat Theory Methods, 12, 2227-2244 (2003) · Zbl 1131.62324
[31] Yeo, IK; Johnson, RA, A new family of power transformations to improve normality or symmetry, Biometrika, 87, 4, 954-959 (2000) · Zbl 1028.62010
[32] Yin, X., Canonical correlation analysis based on information theory, J Multivar Anal, 91, 2, 161-176 (2004) · Zbl 1058.62049
[33] Yitzhaki, S., Gini’s mean difference: a superior measure of variability for non-normal distributions, Metron, 61, 2, 285-316 (2003) · Zbl 1416.60031
[34] Yitzhaki, S.; Olkin, I., Concentration indices and concentration curves. Lecture notes-monograph series, 380-392 (1991), Stanford: Stanford University, Stanford · Zbl 0755.90016
[35] Yitzhaki, S.; Schechtman, E., The Gini methodology: a primer on a statistical methodology (2013), New York: Springer, New York · Zbl 1292.62013
[36] Yitzhaki, S.; Wodon, Q.; Amiel, Y., Mobility, inequality, and horizontal equity, Studies on economic well-being: essays in the honor of John P. Formby, 179-199 (2004), Bingley: Emerald Group Publishing Limited, Bingley
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.