Measuring association between nominal categorical variables: an alternative to the Goodman-Kruskal lambda. (English) Zbl 1516.62401

Summary: The lambda coefficient, or Goodman-Kruskal lambda, has become one of the most popular measures of association between two nominal categorical variables. Its popularity is primarily due to its simple and meaningful definition and its interpretation as the proportional reduction in error when predicting a random observation's category for one variable given (versus not knowing) its category for the other variable. It is an asymmetric measure, although a symmetric version is available. The lambda coefficient does, however, have a widely recognized limitation: it can equal zero even when the variables are not independent and all other measures take on positive values. To mitigate this problem, an alternative lambda coefficient is introduced in this paper as a slight modification of the Goodman-Kruskal lambda. The properties of the new measure are discussed and a symmetric form is introduced. A statistical inference procedure is developed and a numerical example is provided.
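
The limitation described in the summary can be illustrated with a minimal sketch of the classical (asymmetric) Goodman-Kruskal lambda. This is not the alternative coefficient proposed in the paper, whose definition is not given in this record. The function names and the two example tables are assumptions made for illustration only; rows are taken as the predictor variable and columns as the predicted variable.

```python
from fractions import Fraction


def gk_lambda(table):
    """Classical Goodman-Kruskal lambda for predicting the column
    category from the row category (illustrative helper, not from the paper)."""
    n = sum(sum(row) for row in table)
    col_totals = [sum(col) for col in zip(*table)]
    max_col = max(col_totals)              # best guess without knowing the row
    if n == max_col:
        raise ValueError("lambda is undefined when one column holds all observations")
    sum_row_max = sum(max(row) for row in table)  # best guesses within each row
    # Proportional reduction in prediction error
    return Fraction(sum_row_max - max_col, n - max_col)


def chi_square(table):
    """Pearson chi-square statistic, used here only to show dependence.
    Assumes every row and column total is positive."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical tables: perfect association, and dependence with a shared modal column
perfect = [[20, 0], [0, 30]]
dependent = [[30, 10], [20, 15]]

print(gk_lambda(perfect))      # 1
print(gk_lambda(dependent))    # 0
print(chi_square(dependent) > 0)  # True
```

In the second table the modal column is the same in every row, so knowing the row never changes the best prediction and lambda is zero, even though the chi-square statistic is positive and the variables are therefore not independent.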

MSC:

62-XX Statistics

Software:

bootlib

References:

[1] Agresti, A., Analysis of Ordinal Data (2010), Wiley, Hoboken, NJ · Zbl 1263.62007
[2] Agresti, A., Categorical Data Analysis (2013), Wiley, Hoboken, NJ · Zbl 1281.62022
[3] Bishop, Y. M. M.; Fienberg, S. E.; Holland, P. W., Discrete Multivariate Analysis: Theory and Practice (1975), The MIT Press, Cambridge, MA · Zbl 0332.62039
[4] Blalock, H. M. Jr., Social Statistics (1972), McGraw-Hill, New York
[5] Cramér, H., Mathematical Methods of Statistics (1946), Princeton University Press, Princeton, NJ · Zbl 0063.01014
[6] Davison, A. C.; Hinkley, D. V., Bootstrap Methods and Their Application (1997), Cambridge University Press, Cambridge · Zbl 0886.62001
[7] Everitt, B. S., The Analysis of Contingency Tables (1977), Chapman and Hall, London · Zbl 0777.62060
[8] Freeman, D. H. Jr., Applied Categorical Data Analysis (1987), Marcel Dekker, New York · Zbl 0631.62001
[9] Garson, G. D., Measures of Association (2012), Statistical Publishing Associates, Asheboro, NC
[10] Goodman, L. A.; Kruskal, W. H., Measures of association for cross classifications, J. Am. Stat. Assoc., 49, 732-764 (1954) · Zbl 0056.12801
[11] Goodman, L. A.; Kruskal, W. H., Measures of Association for Cross Classifications (1979), Springer, New York · Zbl 0426.62034
[12] Guttman, L.; Horst, P., An outline of the statistical theory of prediction, Prediction of Personal Adjustment, Bulletin 48, 253-313 (1941), The Social Science Research Council, New York
[13] Haberman, S. J.; Kotz, S.; Johnson, N. L., Measures of association, Encyclopedia of Statistical Sciences, 1, 130-137 (1982), Wiley, New York · Zbl 0552.62001
[14] Hardy, G. H.; Littlewood, J. E.; Pólya, G., Inequalities (1934), Cambridge University Press, London · JFM 60.0169.01
[15] Kendall, M. G.; Stuart, A., The Advanced Theory of Statistics, Vol. 2: Inference and Relationships (1979), Charles Griffin, London · Zbl 0416.62001
[16] Kvålseth, T. O., An alternative measure of ordinal association as a value-validity correction of the Goodman-Kruskal gamma, Commun. Stat. Theory M., 45 (2016), advance online publication · Zbl 1453.62529
[17] Liebetrau, A. M., Measures of Association (1983), Sage, Beverly Hills, CA
[18] Marshall, A. W.; Olkin, I.; Arnold, B. C., Inequalities: Theory of Majorization and its Applications (2011), Springer, New York · Zbl 1219.26003
[19] Parr, W. C.; Tolley, H. D., Jackknifing in categorical data analysis, Aust. J. Stat., 24, 67-79 (1982) · Zbl 0486.62033 · doi:10.1111/j.1467-842X.1982.tb00808.x
[20] Reynolds, H. T., The Analysis of Cross-Classifications (1977), The Free Press, New York
[21] Stuart, A.; Ord, J. K., Kendall's Advanced Theory of Statistics, Vol. 1: Distribution Theory (1994), Edward Arnold, London · Zbl 0880.62012
[22] Tang, W.; He, H.; Tu, X. M., Applied Categorical and Count Data Analysis (2012), CRC Press, Boca Raton, FL · Zbl 1279.62018
[23] Upton, G.; Armitage, P.; Colton, T., Goodman-Kruskal measures of association, Encyclopedia of Biostatistics, 1721-1723 (2005), Wiley, Chichester