×

Ordinal and percentile clustering. (English) Zbl 0695.62144

The purpose of the paper is to put together the basic ideas of the theory of ordinal clustering, as developed by the first author et al. (1978), and the theory of probabilistic metric spaces, as developed by the second author et al. (1983). The principal result is a new theory of clustering, called percentile clustering, in which clustering is based, not on some average or other typical value of the data, but directly on the distributed data itself.
In the case of the Jardine-Sibson model [N. Jardine and R. Sibson, Mathematical Taxonomy, Wiley, N.Y. (1971)] this leads from dissimilarity coefficients (DC) to percentile dissimilarity coefficients (PDC) - and from a totally ordered set to a lattice. Each PDC determines a family of ordinary DC’s - one for each percentile c in [0,1]. A secondary outgrowth is a generalized theory of ordinal clustering. A number of algorithms that implement percentile clustering is presented.
The paper concludes by applying the new cluster methods to two concrete examples. The first of these is a data set concerning combat death in the Vietnam War; the second is a data set dealing with the classification of species of gibbons. In both instances results obtained with various standard clustering techniques are also presented.
Reviewer: V.Yu.Urbakh

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
Full Text: DOI

References:

[1] Barthelemy, J. P.; Leclerc, B.; Monjardet, B., On the use of ordered sets in problems of comparison and consensus of classifications, J. of Classification, 3, 187-224 (1986) · Zbl 0647.62056
[2] Baulieu, F., Order theoretic classification of cluster methods, (Doctoral Dissertation (1981), Univ. of Massachusetts: Univ. of Massachusetts Amherst) · Zbl 0742.92025
[3] Baulieu, F., Order theoretic classification of monotone equivariant cluster methods, Algebra Universalis, 20, 351-367 (1985) · Zbl 0576.06005
[4] F. Baulieu, Classification of normalized cluster methods in an order theoretic model, Discrete Appl. Math., to appear.; F. Baulieu, Classification of normalized cluster methods in an order theoretic model, Discrete Appl. Math., to appear. · Zbl 0742.92025
[5] Bezdek, J. C., Pattern Recognition with Fuzzy Objective Function Algorithms (1981), Plenum: Plenum New York · Zbl 0503.68069
[6] Bezdek, J. C.; Harris, J. D., Fuzzy partitions and relations: an axiomatic basis for clustering, Fuzzy Sets and Systems, 1, 111-127 (1978) · Zbl 0442.68093
[7] Birkhoff, G., Lattice Theory (1967), Amer. Math. Soc: Amer. Math. Soc Providence · Zbl 0126.03801
[8] Blyth, T. S.; Janowitz, M. F., Residuation Theory (1972), Pergamon: Pergamon London · Zbl 0301.06001
[9] Creel, N.; Preuschoft, H., Systematics of the lesser apes: a quantitative taxonomic analysis of craniometric and other variables, (Preuschoft, H.; Chivers, D. J.; Breckelman, W. Y.; Creel, N., The Lesser Apes: Evolutionary and Behavioural Biology (1984), Edinburgh Univ. Press), 562-613
[10] Estabrook, G. F., A mathematical model for biological classification, J. Theoretical Biology, 12, 297-310 (1966)
[11] Frank, M. J.; Schweizer, B., On the duality of generalized infimal and supremal convolutions, Rend. Mat., 12, 6, 1-23 (1979) · Zbl 0467.39004
[12] Gordon, A. D., Classification (1981), Chapman and Hall: Chapman and Hall London · Zbl 0507.62057
[13] Grätzer, G., General Lattice Theory (1978), Birkhäuser: Birkhäuser Basel · Zbl 0385.06015
[14] Hartigan, J. A., Clustering Algorithms (1975), Wiley: Wiley New York · Zbl 0321.62069
[15] Herden, G., Some aspects of clustering functions, SIAM J. on Algebraic and Discrete Methods, 5, 101-116 (1984) · Zbl 0534.62043
[16] Hubert, L., A set theoretical approach to the problem of hierarchical clustering, J. Math. Psych., 15, 70-88 (1977) · Zbl 0358.62042
[17] Janowitz, M. F., An order theoretic model for cluster analysis, SIAM J. Appl. Math., 34, 55-72 (1978) · Zbl 0379.62050
[18] Janowitz, M. F., Semi-flat L-cluster methods, Discrete Math., 21, 47-60 (1978) · Zbl 0389.92001
[19] Janowitz, M. F., Monotone equivariant cluster methods, SIAM J. Appl. Math., 37, 78-88 (1979) · Zbl 0419.62056
[20] Janowitz, M. F., A note on Jardine and Sibson’s \(C_μ\)-methods, Math. Biosciences, 46, 205-208 (1979) · Zbl 0423.62044
[21] Janowitz, M. F., Preservation of global order equivalence, J. Math. Psych., 20, 78-88 (1979) · Zbl 0429.92028
[22] Janowitz, M. F., Applications of the theory of partially ordered sets to cluster analysis, (Universal Algebra and Applications, 9 (1980), Banach Center Publications), 305-319 · Zbl 0508.62056
[23] Janowitz, M. F., Continuous L-cluster methods, Discrete Appl. Math., 3, 107-112 (1981) · Zbl 0454.62056
[24] Janowitz, M. F., Induced social welfare functions, Math. Social Sciences, 15, 261-276 (1988) · Zbl 0664.90005
[25] Jardine, N.; Sibson, R., A model for taxonomy, Math. Biosciences, 2, 465-482 (1968)
[26] Jardine, N.; Sibson, R., Mathematical Taxonomy (1971), Wiley: Wiley New York · Zbl 0322.62065
[27] Kandel, A., Fuzzy Techniques in Pattern Recognition (1982), Wiley: Wiley New York · Zbl 0552.68067
[28] Lerman, I. C., Les Bases de la Classification Automatique (1970), Gauthier-Villars: Gauthier-Villars Paris · Zbl 0199.51402
[29] Ling, R. F., A probability theory for cluster analysis, J. Amer. Stat. Assn., 68, 159-169 (1973) · Zbl 0285.62035
[30] Loève, N., Probability Theory, Vol. 1 (1977), Springer-Verlag: Springer-Verlag New York · Zbl 0359.60001
[31] Martin, R. H., Nonlinear Operators and Differential Equations in Banach Spaces (1976), Wiley: Wiley New York · Zbl 0333.47023
[32] Matula, D. W., Graph theoretic techniques for cluster analysis algorithms, (van Ryzin, J., Classification and Clustering (1977), Academic Press: Academic Press New York), 95-129
[33] Powers, R. C., Order theoretic classification of percentile cluster methods, (Doctoral Dissertation (1988), Univ. of Massachusetts: Univ. of Massachusetts Amherst)
[34] Powers, R. C., Order automorphisms of spaces of nondecreasing functions, J. Math. Anal. Appl., 136, 112-123 (1988) · Zbl 0664.06005
[35] Radu, V., Some fixed point theorems in probabilistic metric spaces, (Kalashnikov, V. V.; Boyan, P.; Bladimir, M., Proc. 9th Int. Seminar held in Varna, Bulgaria, May 13-19, 1985. Proc. 9th Int. Seminar held in Varna, Bulgaria, May 13-19, 1985, Stability for Stochastic Models, 1233 (1987), Springer: Springer New York), 125-133, Lect. Notes in Math. · Zbl 0622.60008
[36] Roberts, F. S., Measurement Theory with Application to Decisionmaking, (Utility and the Social Sciences (1979), Addison-Wesley: Addison-Wesley Reading) · Zbl 0432.92001
[37] Rota, G. C., The number of partitions of a set, Amer. Math. Monthly, 71, 498-504 (1964) · Zbl 0121.01803
[38] Ruspini, E. H., A new approach to clustering, Inf. and Control, 15, 22-32 (1969) · Zbl 0192.57101
[39] Ruspini, E. H., Recent developments in classification using fuzzy sets, (Lasker, G. E., Proc. Int. Congress on Applied Systems Research, vol. 6 (1980), Pergamon: Pergamon New York), 2785-2790
[40] Ruspini, E. H., Recent developments in fuzzy clustering, (Yager, R. R., Fuzzy Set and Possibility Theory (1982), Pergamon: Pergamon New York), 133-147
[41] Sadovskii, B. N., Limit-compact and condensing operators, Russian Math. Surveys, 27, 85-155 (1972) · Zbl 0243.47033
[42] Schweizer, B., Distribution functions: numbers of the future, (Di Nola, A.; Ventre, G. S., Proc. Second Napoli Meeting on “The Mathematics of Fuzzy Systems” held by Istituto di Matematica della Facoltà di Architettura (1985), Universita di Napoli), 137-149
[43] Schweizer, B.; Sklar, A., Probabilistic Metric Spaces (1983), North-Holland: North-Holland New York · Zbl 0546.60010
[44] B. Schweizer, H. Sherwood and R. Tardiff, Remarks concerning two types of contractions on probabilistic metric spaces, Stochastica, to appear.; B. Schweizer, H. Sherwood and R. Tardiff, Remarks concerning two types of contractions on probabilistic metric spaces, Stochastica, to appear. · Zbl 0689.60019
[45] Sibley, D. A., A metric for weak convergence of distribution functions, Rocky Mountain J. Math., 1, 427-430 (1971) · Zbl 0245.60007
[46] Sibson, R., A model for taxonomy II, Math. Biosciences, 6, 405-430 (1970) · Zbl 0211.23601
[47] Sibson, R., Order invariant methods for data analysis, J. Royal Stat. Soc., 34, 311-349 (1972), Ser. B · Zbl 0253.62004
[48] Sneath, P. H.A.; Sokal, R. R., Numerical Taxonomy (1973), W.H. Freeman & Co: W.H. Freeman & Co San Francisco · Zbl 0285.92001
[49] Taylor, M. D., New metrics for weak convergence of distribution functions, Stochastica, 9, 5-17 (1985) · Zbl 0607.60004
[50] Zadeh, L. A., Fuzzy sets and their application to pattern classification and clustering analysis, (van Ryzin, J., Classification and Clustering (1977), Academic Press: Academic Press New York), 251-299
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.