×

Bayes discriminant rules with ordered predictors. (English) Zbl 1337.62139

Summary: We propose and discuss improved Bayes rules to discriminate between two populations using ordered predictors. To address the problem we propose an alternative formulation using a latent space that allows to introduce the information about the order in the theoretical rules. The rules are first defined when the marginal densities are fully known and then under normality when the parameters are unknown and training samples are available. Several numerical examples and simulations in the paper illustrate the methodology and show that the new rules handle the information appropriately. We compare the new rules with the classical Bayes and Fisher rules in these examples and we show that the misclassification probability is smaller for the new rules. The method is also applied to data from a diabetes study where we again show that the new rules improve over the usual Fisher rule.

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62F15 Bayesian inference

Software:

ORIOGEN
Full Text: DOI

References:

[1] ABELSON, R.P. and TUKEY, J.W. (1963), ”Efficient Utilization of Non-Numerical Information in Quantitative Analysis: General Theory and the Case of the Simple Order”, Annals of Mathematical Statistics 34, 1347–1369. · Zbl 0121.13907 · doi:10.1214/aoms/1177703869
[2] ATTALLAH, A.M., TOSON, E.A., EL-WASEEF,A.M., ABO-SEIF,M.A.,OMRAN,M.M., and SHIHA, G.E. (2006), ”Discriminant Function Based on Hyaluronic Acid and Its Degrading Enzymes and Degradation Products for Differentiating Cirrhotic from Non-Cirrhotic Liver Diseased Patients in Chronic HCV Infection”, Clinica Chimica Acta 369, 66–72. · doi:10.1016/j.cca.2006.01.004
[3] BARLOW, R.E., BARTHOLOMEW, D.J., BREMNER, J.M., and BRUNK, H.D. (1972), Statistical Inference under Order Restrictions, New York: Wiley. · Zbl 0246.62038
[4] BLAKE, C.L. and MERZ, C.J. (1998), UCI Repository Learning Databases, http://www.ics.uci.edu/\(\sim\)mlearn/MLRepository.html , Irvine, CA: University of California, Department of Information and Computer Science.
[5] BRETZ, F. and HOTHORN, L.A. (2003), ”Statistical Analysis of Monotone or Non-Monotone Dose-Response Data from In Vitro Toxicological Assays”, Alternatives to Laboratory Animals 31, Supp. 1, 81–96.
[6] CONDE, D., FERNÁNDEZ,M.A., and SALVADOR, B. (2005), ”A Classification Rule for Ordered Exponential Populations”, Journal of Statistical Planning and Inference 153, 339–356. · Zbl 1074.62040 · doi:10.1016/j.jspi.2004.05.004
[7] DUNSON, D.B. (2005), ”Bayesian Semiparametric Isotonic Regression for Count Data”, Journal of the American Statistical Association 100, 618–627. · Zbl 1117.62322 · doi:10.1198/016214504000001457
[8] DYKSTRA, R., HEWETT, J., and ROBERTSON, T. (1999), ”Nonparametric Isotonic Discriminant Procedures”, Biometrika 86, 429–438. · Zbl 0930.62065 · doi:10.1093/biomet/86.2.429
[9] EL BARMI, H. and DYKSTRA, R. (1998), ”Maximum Likelihood Methods for Log-Convex Models When Cell Probabilities Are Subject to Convex Constraints”, Annals of Statistics 26, 1878–1893. · Zbl 0929.62029 · doi:10.1214/aos/1024691361
[10] FERNÁNDEZ,M.A., RUEDA, C., and SALVADOR, B. (2006), ”Incorporating Additional Information to Normal Linear Discriminant Rules”, Journal of the American Statistical Association 101, 569–577. · Zbl 1119.62340 · doi:10.1198/016214505000001041
[11] GENZ, A. (1992), ”Numerical Computation of Multivariate Normal Probabilities”, Journal of Computational and Graphical Statatistics 1, 142–150.
[12] HOFF, P.D. (2003a), ”Bayesian Methods for Partial Stochastic Ordering”, Biometrika 90, 303–317. · Zbl 1035.62003 · doi:10.1093/biomet/90.2.303
[13] HOFF, P.D. (2003b), ”Nonparametric Estimation of Convex Models Via Mixtures”, Annals of Statistics 31, 174–200. · Zbl 1018.62023 · doi:10.1214/aos/1046294461
[14] LONG, T. and GUPTA, R.D. (1998), ”Alternative Linear Classification Rules Under Order Restrictions”, Communications in Statististics - Theory and Methods 27, 559–575. · Zbl 0895.62064 · doi:10.1080/03610929808832113
[15] MOON, H.R. and SCHORFHEIDE, F. (2005), ”Empirical Likelihood Estimation with Inequality Moment Constraints”, Working paper: http://www.econ.princeton.edu/seminars/ECONOMETRICS/Metrirs_Fall_2005/Schorfheide.pdf .
[16] NEELON, B. and DUNSON, D.B. (2004), ”Bayesian Isotonic Regression and Trend Analysis”, Biometrics 60, 398–406. · Zbl 1125.62023 · doi:10.1111/j.0006-341X.2004.00184.x
[17] PEDDADA, S.D., HARRIS, S., ZAJD, J., and HARVEY, E. (2005), ”ORIOGEN: Order Restricted Inference for Ordered Gene Expression Data”, Bioinformatics 21, 3933–3934. · doi:10.1093/bioinformatics/bti637
[18] PEDDADA, S.D., PRESCOTT, K.E., LI, L.P., AFSHARI, C.A., WEINBERG, C.R., and UMBACH, D. (2003), ”Gene Selection and Clustering for Time.Course And Dose-Response Microarray Experiments Using Order-Restricted Inference”, Bioinformatics 19, 834–841. · doi:10.1093/bioinformatics/btg093
[19] ROBERTSON, T., WRIGHT, F.T., and DYKSTRA, R.L. (1988), Order Restricted Statistical Inference, New York: John Wiley and Sons. · Zbl 0645.62028
[20] SILVAPULLE, M.J. and SEN, P.K. (2005), Constrained Statistical Inference, New Jersey: John Wiley and Sons. · Zbl 1077.62019
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.