Abstract
In many environmental and agricultural studies, data are collected on both linear and circular random variables, with possible dependence between the variables. Classically, the analysis of such data has been carried out in a classical regression framework. We propose a Bayesian hierarchical framework to handle all forms of uncertainty arising in a linear-circular data set. One novelty of our multivariate linear-circular model is that, marginally, the circular component is assumed to be a mixture model with an unknown number of von Mises (or circular normal) distributions. We use the Dirichlet process to introduce variability in the model dimensionality, and develop a simple Gibbs sampling algorithm for simulating the mixture components. Although we illustrate our methodology on von Mises mixtures, it is widely applicable. We thus avoid complicated reversible-jump Markov chain Monte Carlo methods, which are considered ideal for analyzing mixtures of unknown number of distributions. We illustrate our methodologies with simulated and real data sets. Using pseudo-Bayes factors, we also compare different models associated with both fixed and variable numbers of von Mises distributions. Our findings suggest that models associated with varying numbers of mixture components perform at least as well as those with known numbers of mixture components. We tentatively argue that model averaging associated with variable number of mixture components improves the model’s predictive power, which compensates for the lack of knowledge of the actual number of mixture components.
Similar content being viewed by others
References
Antoniak, C. E. (1974), “Mixtures of Dirichlet Processes With Applications to Nonparametric Problems,” Annals of Statistics, 2, 1152–1174.
Berger, J. O. (1985), Statistical Decision Theory and Bayesian Analysis, New York: Springer.
Bhattacharya, S. (2006), “A Bayesian Semiparametric Model for Organism Based Environmental Reconstruction,” Environmetrics, 17 (7), 763–776.
Blackwell, D., and McQueen, J. B. (1973), “Ferguson Distributions via Pólya Urn Schemes,” Annals of Statistics, 1, 353–355.
Dalal, S. R., and Hall, W. J. (1983), “Approximating Priors by Mixtures of Natural Conjugate Priors,” Journal of the Royal Statistical Society, Ser. B, 45, 278–286.
Damien, P., and Walker, S. (1999), “A Full Bayesian Analysis of Circular Data Using the von Mises Distribution,” The Canadian Journal of Statistics, 27, 291–298.
De Wiest, F., and Della Fiorentina, H. (1975), “Suggestion for a Realistic Definition of an Air Quality Index Relative to Hydro-Carbonaceous Matter Associated With Airborne Particles,” Atmospheric Environment, 9, 951–954.
Diaconis, P., and Ylvisaker, D. (1985), “Quantifying Prior Opinion” (with discussion), in Bayesian Statistics 2, eds. J. M. Bernardo, M. H. DeGroot, D. V. Lindley, and A. F. M. Smith, Amsterdam: North-Holland, pp. 133–156.
Escobar, M. D., and West, M. (1995), “Bayesian Density Estimation and Inference Using Mixtures,” Journal of the American Statistical Association, 90 (430), 577–588.
Ferguson, T. S. (1974), “A Bayesian Analysis of Some Nonparametric Problems,” Annals of Statistics, 1, 209–230.
Geisser, S., and Eddy, W. F. (1979), “A Predictive Approach to Model Selection,” Journal of the American Statistical Association, 74 (365), 153–160.
Gelfand, A. E. (1996), “Model Determination Using Sampling-Based Methods,” in Markov Chain Monte Carlo in Practice, eds. W. Gilks, S. Richardson, and D. Spiegelhalter, London: Chapman & Hall, pp. 145–162.
Gelfand, A. E., Dey, D. K., and Chang, H. (1992), “Model Determination Using Predictive Distributions With Implementation via Sampling Methods” (with discussion), in Bayesian Statistics 4, eds. J.M. Bernardo, J. O. Berger, A. P. Dawid, and A. F. M. Smith, Oxford: Clarendon Press, pp. 147–167.
George, B. J., and Ghosh, K. (2006), “A Semiparametric Bayesian Model for Circular-Linear Regression,” Communications in Statistics—Simulation and Computation, 35, 911–923.
Green, P. J. (1995), “Reversible Jump Markov Chain Monte Carlo Computation and Bayesian Model Determination,” Biometrika, 82, 711–732.
Guttorp, P., and Lockhart, R. A. (1988), “Finding the Location of a Signal: A Bayesian Analysis,” Journal of the American Statistical Association, 83, 322–330.
Jeffreys, H. (1961), Theory of Probability (3rd ed.), Oxford: Oxford University Press.
Johnson, R. A., and Wehrly, T. E. (1977), “Measures and Models for Angular Correlation and Angular-Linear Correlation,” Journal of the Royal Statistical Society, Ser. B, 39, 222–229.
— (1978), “Some Angular-Linear Distributions and Related Regression Models,” Journal of the American Statistical Association, 73, 602–606.
MacEachern, S. N. (1994), “Estimating Normal Means With a Conjugate Style Dirichlet Process Prior,” Communications in Statistics, Ser. B, 23, 727–741.
Müller, P., Erkanli, A., and West, M. (1996), “Bayesian Curve Fitting Using Multivariate Normal Mixtures,” Biometrika, 83 (1), 67–79.
Ravindran, P., and Ghosh, S. K. (2002), “Bayesian Methods for Circular Regression Using Wrapped Distributions,” in Proceedings of the Joint Statistical Meetings, American Statistical Association.
Richardson, S., and Green, P. J. (1997), “On Bayesian Analysis of Mixtures With an Unknown Number of Components” (with discussion), Journal of the Royal Statistical Society, Ser. B, 59, 731–792.
Riley, J. R., Reynolds, D. R., Mukhopadhyay, S., Ghosh, M. R., Chowdhuri, M. M., Sarkar, T. K., Nath, P. S., Sarkar, S., Das, B. K., De, B. K., Satpathi, C. R., Guha, M. K., Dutta Majumder, D., Das, J., and De, A. K. (1993), “A Preliminary Investigation of the Windborne Movements of Insect Vectors of Crop Diseases in Northeast India,” NRI Project A0166, Electronics and Communication Sciences Unit, Indian Statistical Institute, Calcutta.
Robert, C. P., and Casella, G. (2004), Monte Carlo Statistical Methods, New York: Springer.
SenGupta, A. (2004), “On the Constructions of Probability Distributions for Directional Data,” Bulletin of the Calcutta Mathematical Society, 96, 139–154.
SenGupta, A. (2007), “Directional Data Statistical Analysis Package,” available at http://directionalstatistics.net/.
SenGupta, A., and Ugwuowo, F. L. (2006), “Asymmetric Circular-Linear Multivariate Regression Models With Applications to Environmental Data,” Environmental and Ecological Statistics, 13, 299–309.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bhattacharya, S., Sengupta, A. Bayesian analysis of semiparametric linear-circular models. JABES 14, 33–65 (2009). https://doi.org/10.1198/jabes.2009.0003
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1198/jabes.2009.0003