×

A theory of statistical models for Monte Carlo integration (with discussion). (English) Zbl 1067.62054

Summary: The task of estimating an integral by Monte Carlo methods is formulated as a statistical model using simulated observations as data. The difficulty in this exercise is that we ordinarily have at our disposal all of the information required to compute integrals exactly by calculus or numerical integration, but we choose to ignore some of the information for simplicity or computational feasibility. Our proposal is to use a semiparametric statistical model that makes explicit what information is ignored and what information is retained. The parameter space in this model is a set of measures on the sample space, which is ordinarily an infinite-dimensional object. None-the-less, from simulated data the base-line measure can be estimated by maximum likelihood, and the required integrals computed by a simple formula previously derived by Y. Vardi [Ann. Stat. 13, 178–205 (1985; Zbl 0578.62047)] and by B. G. Lindsay [Mixture models: Theory, geometry and applications. (1995); see also J. Stat. Plann. Inference 47, 29–39 (1995; Zbl 0832.62027)] in a closely related model for biased sampling. The same formula was also suggested by C. J. Geyer [Tech. Rep. 568, School Statist., Univ. Minessota, Minneapolis (1994)] and by X.-L. Meng and W. H. Wong [Stat. Sin. 6, No. 4, 831–860 (1996; Zbl 0857.62017)] using entirely different arguments.
By contrast with Geyer’s retrospective likelihood, a correct estimate of simulation error is available directly from the Fisher information. The principal advantage of the semiparametric model is that variance reduction techniques are associated with submodels in which the maximum likelihood estimator in the submodel may have substantially smaller variance than the traditional estimator. The method is applicable to Markov chain and more general Monte Carlo sampling schemes with multiple samplers.

MSC:

62G99 Nonparametric inference
65C05 Monte Carlo methods
65D30 Numerical integration
62F15 Bayesian inference
65C60 Computational problems in statistics (MSC2010)
65C40 Numerical analysis or methods applied to Markov chains
Full Text: DOI

References:

[1] Albert J., J. Am. Statist. Ass. 88 pp 669– (1993)
[2] Bishop Y. M. M., Discrete Multivariate Analyses: Theory and Practice (1975)
[3] Brown B. W., Biostatistics Casebook (1980)
[4] Chib S., J. Am. Statist. Ass. 90 pp 1313– (1995)
[5] Craiu R. V., Ann. Statist. (2004)
[6] Cui L., Statist. Sci. 7 pp 483– (1992)
[7] Deming W. E., Ann. Math. Statist. 11 pp 427– (1940)
[8] DiCiccio T. J., J. Am. Statist. Ass. 92 pp 903– (1997)
[9] Evans M., Approximating Integrals via Monte Carlo and Deterministic Methods (2000) · Zbl 0958.65009
[10] DOI: 10.1111/1467-9868.00105 · Zbl 0910.62009 · doi:10.1111/1467-9868.00105
[11] Gelman A., Am. Statistn 45 pp 125– (1991)
[12] DOI: 10.1214/ss/1028905934 · Zbl 0966.65004 · doi:10.1214/ss/1028905934
[13] C. J. Geyer (1994 ) Estimating normalizing constants and reweighting mixtures in Markov chain Monte Carlo .Technical Report 568. School of Statistics, University of Minnesota, Minneapolis.
[14] Gill R., Ann. Statist. 16 pp 1069– (1988)
[15] Glynn P. W., Monte Carlo and Quasi-Monte Carlo Methods pp 27– (2000)
[16] Hammersley J. M., Monte Carlo Methods (1964) · Zbl 0121.35503 · doi:10.1007/978-94-009-5819-7
[17] Hesterberg T., Technometrics 37 pp 185– (1995)
[18] Horvitz D. G., J. Am. Statist. Ass. 47 pp 663– (1952)
[19] Lindsay B., Mixture Models: Theory, Geometry and Applications (1995) · Zbl 1163.62326
[20] Liu J. S., Monte Carlo Strategies in Scientific Computing (2001) · Zbl 0991.65001
[21] Liu J. S., Biometrika 87 pp 353– (2000)
[22] MacEachern S. N., J. Comput. Graph. Statist. 9 pp 99– (2000)
[23] Mallows C. L., Ann. Statist. 13 pp 204– (1985)
[24] DOI: 10.1214/aos/1035844977 · Zbl 1039.62003 · doi:10.1214/aos/1035844977
[25] Meng X.-L., Statist. Sin. 6 pp 831– (1996)
[26] Owen A., J. Am. Statist. Ass. 95 pp 135– (2000)
[27] DOI: 10.1111/1467-9868.00106 · Zbl 0909.62006 · doi:10.1111/1467-9868.00106
[28] Ripley B. D., Stochastic Simulation (1987) · Zbl 0613.65006 · doi:10.1002/9780470316726
[29] Ritter C., J. Am. Statist. Ass. 97 pp 861– (1992)
[30] Rothery P., Appl. Statist. 31 pp 125– (1982)
[31] Vardi Y., Ann. Statist. 13 pp 178– (1985)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.