×

Maximum likelihood estimation for a smooth Gaussian random field model. (English) Zbl 06736498

Summary: Gaussian processes are commonly used for modeling the output of deterministic computer models. We consider the behavior of maximum likelihood estimators (MLEs) of parameters of the commonly used squared exponential covariance function when the computer model has some simple deterministic form. We prove that for regularly spaced observations on the line, the MLE of the scale parameter converges to zero if the computer model is a constant function and diverges to infinity for linear functions. When observing successive derivatives of a \(p\)th order monomial at zero, we find the asymptotic orders of the MLE of the scale parameter for all \(p\geq 0\). For some commonly used test functions, we compare the MLE with cross validation in a prediction problem and explore the joint estimation of range and scale parameters. The correlation matrix is nearly numerically singular even when the sample size is moderate. To overcome numerical difficulties, we perform exact computation by making use of exact results for the correlation matrix and restricting ourselves to parameter values and test functions that yield rational correlations and function values at the observation locations. We also consider the common approach of including a nugget effect to deal with the numerical difficulties, and explore its consequences on model fitting and prediction.

MSC:

62M30 Inference from spatial processes
60G15 Gaussian processes
97N50 Interpolation and approximation (educational aspects)

Software:

Mathematica
Full Text: DOI

References:

[1] M. Abt and W. J. Welch, {\it Fisher information and maximum-likelihood estimation of covariance parameters in Gaussian stochastic processes}, Canad. J. Statist., 26 (1998), pp. 127-137. · Zbl 0899.62124
[2] G. E. Andrews, {\it The Theory of Partitions}, Cambridge University Press, Cambridge, UK, 1998. · Zbl 0996.11002
[3] I. Andrianakis and P. G. Challenor, {\it The effect of the nugget on Gaussian process emulators of computer models}, Comput. Statist. Data Anal., 56 (2012), pp. 4215-4228. · Zbl 1255.62306
[4] F. Bachoc, {\it Cross validation and maximum likelihood estimations of hyper-parameters of Gaussian processes with model misspecification}, Comput. Statist. Data Anal., 66 (2013), pp. 55-69. · Zbl 1471.62021
[5] R. A. Beeler, {\it How to Count: An Introduction to Combinatorics and Its Applications}, Springer, Cham, Switzerland, 2015. · Zbl 1328.05001
[6] F. Branin and S. Hoo, {\it A method for finding multiple extrema of a function of n variables}, Numer. Methods for Non-Linear Optimization/Academic, London, 1972, pp. 231-237. · Zbl 0271.65035
[7] I. Charpentier, C. D. Cappello, and J. Utke, {\it Efficient higher-order derivatives of the hypergeometric function}, in Advances in Automatic Differentiation, Springer, Berlin, 2008, pp. 127-137. · Zbl 1152.65413
[8] H. Cheng and A. Sandu, {\it Collocation least-squares polynomial chaos method}, in Proceedings of the 2010 Spring Simulation Multiconference, Society for Computer Simulation International, San Diego, CA, 2010, 80.
[9] A. Forrester, A. Sobester, and A. Keane, {\it Engineering Design via Surrogate Modeling: A Practical Guide}, Wiley, Chichester, England, 2008.
[10] R. Franke, {\it A Critical Comparison of Some Methods for Interpolation of Scattered Data}, Technical report NPS-53-79-003, DTIC, Fort Belvoir, VA, 1979.
[11] F. A. Gómez, C. E. Coleman-Smith, B. W. O’Shea, J. Tumlinson, and R. L. Wolpert, {\it Characterizing the formation history of milky way like stellar halos with model emulators}, Astrophys. J., 760 (2012), 112.
[12] H. Gould and J. Quaintance, {\it Double fun with double factorials}, Math. Mag., 85 (2012), pp. 177-192. · Zbl 1260.05003
[13] A. Griewank, J. Utke, and A. Walther, {\it Evaluating higher derivative tensors by forward propagation of univariate Taylor series}, Math. Comput., 69 (2000), pp. 1117-1130. · Zbl 0952.65028
[14] A. Griewank and A. Walther, {\it Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation}, 2nd ed., Other Titles Appl. Math., SIAM, Philadelphia, 2008. · Zbl 1159.65026
[15] B. Haaland, P. Z. Qian, {\it Accurate emulators for large-scale computer experiments}, Ann. Statist., 39 (2011), pp. 2974-3002. · Zbl 1246.65172
[16] W. Huang and W. J. Welch, {\it Properties of Parameters in a Stochastic Process Model for Computer Experiments}. M. Math thesis, Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario, Canada, 2000.
[17] C. L. Lawson and R. J. Hanson, {\it Solving Least Squares Problems}, classics 15, SIAM, Appl. Math. Philadelphia, 1995. · Zbl 0860.65029
[18] Y. B. Lim, J. Sacks, W. Studden, and W. J. Welch, {\it Design and analysis of computer experiments when the output is highly correlated over the input space}, Canad. J. Statist., 30 (2002), pp. 109-126. · Zbl 1003.62082
[19] W.-L. Loh and T.-K. Lam, {\it Estimating structured correlation matrices in smooth Gaussian random field models}, Ann. Statist., 28 (2000), pp. 880-904. · Zbl 1105.62376
[20] J. D. Martin and T. W. Simpson, {\it On the use of kriging models to approximate deterministic computer models}, in ASME 2004 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, American Society of Mechanical Engineers, New York, 2004, pp. 481-492.
[21] J. Oakley, {\it Estimating percentiles of uncertain computer code outputs}, J. R. Stat. Soc. Ser. C. Appl. Stat., 53 (2004), pp. 83-93. · Zbl 1111.62395
[22] O. Obrezanova, G. Csányi, J. M. Gola, and M. D. Segall, {\it Gaussian processes: A method for automatic QSAR modeling of ADME properties}, J. Chem. Inf. Model., 47 (2007), pp. 1847-1857.
[23] V. Picheny, T. Wagner, and D. Ginsbourger, {\it A benchmark of kriging-based infill criteria for noisy optimization}, Struct. Multidiscip. Optim., 48 (2013), pp. 607-626.
[24] P. Ranjan, R. Haynes, and R. Karsten, {\it A computationally stable approach to Gaussian process interpolation of deterministic computer simulation data}, Technometrics, 53 (2011), pp. 366-378.
[25] J. Sacks, S. B. Schiller, and W. J. Welch, {\it Designs for computer experiments}, Technometrics, 31 (1989), pp. 41-47.
[26] J. Sacks, W. J. Welch, T. J. Mitchell, and H. P. Wynn, {\it Design and analysis of computer experiments}, Statist. Sci., 4 (1989), pp. 409-423. · Zbl 0955.62619
[27] T. W. Simpson, T. M. Mauery, J. J. Korte, and F. Mistree, {\it Kriging models for global approximation in simulation-based multidisciplinary design optimization}, AIAA J., 39 (2001), pp. 2233-2241.
[28] M. L. Stein, {\it Interpolation of Spatial Data: Some Theory for Kriging}, Springer, New York, 1999. · Zbl 0924.62100
[29] Q. Tang, Y. B. Lau, S. Hu, W. Yan, Y. Yang, and T. Chen, {\it Response surface methodology using Gaussian processes: Towards optimizing the trans-stilbene epoxidation over Co} 2{\it +–NaX catalysts}, Chem. Eng. J., 156 (2010), pp. 423-431.
[30] M. Wagner, A. Walther, and B.-J. Schaefer, {\it On the efficient computation of high-order derivatives for implicitly defined functions}, Comput. Phys. Commun., 181 (2010), pp. 756-764. · Zbl 1205.65131
[31] S. Wolfram, {\it The MATHEMATICA Book, Version} 4, Cambridge University Press, Cambridge, UK, 1999. · Zbl 0924.65002
[32] H. Zhang, {\it Inconsistent estimation and asymptotically equal interpolations in model-based geostatistics}, J. Amer. Statist. Assoc., 99 (2004), pp. 250-261. · Zbl 1089.62538
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.