×

Robust factor modelling for high-dimensional time series: an application to air pollution data. (English) Zbl 1428.86027

Summary: This paper considers the factor modelling for high-dimensional time series contaminated by additive outliers. We propose a robust variant of the estimation method given in [C. Lam and Q. Yao, Ann. Stat. 40, No. 2, 694–726 (2012; Zbl 1273.62214)]. The estimator of the number of factors is obtained by an eigen analysis of a robust non-negative definite covariance matrix. Asymptotic properties of the robust eigenvalues are derived and we show that the resulting estimators have the same convergence rates as those found for the standard eigenvalues estimators. Simulations are carried out to analyse the finite sample size performance of the robust estimator of the number of factors under the scenarios of multivariate time series with and without additive outliers. As an application, the robust factor analysis is performed to reduce the dimensionality of the data and, therefore, to identify the pollution behaviour of the pollutant \(PM_{10} \).

MSC:

86A32 Geostatistics
62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH)
62G35 Nonparametric robustness

Citations:

Zbl 1273.62214

References:

[1] Anderson, T. W., An Introduction to Multivariate Statistical Analysis (2003), John Wiley & Sons: John Wiley & Sons New Jersey · Zbl 1039.62044
[2] Arcones, M. A., Limit theorems for nonlinear functionals of a stationary Gaussian sequence of vectors, Ann. Probab., 22, 4, 2242-2274 (1994) · Zbl 0839.60024
[3] Brunekreef, B.; Holgate, S. T., Air pollution and health, Lancet, 360, 9341, 1233-1242 (2002)
[4] Chang, I.; Tiao, G. C.; Chen, C., Estimation of time series parameters in the presence of outliers, Technometrics, 30, 2, 193-204 (1988)
[5] Chen, C.; Liu, L.-M., Joint estimation of model parameters and outlier effects in time series, J. Am. Stat. Assoc., 88, 421, 284-297 (1993) · Zbl 0775.62229
[6] Curtis, L.; Rea, W.; Smith-Willis, P.; Fenyves, E.; Pan, Y., Adverse health effects of outdoor air pollutants, Environ. Int., 32, 6, 815-830 (2006)
[7] Gosak, M.; Stožer, A.; Markovič, R.; Dolenšek, J.; Marhl, M.; Slak Rupnik, M.; Perc, M., The relationship between node degree and dissipation rate in networks of diffusively coupled oscillators and its significance for pancreatic beta cells, Chaos: Interdiscipl. J. Nonlinear Sci., 25, 7, 073115 (2015) · Zbl 1374.92058
[8] Horn, R. A.; Johnson, C. R., Matrix Analysis (1985), Cambridge University Press · Zbl 0576.15001
[9] Johnson, R.; Wichern, D., Applied Multivariate Statistical Analysis (2007), Prentice Hall: Prentice Hall New Jersey · Zbl 1269.62044
[10] Lam, C.; Yao, Q., Factor modeling for high-dimensional time series: inference for the number of factors, Ann. Stat., 40, 2, 694-726 (2012) · Zbl 1273.62214
[11] Lam, C.; Yao, Q.; Bathia, N., Estimation of latent factors for high-dimensional time series, Biometrika, 98, 901-918 (2011) · Zbl 1228.62110
[12] Lévy-Leduc, C.; Boistard, H.; Moulines, E.; Taqqu, M. S.; Reisen, V. A., Asymptotic properties of U-processes under long-range dependence, Ann. Stat., 39, 3, 1399-1426 (2011) · Zbl 1242.62100
[13] Lévy-Leduc, C.; Boistard, H.; Moulines, E.; Taqqu, M. S.; Reisen, V. A., Large sample behaviour of some well-known robust estimators under long-range dependence, Statistics, 45, 1, 59-71 (2011) · Zbl 1291.62108
[14] Lévy-Leduc, C.; Boistard, H.; Moulines, E.; Taqqu, M. S.; Reisen, V. A., Robust estimation of the scale and the autocovariance function of Gaussian short and long-range dependent processes, J. Time Ser. Anal., 32, 2, 135-156 (2011) · Zbl 1290.62082
[15] Lordan, O.; Sallan, J. M.; Escorihuela, N.; Gonzalez-Prieto, D., Robustness of airline route networks, Phys. A: Stat. Mech. Appl., 445, 18-26 (2016)
[16] Ma, Y.; Genton, M. G., Highly robust estimation of the autocovariance function, J. Time Ser. Anal., 21, 663-684 (2000) · Zbl 0970.62056
[17] Ma, Y.; Genton, M. G., Highly robust estimation of dispersion matrices, J. Multivar. Anal., 78, 11-36 (2001) · Zbl 1013.62065
[18] Maynard, R., Key airborne pollutants: the impact on health, Sci. Total Environ., 334-335, 0, 9-13 (2004)
[19] Molinares, F. F.; Reisen, V. A.; Cribari-Neto, F., Robust estimation in long-memory processes under additive outliers, J. Stat. Plann. Inference, 139, 8, 2511-2525 (2009) · Zbl 1162.62085
[20] Pea, D.; Box, G. E.P., Identifying a simplifying structure in time series, J. Am. Stat. Assoc., 82, 399, 836-843 (1987) · Zbl 0623.62081
[21] Perc, M., Nonlinear time series analysis of the human electrocardiogram, Eur. J. Phys., 26, 5, 757 (2005)
[22] Reisen, V. A.; Lévy-Leduc, C.; Bourguignon, M.; Boistard, H., Robust Dickey-Fuller tests based on ranks for time series with additive outliers, Metrika, 80, 1, 115-131 (2017) · Zbl 1356.62154
[23] Rousseeuw, P. J.; Croux, C., Alternatives to the median absolute deviation, J. Am. Stat. Assoc., 88, 424, 1273-1283 (1993) · Zbl 0792.62025
[24] Seinfeld, J. H.; Pandis, S. N., Atmospheric Chemistry and Physics: From Air Pollution to Climate Change (2006), John Wiley: John Wiley New York
[25] Souza, J. B.; Reisen, V. A.; Franco, G. C.; Spány, M.; Bondon, P.; Santos, J. M., Generalized additive model with principal component analysis: an application to time series of respiratory disease and air pollution data, J. R. Stat. Soc. Ser. C - Appl. Stat., 67, 453-480 (2018)
[26] Stock, J. H.; Watson, M. W., Forecasting using principal components from a large number of predictors, J. Am. Stat. Assoc., 97, 460, 1167-1179 (2002) · Zbl 1041.62081
[27] Tsay, R. S., Outliers, level shifts, and variance changes in time series, J. Forecast., 7, 1, 1-20 (1988)
[28] van der Vaart, A. W., Asymptotic Statistics. Asymptotic Statistics, Cambridge Series in Statistical and Probabilistic Mathematics (1998), Cambridge University Press · Zbl 0910.62001
[29] Vanhatalo, E.; Kulahci, M., Impact of autocorrelation on principal components and their use in statistical process control, Qual. Reliab. Eng. Int., 32, 4, 1483-1500 (2016)
[30] Watson, J. G.; Zhu, T.; Chow, J. C.; Engelbrecht, J.; Fujita, E. M.; Wilson, W. E., Receptor modeling application framework for particle source apportionment, Chemosphere, 49, 9, 1093-1136 (2002)
[31] WHO, Air Quality Guidelines: global update 2005 (2006), WHO - World Health Organization
[32] WHO, Air Pollution Estimates, WHO - World Health Organization, 2014.; WHO, Air Pollution Estimates, WHO - World Health Organization, 2014.
[33] Zamprogno, B., PCA applied in time series data with applications to air quality data (2013), PPGEA - Universidade Federal do Espírito Santo, (Ph.D. thesis)
[34] Zhang, M.; Liang, B.; Wang, S.; Perc, M.; Du, W.; Cao, X., Analysis of flight conflicts in the chinese air route network, Chaos Solitons Fractals, 112, 97-102 (2018)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.