×

Phenotyping OSA: a time series analysis using fuzzy clustering and persistent homology. (English) Zbl 1533.62074

Summary: Sleep apnea is a disorder that has serious consequences for the pediatric population. There has been recent concern that traditional diagnosis of the disorder using the apnea-hypopnea index may be ineffective in capturing its multi-faceted outcomes. In this work, we take a first step in addressing this issue by phenotyping patients using a clustering analysis of airflow time series. This is approached in three ways: using feature-based fuzzy clustering in the time and frequency domains, and using persistent homology to study the signal from a topological perspective. The fuzzy clusters are analyzed in a novel manner using a Dirichlet regression analysis, while the topological approach leverages Takens’ embedding theorem to study the periodicity properties of the signals.

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis
55N31 Persistent homology and applications, topological data analysis
62H86 Multivariate analysis and fuzziness
62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH)
92C50 Medical applications (general)

References:

[1] Won, C. H., When will we ditch the ahi?, J. Clin. Sleep Med., 16, 1055-1062 (2020)
[2] Rosen, C. L.; Wang, R.; Taylor, H. G.; Marcus, C. L.; Katz, E. S.; Paruthi, S.; Arens, R.; Muzumdar, H.; Garetz, S. L.; Mitchell, R. B.a.; Jones, D.; Weng, J.; Ellenberg, S.; Redline, S.; Chervin, R. D., Utility of symptoms to predict treatment outcomes in obstructive sleep apnea syndrome, Pediatrics, 135, 3 (2015), e662-e671
[3] Mostafa, S. S.; Mendono̧a, F.; Ravelo-Garcóa, A. G.; Morgado-Dias, F., A systematic review of detecting sleep apnea using deep learning, Sensors (Basel), 4934-4959 (2019)
[4] Zinchuk, A.; Yaggi, H. K., Phenotypic subtypes of osa a challenge and opportunity for precision medicine, Sleep: CHEST Rev., 403-420 (2020)
[5] Loliencar, P.; Singhal, K.; Heo, G., Phenotyping Pediatric Obstructive Sleep Apnea Using Clustering of Polysomnography Time Series (2021)
[6] Hijazi, R. H.; Jernigan, R. W., Modelling compositional data using Dirichlet regression models, J. Appl. Probab. Stat., 4, 1, 77-91 (2009) · Zbl 1166.62053
[7] Campbell, G.; Mosimann, J., Multivariate methods for proportional shape, (ASA Proceedings of the Section on Statistical Graphics, vol. 1. ASA Proceedings of the Section on Statistical Graphics, vol. 1, Washington (1987)), 10-17
[8] Maier, M. J., Dirichletreg: Dirichlet regression for compositional data in r (January 2014), WU Vienna University of Economics and Business: WU Vienna University of Economics and Business Vienna, Research Report Series/Department of Statistics and Mathematics 125
[9] Zhu, X., Persistent Homology: An Introduction and a New Text Representation for Natural Language Processing (2013)
[10] Bendich, P.; Marron, J. S.; Miller, E.; Pieloch, A.; Skwerer, S., Persistent homology analysis of brain artery trees, Ann. Appl. Stat., 10, 1, 198-218 (2016)
[11] Tralie, C. J.; Perea, J. A., (Quasi)periodicity quantification in video data, using topology, SIAM J. Imaging Sci., 11, 2, 1049-1077 (2018) · Zbl 1401.65025
[12] Gidea, M.; Katz, Y., Topological data analysis of financial time series: landscapes of crashes, Physica A, 491, 820-834 (2018) · Zbl 1514.62206
[13] Edelsbrunner, H.; Harer, J., Computational Topology - an Introduction (2010), American Mathematical Society · Zbl 1193.55001
[14] Zomorodian, A.; Carlsson, G., Computing persistent homology, Discrete Comput. Geom., 33, 249-274 (2005) · Zbl 1069.55003
[15] Carlsson, G., Topological pattern recognition for point cloud data, Acta Numer., 23, 289-368 (2014) · Zbl 1398.68615
[16] Takens, F., Detecting strange attractors in turbulence, (Lecture Notes in Mathematics (1981), Springer: Springer Berlin, Heidelberg), 366-381 · Zbl 0513.58032
[17] Perea, J. A.; Harer, J., Sliding windows and persistence: an application of topological methods to signal analysis, Found. Comput. Math., 15, 799-838 (2015) · Zbl 1325.37054
[18] Perea, J. A.; Deckard, A.; Haase, S. B.; Harer, J., SW1PerS: sliding windows and 1-persistence scoring; discovering periodicity in gene expression time series data, BMC Bioinform., 16, Article 257 pp. (2015)
[19] Kim, K.; Kim, J.; Rinaldo, A., Time series featurization via topological data analysis (2019)
[20] Ravishanker, N.; Chen, R., Topological data analysis (TDA) for time series (2019)
[21] Gholizadeh, S.; Zadrozny, W., A short survey of topological data analysis in time series and systems analysis (2018)
[22] Maharaj, E.; D’Urso, P.; Caiado, J., Time Series Clustering and Classification (2019), CRC Press · Zbl 1435.62006
[23] Krishnapuram, R.; Joshi, A.; Nasraoui, O.; Yi, L., Low-complexity fuzzy relational clustering algorithms for web mining, IEEE Trans. Fuzzy Syst., 9, 4, 595-607 (2001)
[24] Sardá-Espinosa, A., Comparing time-series clustering algorithms in r using the dtwclust package, R Package Vignette, 12, 41 (2017)
[25] Maier, M. J., DirichletReg: Dirichlet Regression in R, r package version 0.7-0 (2020)
[26] Ferrari, S.; Cribari-Neto, F., Beta regression for modelling rates and proportions, J. Appl. Stat., 31, 7, 799-815 (2004) · Zbl 1121.62367
[27] Cribari-Neto, F.; Zeileis, A., Beta regression in R, J. Stat. Softw., 34, 2, 1-24 (2010)
[28] G. Campbell, J. Mosimann, Modelling continuous proportional data with the Dirichlet distribution, Unpublished manuscript.
[29] Gueorguieva, R.; Rosenheck, R.; Zelterman, D., Dirichlet component regression and its applications to psychiatric data, Comput. Stat. Data Anal., 52, 12, 5344-5355 (2008) · Zbl 1452.62066
[30] Hijazi, R. H., Residuals and diagnostics in Dirichlet regression, (ASA Proceedings of the General Methodology Section (2006)), 1190-1196
[31] de Silva, V.; Ghrist, R., Coverage in sensor networks via persistent homology, Algebraic Geom. Topol., 7, 1, 339-358 (2007) · Zbl 1134.55003
[32] Rotman, J. J., An Introduction to Algebraic Topology (1988), Springer-Verlag: Springer-Verlag New York, NY · Zbl 0661.55001
[33] McCord, M. C., Homotopy type comparison of a space with complexes associated with its open covers, Proc. Am. Math. Soc., 18, 4, 705-708 (1967) · Zbl 0171.22001
[34] Carlssony, G.; Zomorodian, A.; Collins, A.; Guibas, L., Persistence barcodes for shapes, (Symposium on Geometry Processing (2004), The Eurographics Association)
[35] Edelsbrunner, Z. A.H.; Letscher, D., Topological persistence and simplification, Discrete Comput. Geom., 28, 511-533 (2002) · Zbl 1011.68152
[36] Khasawneh, F. A.; Munch, E., Chatter detection in turning using persistent homology, Mech. Syst. Signal Process., 70-71, 527-541 (2016)
[37] Kaditis, A. G.; Alonso Alvarez, M. L.; Boudewyns, A.; Alexopoulos, E. I.; Ersu, R.; Joosten, K.; Larramona, H.; Miano, S.; Narang, I.; Trang, H.; Tsaoussoglou, M.; Vandenbussche, N.; Villa, M. P.; Van Waardenburg, D.; Weber, S.; Verhulst, S., Obstructive sleep disordered breathing in 2- to 18-year-old children: diagnosis and management, Eur. Respir. J., 47, 1, 69-94 (2016)
[38] Owens, J.; Spirito, A.; McGuinn, M., The children’s sleep habits questionnaire (cshq): psychometric properties of a survey instrument for school-aged children, Sleep, 23, 1043-1051 (2001)
[39] da Silva, V. C.; Leite, A. J.M., Quality of life in children with sleep-disordered breathing: evaluation by osa-18, Braz. J. Otorhinolaryngol., 72, 6, 747-756 (2006)
[40] Varni, J. W.; Seid, M.; Kurtin, P. S., Pedsql™ 4.0: reliability and validity of the pediatric quality of life inventory™ version 4.0 generic core scales in healthy and patient populations, Med. Care, 800-812 (2001)
[41] Chervin, R.; Hedger, K.; Dillon, J.; Pituch, K., Pediatric sleep questionnaire (psq): validity and reliability of scales for sleep-disordered breathing, snoring, sleepiness, and behavioral problems, Sleep Med., 1, 21-32 (2000)
[42] D. Peña, P. Galeano, et al. Multivariate analysis in vector time series, DES-Working Papers. Statistics and Econometrics. WS (ws012415).
[43] Caiado, J.; Crato, N.; Peña, D., A periodogram-based metric for time series classification, Comput. Stat. Data Anal., 50, 10, 2668-2684 (2006) · Zbl 1445.62222
[44] Welch, P., The use of fast Fourier transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms, IEEE Trans. Audio Electroacoust., 15, 2, 70-73 (1967)
[45] Dave, R.; Sen, S., Robust fuzzy clustering of relational data, IEEE Trans. Fuzzy Syst., 10, 6, 713-727 (2002)
[46] Overall, D. S.J. E., Estimating sample sizes for repeated measurement designs, Control. Clin. Trials, 15, 2, 100-123 (1994)
[47] Güzel, İsmail; Kaygun, A., Hierarchical clustering and zeroth persistent homology (2020)
[48] Aliverti, A., Wearable technology: role in respiratory health and disease, Breathe, 12 (2017), e27-236
[49] D. Morozov, Persistence algorithm takes cubic time in worst case, BioGeometry News, Dept. Comput. Sci., Duke Univ.
[50] Silva, V.d.; Carlsson, G., Topological estimation using witness complexes, (Gross, M.; Pfister, H.; Alexa, M.; Rusinkiewicz, S., SPBG’04 Symposium on Point - Based Graphics 2004 (2004), The Eurographics Association)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.