×

Learning the structure of the mTOR protein signaling pathway from protein phosphorylation data. (English) Zbl 07880704

Summary: Statistical learning of the structures of cellular networks, such as protein signaling pathways, is a topical research field in computational systems biology. To get the most information out of experimental data, it is often required to develop a tailored statistical approach rather than applying one of the off-the-shelf network reconstruction methods. The focus of this paper is on learning the structure of the mTOR protein signaling pathway from immunoblotting protein phosphorylation data. Under two experimental conditions eleven phosphorylation sites of eight key proteins of the mTOR pathway were measured at ten non-equidistant time points. For the statistical analysis we propose a new advanced hierarchically coupled non-homogeneous dynamic Bayesian network (NH-DBN) model, and we consider various data imputation methods for dealing with non-equidistant temporal observations. Because of the absence of a true gold standard network, we propose to use predictive probabilities in combination with a leave-one-out cross validation strategy to objectively cross-compare the accuracies of different NH-DBN models and data imputation methods. Finally, we employ the best combination of model and data imputation method for predicting the structure of the mTOR protein signaling pathway.

MSC:

62-XX Statistics
62J99 Linear inference, regression
62P10 Applications of statistics to biology and medical sciences; meta analysis

Software:

imputeTS; GauPro; BayesDA

References:

[1] Aslam, M., Azam, M., and Jun, C.-H., A new lot inspection procedure based on exponentially weighted moving average, Int. J. Syst. Sci.46 (2015), pp. 1392-1400. · Zbl 1312.93070
[2] Dalle Pezze, P., Ruf, S., Sonntag, A.G., Langelaar-Makkinje, M., Hall, P., Heberle, A.M., Razquin Navas, P., van Eunen, K., Tölle, R.C., Schwarz, J.J., Wiese, H., Warscheid, B., Deitersen, J., Stork, B., Fäßler, E., Schäuble, S., Hahn, U., Horvatovich, P., Shanley, D.P., and Thedieck, K., A systems study reveals concurrent activation of AMPK and mTOR by amino acids, Nat. Commun.7 (2016), Article ID 13254.
[3] Erickson, C., GauPro: Gaussian Process Fitting, 2021. R Package Version 0.2.4.
[4] Gaaloul, N. and Daouas, N., An extended approach of a Kalman smoothing technique applied to a transient nonlinear two-dimensional inverse heat conduction problem, Int. J. Therm. Sci.134 (2018), pp. 224-241.
[5] Gelman, A., Carlin, J., Stern, H., and Rubin, D., Bayesian Data Analysis, 2nd ed., Chapman and Hall/CRC, London, 2004. · Zbl 1039.62018
[6] Graham, M.A., Mukherjee, A., and Chakraborti, S., Distribution-free exponentially weighted moving average control charts for monitoring unknown location, Comput. Stat. Data Anal.56 (2012), pp. 2539-2561. · Zbl 1252.62117
[7] Grzegorczyk, M. and Husmeier, D., Regularization of non-homogeneous dynamic Bayesian networks with global information-coupling based on hierarchical Bayesian models, Mach. Learn.91 (2013), pp. 105-154. · Zbl 1273.68378
[8] Grzegorczyk, M. and Husmeier, D., Modelling non-homogeneous dynamic Bayesian networks with piecewise linear regression models, in Handbook of Statistical Genetics, 4th ed., D. Balding, I. Moltke, and J. Marioni, eds., Wiley, New Jersey, 2019, chapter 32, pp. 899-931.
[9] Moritz, S. and Bartz-Beielstein, T., Imputets: Time series missing value imputation in R, R J.9 (2017), p. 207.
[10] Salam, A. and Grzegorczyk, M., Model averaging for sparse seemingly unrelated regression using Bayesian networks among the errors, Comput. Stat. (2022). Available at https://link.springer.com/article/.
[11] Sanguinetti, G. and Huynh-Thu, V., Gene Regulatory Networks, , Humana Press, New York, 2019. · Zbl 1417.92005
[12] Shafiee Kamalabad, M. and Grzegorczyk, M., Non-homogeneous dynamic Bayesian networks with edge-wise sequentially coupled parameters, Bioinform.36 (2020), pp. 1198-1207.
[13] Shafiee Kamalabad, M. and Grzegorczyk, M., A new Bayesian piecewise linear regression model for dynamic network reconstruction, BMC Bioinform.22 (2021), pp. 1-24.
[14] Shafiee Kamalabad, M., Heberle, A.M., Thedieck, K., and Grzegorczyk, M., Partially non-homogeneous dynamic Bayesian networks based on Bayesian regression models with partitioned design matrices, Bioinform.35 (2019), pp. 2108-2117.
[15] Staal, O.M., Sælid, S., Fougner, A., and Stavdahl, Ø., Kalman smoothing for objective and automatic preprocessing of glucose data, IEEE J. Biomed. Health Inform.23 (2018), pp. 218-226.
[16] Zou, Z., Tao, T., Li, H., and Zhu, X., mTOR signaling pathway and mTOR inhibitors in cancer: Progress and challenges, Cell. Biosci.10 (2020), Article ID 31.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.