Showing 1–27 of 27 results for author: Coull, B A

  1. arXiv:2409.18005  [pdf, other]

    stat.ME

    Collapsible Kernel Machine Regression for Exposomic Analyses

    Authors: Glen McGee, Brent A. Coull, Ander Wilson

    Abstract: An important goal of environmental epidemiology is to quantify the complex health risks posed by a wide array of environmental exposures. In analyses focusing on a smaller number of exposures within a mixture, flexible models like Bayesian kernel machine regression (BKMR) are appealing because they allow for non-linear and non-additive associations among mixture components. However, this flexibili…

    Submitted 26 September, 2024; originally announced September 2024.

  2. arXiv:2312.13331  [pdf, other]

    stat.ME stat.AP

    A Bayesian Spatial Berkson error approach to estimate small area opioid mortality rates accounting for population-at-risk uncertainty

    Authors: Emily N Peterson, Rachel C. Nethery, Jarvis T. Chen, Loni P. Tabb, Brent A. Coull, Frederic B. Piel, Lance A Waller

    Abstract: Monitoring small-area geographical population trends in opioid mortality has large scale implications to informing preventative resource allocation. A common approach to obtain small area estimates of opioid mortality is to use a standard disease mapping approach in which population-at-risk estimates are treated as fixed and known. Assuming fixed populations ignores the uncertainty surrounding sma…

    Submitted 20 December, 2023; originally announced December 2023.

  3. arXiv:2209.04316  [pdf, other]

    stat.AP

    Impacts of Census Differential Privacy for Small-Area Disease Mapping to Monitor Health Inequities

    Authors: Yanran Li, Brent A. Coull, Nancy Krieger, Emily Peterson, Lance A. Waller, Jarvis T. Chen, Rachel C. Nethery

    Abstract: The US Census Bureau will implement a new privacy-preserving disclosure avoidance system (DAS), which includes application of differential privacy, on the public-release 2020 census data. There are concerns that the DAS may bias small-area and demographically-stratified population counts, which play a critical role in public health research and policy, serving as denominators in estimation of dise…

    Submitted 29 March, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

  4. arXiv:2204.07293  [pdf, other]

    stat.ML cs.LG

    Towards a Unified Framework for Uncertainty-aware Nonlinear Variable Selection with Theoretical Guarantees

    Authors: Wenying Deng, Beau Coker, Rajarshi Mukherjee, Jeremiah Zhe Liu, Brent A. Coull

    Abstract: We develop a simple and unified framework for nonlinear variable selection that incorporates uncertainty in the prediction function and is compatible with a wide range of machine learning models (e.g., tree ensembles, kernel methods, neural networks, etc). In particular, for a learned nonlinear model $f(\mathbf{x})$, we consider quantifying the importance of an input variable $\mathbf{x}^j$ using…

    Submitted 27 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: 50 pages, 16 figures, 11 tables

  5. arXiv:2204.00040  [pdf, other]

    stat.ME stat.AP

    Integrating Biological Knowledge in Kernel-Based Analyses of Environmental Mixtures and Health

    Authors: Glen McGee, Ander Wilson, Brent A Coull, Thomas F Webster

    Abstract: A key goal of environmental health research is to assess the risk posed by mixtures of pollutants. As epidemiologic studies of mixtures can be expensive to conduct, it behooves researchers to incorporate prior knowledge about mixtures into their analyses. This work extends the Bayesian multiple index model (BMIM), which assumes the exposure-response function is a non-parametric function of a set o…

    Submitted 31 March, 2022; originally announced April 2022.

  6. arXiv:2202.04198  [pdf, other]

    stat.AP

    Multivariate cluster point process to quantify and explore multi-entity configurations: Application to biofilm image data

    Authors: Suman Majumder, Brent A. Coull, Jessica L. Mark Welch, Patrick J. La Riviere, Floyd E. Dewhirst, Jacqueline R. Starr, Kyu Ha Lee

    Abstract: Clusters of similar or dissimilar objects are encountered in many fields. Frequently used approaches treat the central object of each cluster as latent. Yet, often objects of one or more types cluster around objects of another type. Such arrangements are common in biomedical images of cells, in which nearby cell types likely interact. Quantifying spatial relationships may elucidate biological mech…

    Submitted 8 October, 2024; v1 submitted 8 February, 2022; originally announced February 2022.

    MSC Class: 62

  7. arXiv:2112.09813  [pdf, other]

    stat.ME

    A Bayesian hierarchical small-area population model accounting for data source specific methodologies from American Community Survey, Population Estimates Program, and Decennial Census data

    Authors: Emily N Peterson, Rachel C Nethery, Tullia Padellini, Jarvis T Chen, Brent A Coull, Frederic B Piel, Jon Wakefield, Marta Blangiardo, Lance A Waller

    Abstract: Small area estimates of population are necessary for many epidemiological studies, yet their quality and accuracy are often not assessed. In the United States, small area estimates of population counts are published by the United States Census Bureau (USCB) in the form of the Decennial census counts, Intercensal population projections (PEP), and American Community Survey (ACS) estimates. Although…

    Submitted 17 December, 2021; originally announced December 2021.

  8. Heterogeneous Distributed Lag Models to Estimate Personalized Effects of Maternal Exposures to Air Pollution

    Authors: Daniel Mork, Marianthi-Anna Kioumourtzoglou, Marc Weisskopf, Brent A Coull, Ander Wilson

    Abstract: Children's health studies support an association between maternal environmental exposures and children's birth outcomes. A common goal is to identify critical windows of susceptibility--periods during gestation with increased association between maternal exposures and a future outcome. The timing of the critical windows and magnitude of the associations are likely heterogeneous across different le…

    Submitted 30 June, 2023; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: 37 pages, 6 figures, 3 tables

    Journal ref: Journal of the American Statistical Association 2023

  9. arXiv:2101.05352  [pdf, other]

    stat.ME

    Bayesian Multiple Index Models for Environmental Mixtures

    Authors: Glen McGee, Ander Wilson, Thomas F. Webster, Brent A. Coull

    Abstract: An important goal of environmental health research is to assess the risk posed by mixtures of environmental exposures. Two popular classes of models for mixtures analyses are response-surface methods and exposure-index methods. Response-surface methods estimate high-dimensional surfaces and are thus highly flexible but difficult to interpret. In contrast, exposure-index methods decompose coefficie…

    Submitted 13 January, 2021; originally announced January 2021.

  10. arXiv:2006.07305  [pdf, other]

    stat.AP

    Reflection on modern methods: Good practices for applied statistical learning in epidemiology

    Authors: Yanelli Nunez, Elizabeth A. Gibson, Eva M. Tanner, Chris Gennings, Brent A. Coull, Jeff A. Goldsmith, Marianthi-Anna Kioumourtzoglou

    Abstract: Statistical learning (SL) includes methods that extract knowledge from complex data. SL methods beyond generalized linear models are being increasingly implemented in public health research and epidemiology because they can perform better in instances with complex or high-dimensional data---settings when traditional statistical methods fail. These novel methods, however, often include random sampl…

    Submitted 2 October, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 19 pages, 5 figures, 1 table. For associated code, visit https://github.com/yanellinunez/Commentary-to-mixture-methods-paper

  11. arXiv:1912.07359  [pdf, other]

    stat.AP stat.ME

    Function-on-Function Regression for the Identification of Epigenetic Regions Exhibiting Windows of Susceptibility to Environmental Exposures

    Authors: Michele Zemplenyi, Mark J. Meyer, Andres Cardenas, Marie-France Hivert, Sheryl L. Rifas-Shiman, Heike Gibson, Itai Kloog, Joel Schwartz, Emily Oken, Dawn L. DeMeo, Diane R. Gold, Brent A. Coull

    Abstract: The ability to identify time periods when individuals are most susceptible to exposures, as well as the biological mechanisms through which these exposures act, is of great public health interest. Growing evidence supports an association between prenatal exposure to air pollution and epigenetic marks, such as DNA methylation, but the timing and gene-specific effects of these epigenetic changes are…

    Submitted 13 December, 2019; originally announced December 2019.

    Comments: 20 pages, 10 figures

  12. arXiv:1910.07438  [pdf, other]

    stat.ME stat.AP

    On the Interplay Between Exposure Misclassification and Informative Cluster Size

    Authors: Glen McGee, Marianthi-Anna Kioumourtzoglou, Marc G. Weisskopf, Sebastien Haneuse, Brent A. Coull

    Abstract: In this paper we study the impact of exposure misclassification when cluster size is potentially informative (i.e., related to outcomes) and when misclassification is differential by cluster size. First, we show that misclassification in an exposure related to cluster size can induce informativeness when cluster size would otherwise be non-informative. Second, we show that misclassification that i…

    Submitted 16 October, 2019; originally announced October 2019.

  13. Bayesian Wavelet-packet Historical Functional Linear Models

    Authors: Mark J. Meyer, Elizabeth J. Malloy, Brent A. Coull

    Abstract: Historical Functional Linear Models (HFLM) quantify associations between a functional predictor and functional outcome where the predictor is an exposure variable that occurs before, or at least concurrently with, the outcome. Current work on the HFLM is largely limited to frequentist estimation techniques that employ spline-based basis representations. In this work, we propose a novel use of the…

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: Submitted for publication in JCGS

  14. arXiv:1904.12417  [pdf, other]

    stat.AP stat.ME

    Kernel Machine and Distributed Lag Models for Assessing Windows of Susceptibility to Environmental Mixtures in Children's Health Studies

    Authors: Ander Wilson, Hsiao-Hsien Leon Hsu, Yueh-Hsiu Mathilda Chiu, Robert O. Wright, Rosalind J. Wright, Brent A. Coull

    Abstract: Exposures to environmental chemicals during gestation can alter health status later in life. Most studies of maternal exposure to chemicals during pregnancy have focused on a single chemical exposure observed at high temporal resolution. Recent research has turned to focus on exposure to mixtures of multiple chemicals, generally observed at a single time point. We consider statistical methods for…

    Submitted 21 September, 2021; v1 submitted 28 April, 2019; originally announced April 2019.

    Journal ref: Ann. Appl. Stat. 16(2): 1090-1110 (June 2022)

  15. arXiv:1904.10918  [pdf, other]

    stat.AP

    A Cross-validated Ensemble Approach to Robust Hypothesis Testing of Continuous Nonlinear Interactions: Application to Nutrition-Environment Studies

    Authors: Jeremiah Zhe Liu, Jane Lee, Pi-i Debby Lin, Linda Valeri, David C. Christiani, David C. Bellinger, Robert O. Wright, Maitreyi M. Mazumdar, Brent A. Coull

    Abstract: Gene-environment and nutrition-environment studies often involve testing of high-dimensional interactions between two sets of variables, each having potentially complex nonlinear main effects on an outcome. Construction of a valid and powerful hypothesis test for such an interaction is challenging, due to the difficulty in constructing an efficient and unbiased estimator for the complex, nonlinear…

    Submitted 24 April, 2019; originally announced April 2019.

  16. arXiv:1904.00521  [pdf, other]

    stat.ME

    Adaptive Ensemble Learning of Spatiotemporal Processes with Calibrated Predictive Uncertainty: A Bayesian Nonparametric Approach

    Authors: Jeremiah Zhe Liu, John Paisley, Marianthi-Anna Kioumourtzoglou, Brent A. Coull

    Abstract: Ensemble learning is a mainstay in modern data science practice. Conventional ensemble algorithms assign to base models a set of deterministic, constant model weights that (1) do not fully account for individual models' varying accuracy across data subgroups, nor (2) provide uncertainty estimates for the ensemble prediction. These shortcomings can yield predictions that are precise but biased, whi…

    Submitted 31 March, 2019; originally announced April 2019.

  17. arXiv:1902.10613  [pdf, other]

    stat.ME stat.AP

    Bayesian data fusion for unmeasured confounding

    Authors: Leah Comment, Brent A. Coull, Corwin Zigler, Linda Valeri

    Abstract: Bayesian causal inference offers a principled approach to policy evaluation of proposed interventions on mediators or time-varying exposures. We outline a general approach to the estimation of causal quantities for settings with time-varying confounding, such as exposure-induced mediator-outcome confounders. We further extend this approach to propose two Bayesian data fusion (BDF) methods for unme…

    Submitted 27 February, 2019; originally announced February 2019.

  18. Ordinal Probit Functional Outcome Regression with Application to Computer-Use Behavior in Rhesus Monkeys

    Authors: Mark J. Meyer, Jeffrey S. Morris, Regina Paxton Gazes, Brent A. Coull

    Abstract: Research in functional regression has made great strides in expanding to non-Gaussian functional outcomes, but exploration of ordinal functional outcomes remains limited. Motivated by a study of computer-use behavior in rhesus macaques (Macaca mulatta), we introduce the Ordinal Probit Functional Outcome Regression model (OPFOR). OPFOR models can be fit using one of several basis functions includin…

    Submitted 18 March, 2021; v1 submitted 23 January, 2019; originally announced January 2019.

  19. arXiv:1812.03350  [pdf, other]

    cs.LG stat.ML

    Adaptive and Calibrated Ensemble Learning with Dependent Tail-free Process

    Authors: Jeremiah Zhe Liu, John Paisley, Marianthi-Anna Kioumourtzoglou, Brent A. Coull

    Abstract: Ensemble learning is a mainstay in modern data science practice. Conventional ensemble algorithms assign to base models a set of deterministic, constant model weights that (1) do not fully account for variations in base model accuracy across subgroups, nor (2) provide uncertainty estimates for the ensemble prediction, which could result in mis-calibrated (i.e. precise but biased) predictions that…

    Submitted 19 December, 2018; v1 submitted 8 December, 2018; originally announced December 2018.

    Comments: Work-in-progress manuscript appeared at Bayesian Nonparametrics Workshop, Neural Information Processing Systems 2018

  20. arXiv:1812.02829  [pdf, other]

    stat.AP stat.ME

    The Role of Body Mass Index at Diagnosis on Black-White Disparities in Colorectal Cancer Survival: A Density Regression Mediation Approach

    Authors: Katrina L. Devick, Linda Valeri, Jarvis Chen, Alejandro Jara, Marie-Abèle Bind, Brent A. Coull

    Abstract: The study of racial/ethnic inequalities in health is important to reduce the uneven burden of disease. In the case of colorectal cancer (CRC), disparities in survival among non-Hispanic Whites and Blacks are well documented, and mechanisms leading to these disparities need to be studied formally. It has also been established that body mass index (BMI) is a risk factor for developing CRC, and recen…

    Submitted 16 November, 2018; originally announced December 2018.

    Comments: 15 pages, 2 tables, 4 figures

  21. arXiv:1811.11025  [pdf, other]

    stat.CO stat.AP stat.ME

    CVEK: Robust Estimation and Testing for Nonlinear Effects using Kernel Machine Ensemble

    Authors: Wenying Deng, Jeremiah Zhe Liu, Erin Lake, Brent A. Coull

    Abstract: The R package CVEK introduces a suite of flexible machine learning models and robust hypothesis tests for learning the joint nonlinear effects of multiple covariates in limited samples. It implements the Cross-validated Ensemble of Kernels (CVEK) (Liu and Coull 2017), an ensemble-based kernel machine learning method that adaptively learns the joint nonlinear effect of multiple covariates from data,…

    Submitted 18 December, 2020; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 5 figures. arXiv admin note: text overlap with arXiv:1710.01406

  22. arXiv:1811.10453  [pdf, other]

    stat.ME

    Bayesian kernel machine regression-causal mediation analysis

    Authors: Katrina L. Devick, Jennifer F. Bobb, Maitreyi Mazumdar, Birgit Claus Henn, David C. Bellinger, David C. Christiani, Robert O. Wright, Paige L. Williams, Brent A. Coull, Linda Valeri

    Abstract: Greater understanding of the pathways through which an environmental mixture operates is important to design effective interventions. We present new methodology to estimate natural direct and indirect effects and controlled direct effects of a complex mixture exposure on an outcome through a mediator variable. We implement Bayesian Kernel Machine Regression (BKMR) to allow for all possible interac…

    Submitted 21 December, 2021; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 22 pages, 12 figures, and 1 table

  23. arXiv:1811.02609  [pdf, other]

    stat.CO

    A Variational Inference Algorithm for BKMR in the Cross-Sectional Setting

    Authors: Raphael Small, Brent A. Coull

    Abstract: The identification of pollutant effects is an important task in environmental health. Bayesian kernel machine regression (BKMR) is a standard tool for inference of individual-level pollutant health-effects, and we present a mean field Variational Inference (VI) algorithm for quick inference when only a single response per individual is recorded. Using simulation studies in the case of informative…

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: 23 pages, 5 figures

  24. arXiv:1711.11239  [pdf, other]

    stat.ME

    Estimating the health effects of environmental mixtures using Bayesian semiparametric regression and sparsity inducing priors

    Authors: Joseph Antonelli, Maitreyi Mazumdar, David Bellinger, David C. Christiani, Robert Wright, Brent A. Coull

    Abstract: Humans are routinely exposed to mixtures of chemical and other environmental factors, making the quantification of health effects associated with environmental mixtures a critical goal for establishing environmental policy sufficiently protective of human health. The quantification of the effects of exposure to an environmental mixture poses several statistical challenges. It is often the case tha…

    Submitted 29 October, 2019; v1 submitted 30 November, 2017; originally announced November 2017.

  25. arXiv:1711.00157  [pdf, ps, other]

    stat.AP

    Bayesian Variable Selection for Multivariate Zero-Inflated Models: Application to Microbiome Count Data

    Authors: Kyu Ha Lee, Brent A. Coull, Anna-Barbara Moscicki, Bruce J. Paster, Jacqueline R. Starr

    Abstract: Microorganisms play critical roles in human health and disease. It is well known that microbes live in diverse communities in which they interact synergistically or antagonistically. Thus for estimating microbial associations with clinical covariates, multivariate statistical models are preferred. Multivariate models allow one to estimate and exploit complex interdependencies among multiple taxa,…

    Submitted 20 May, 2018; v1 submitted 31 October, 2017; originally announced November 2017.

  26. Bayesian Distributed Lag Interaction Models to Identify Perinatal Windows of Vulnerability in Children's Health

    Authors: Ander Wilson, Yueh-Hsiu Mathilda Chiu, Hsiao-Hsien Leon Hsu, Robert O. Wright, Rosalind J. Wright, Brent A. Coull

    Abstract: Epidemiological research supports an association between maternal exposure to air pollution during pregnancy and adverse children's health outcomes. Advances in exposure assessment and statistics allow for estimation of both critical windows of vulnerability and exposure effect heterogeneity. Simultaneous estimation of windows of vulnerability and effect heterogeneity can be accomplished by fittin…

    Submitted 17 December, 2016; originally announced December 2016.

    Journal ref: Biostatistics 2017

  27. General Design Bayesian Generalized Linear Mixed Models

    Authors: Y. Zhao, J. Staudenmayer, B. A. Coull, M. P. Wand

    Abstract: Linear mixed models are able to handle an extraordinary range of complications in regression-type analyses. Their most common use is to account for within-subject correlation in longitudinal data analysis. They are also the standard vehicle for smoothing spatial count data. However, when treated in full generality, mixed models can also handle spline-type smoothing and closely approximate krigin…

    Submitted 20 June, 2006; originally announced June 2006.

    Comments: Published at http://dx.doi.org/10.1214/088342306000000015 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS157

    Journal ref: Statistical Science 2006, Vol. 21, No. 1, 35-51