-
An Interpretable Machine Learning Model with Deep Learning-based Imaging Biomarkers for Diagnosis of Alzheimer's Disease
Authors:
Wenjie Kang,
Bo Li,
Janne M. Papma,
Lize C. Jiskoot,
Peter Paul De Deyn,
Geert Jan Biessels,
Jurgen A. H. R. Claassen,
Huub A. M. Middelkoop,
Wiesje M. van der Flier,
Inez H. G. B. Ramakers,
Stefan Klein,
Esther E. Bron
Abstract:
Machine learning methods have shown large potential for the automatic early diagnosis of Alzheimer's Disease (AD). However, some machine learning methods based on imaging data have poor interpretability because it is usually unclear how they make their decisions. Explainable Boosting Machines (EBMs) are interpretable machine learning models based on the statistical framework of generalized additiv…
▽ More
Machine learning methods have shown large potential for the automatic early diagnosis of Alzheimer's Disease (AD). However, some machine learning methods based on imaging data have poor interpretability because it is usually unclear how they make their decisions. Explainable Boosting Machines (EBMs) are interpretable machine learning models based on the statistical framework of generalized additive modeling, but have so far only been used for tabular data. Therefore, we propose a framework that combines the strength of EBM with high-dimensional imaging data using deep learning-based feature extraction. The proposed framework is interpretable because it provides the importance of each feature. We validated the proposed framework on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, achieving accuracy of 0.883 and area-under-the-curve (AUC) of 0.970 on AD and control classification. Furthermore, we validated the proposed framework on an external testing set, achieving accuracy of 0.778 and AUC of 0.887 on AD and subjective cognitive decline (SCD) classification. The proposed framework significantly outperformed an EBM model using volume biomarkers instead of deep learning-based features, as well as an end-to-end convolutional neural network (CNN) with optimized architecture.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
Prior-knowledge-informed deep learning for lacune detection and quantification using multi-site brain MRI
Authors:
Bo Li,
Jeroen de Bresser,
Wiro Niessen,
Matthias van Osch,
Wiesje M. van der Flier,
Geert Jan Biessels,
Meike W. Vernooij,
Esther Bron
Abstract:
Lacunes of presumed vascular origin, also referred to as lacunar infarcts, are important to assess cerebral small vessel disease and cognitive diseases such as dementia. However, visual rating of lacunes from imaging data is challenging, time-consuming, and rater-dependent, owing to their small size, sparsity, and mimics. Whereas recent developments in automatic algorithms have shown to make the d…
▽ More
Lacunes of presumed vascular origin, also referred to as lacunar infarcts, are important to assess cerebral small vessel disease and cognitive diseases such as dementia. However, visual rating of lacunes from imaging data is challenging, time-consuming, and rater-dependent, owing to their small size, sparsity, and mimics. Whereas recent developments in automatic algorithms have shown to make the detection of lacunes faster while preserving sensitivity, they also showed a large number of false positives, which makes them impractical for use in clinical practice or large-scale studies. Here, we develop a novel framework that, in addition to lacune detection, outputs a categorical burden score. This score could provide a more practical estimate of lacune presence that simplifies and effectively accelerates the imaging assessment of lacunes. We hypothesize that the combination of detection and the categorical score makes the procedure less sensitive to noisy labels.
△ Less
Submitted 18 June, 2023;
originally announced June 2023.
-
Cross-Cohort Generalizability of Deep and Conventional Machine Learning for MRI-based Diagnosis and Prediction of Alzheimer's Disease
Authors:
Esther E. Bron,
Stefan Klein,
Janne M. Papma,
Lize C. Jiskoot,
Vikram Venkatraghavan,
Jara Linders,
Pauline Aalten,
Peter Paul De Deyn,
Geert Jan Biessels,
Jurgen A. H. R. Claassen,
Huub A. M. Middelkoop,
Marion Smits,
Wiro J. Niessen,
John C. van Swieten,
Wiesje M. van der Flier,
Inez H. G. B. Ramakers,
Aad van der Lugt
Abstract:
This work validates the generalizability of MRI-based classification of Alzheimer's disease (AD) patients and controls (CN) to an external data set and to the task of prediction of conversion to AD in individuals with mild cognitive impairment (MCI). We used a conventional support vector machine (SVM) and a deep convolutional neural network (CNN) approach based on structural MRI scans that underwe…
▽ More
This work validates the generalizability of MRI-based classification of Alzheimer's disease (AD) patients and controls (CN) to an external data set and to the task of prediction of conversion to AD in individuals with mild cognitive impairment (MCI). We used a conventional support vector machine (SVM) and a deep convolutional neural network (CNN) approach based on structural MRI scans that underwent either minimal pre-processing or more extensive pre-processing into modulated gray matter (GM) maps. Classifiers were optimized and evaluated using cross-validation in the ADNI (334 AD, 520 CN). Trained classifiers were subsequently applied to predict conversion to AD in ADNI MCI patients (231 converters, 628 non-converters) and in the independent Health-RI Parelsnoer data set. From this multi-center study representing a tertiary memory clinic population, we included 199 AD patients, 139 participants with subjective cognitive decline, 48 MCI patients converting to dementia, and 91 MCI patients who did not convert to dementia. AD-CN classification based on modulated GM maps resulted in a similar AUC for SVM (0.940) and CNN (0.933). Application to conversion prediction in MCI yielded significantly higher performance for SVM (0.756) than for CNN (0.742). In external validation, performance was slightly decreased. For AD-CN, it again gave similar AUCs for SVM (0.896) and CNN (0.876). For prediction in MCI, performances decreased for both SVM (0.665) and CNN (0.702). Both with SVM and CNN, classification based on modulated GM maps significantly outperformed classification based on minimally processed images. Deep and conventional classifiers performed equally well for AD classification and their performance decreased only slightly when applied to the external cohort. We expect that this work on external validation contributes towards translation of machine learning to clinical practice.
△ Less
Submitted 26 May, 2021; v1 submitted 16 December, 2020;
originally announced December 2020.
-
The influence of diversity on the measurement of functional impairment: An international validation of the Amsterdam IADL Questionnaire in 8 countries
Authors:
Mark A. Dubbelman,
Merike Verrijp,
David Facal,
Gonzalo Sánchez-Benavides,
Laura J. E. Brown,
Wiesje M. van der Flier,
Hanna Jokinen,
Athene Lee,
Iracema Leroi,
Cristina Lojo-Seoane,
Vuk Milosevic,
José Luís Molinuevo,
Arturo X. Pereiro Rozas,
Craig Ritchie,
Stephen Salloway,
Gemma Stringer,
Stelios Zygouris,
Bruno Dubois,
Stéphane Epelbaum,
Philip Scheltens,
Sietske A. M. Sikkes
Abstract:
INTRODUCTION: To understand the potential influence of diversity on the measurement of functional impairment in dementia, we aimed to investigate possible bias caused by age, gender, education, and cultural differences. METHODS: 3,571 individuals (67.1 {\pm} 9.5 years old, 44.7% female) from the Netherlands, Spain, France, United States, United Kingdom, Greece, Serbia and Finland were included. Fu…
▽ More
INTRODUCTION: To understand the potential influence of diversity on the measurement of functional impairment in dementia, we aimed to investigate possible bias caused by age, gender, education, and cultural differences. METHODS: 3,571 individuals (67.1 {\pm} 9.5 years old, 44.7% female) from the Netherlands, Spain, France, United States, United Kingdom, Greece, Serbia and Finland were included. Functional impairment was measured using the Amsterdam IADL Questionnaire. Item bias was assessed using differential item functioning (DIF) analysis. RESULTS: There were some differences in activity endorsement. A few items showed statistically significant DIF. However, there was no evidence of meaningful item bias: effect sizes were low (ΔR2 range 0-0.03). Impact on total scores was minimal. DISCUSSION: The results imply a limited bias for age, gender, education and culture in the measurement of functional impairment. This study provides an important step in recognizing the potential influence of diversity on primary outcomes in dementia research.
△ Less
Submitted 29 January, 2020; v1 submitted 4 September, 2019;
originally announced September 2019.
-
Blood-based metabolic signatures in Alzheimer's disease
Authors:
Francisca A. de Leeuw,
Carel F. W. Peeters,
Maartje I. Kester,
Amy C. Harms,
Eduard A. Struys,
Thomas Hankemeier,
Herman W. T. van Vlijmen,
Sven J. van der Lee,
Cornelia M. van Duijn,
Philip Scheltens,
Ayşe Demirkan,
Mark A. van de Wiel,
Wiesje M. van der Flier,
Charlotte E. Teunissen
Abstract:
Introduction: Identification of blood-based metabolic changes might provide early and easy-to-obtain biomarkers.
Methods: We included 127 AD patients and 121 controls with CSF-biomarker-confirmed diagnosis (cut-off tau/A$β_{42}$: 0.52). Mass spectrometry platforms determined the concentrations of 53 amine, 22 organic acid, 120 lipid, and 40 oxidative stress compounds. Multiple signatures were as…
▽ More
Introduction: Identification of blood-based metabolic changes might provide early and easy-to-obtain biomarkers.
Methods: We included 127 AD patients and 121 controls with CSF-biomarker-confirmed diagnosis (cut-off tau/A$β_{42}$: 0.52). Mass spectrometry platforms determined the concentrations of 53 amine, 22 organic acid, 120 lipid, and 40 oxidative stress compounds. Multiple signatures were assessed: differential expression (nested linear models), classification (logistic regression), and regulatory (network extraction).
Results: Twenty-six metabolites were differentially expressed. Metabolites improved the classification performance of clinical variables from 74% to 79%. Network models identified 5 hubs of metabolic dysregulation: Tyrosine, glycylglycine, glutamine, lysophosphatic acid C18:2 and platelet activating factor C16:0. The metabolite network for APOE $ε$4 negative AD patients was less cohesive compared to the network for APOE $ε$4 positive AD patients.
Discussion: Multiple signatures point to various promising peripheral markers for further validation. The network differences in AD patients according to APOE genotype may reflect different pathways to AD.
△ Less
Submitted 21 September, 2017;
originally announced September 2017.