For the recruitment and monitoring of subjects for therapy studies, it is important to predict whether mild cognitive impaired (MCI) subjects will prospectively develop Alzheimer's disease (AD). Machine learning (ML) is suitable to improve early AD prediction. The etiology of AD is heterogeneous, which leads to high variability in disease patterns. Further variability originates from multicentric study designs, varying acquisition protocols, and errors in the preprocessing of magnetic resonance imaging (MRI) scans. The high variability makes the differentiation between signal and noise difficult and may lead to overfitting. This article examines whether an automatic and fair data valuation method based on Shapley values can identify the most informative subjects to improve ML classification.
Pubmed ID: 34526114 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Open source software suite for processing and analyzing human brain MRI images. Used for reconstruction of brain cortical surface from structural MRI data, and overlay of functional MRI data onto reconstructed surface. Contains automatic structural imaging stream for processing cross sectional and longitudinal data. Provides anatomical analysis tools, including: representation of cortical surface between white and gray matter, representation of the pial surface, segmentation of white matter from rest of brain, skull stripping, B1 bias field correction, nonlinear registration of cortical surface of individual with stereotaxic atlas, labeling of regions of cortical surface, statistical analysis of group morphometry differences, and labeling of subcortical brain structures.Operating System: Linux, macOS.
View all literature mentionsDatabase of the results of the ADNI study. ADNI is an initiative to develop biomarker-based methods to detect and track the progression of Alzheimer's disease (AD) that provides access to qualified scientists to their database of imaging, clinical, genomic, and biomarker data.
View all literature mentionsSoftware tool as unified framework for interpreting predictions of machine learning models. Used to explain output of any machine learning model. Connects optimal credit allocation with local explanations using classic Shapley values from game theory and their related extensions.
View all literature mentions