Quantifying brain atrophy in Frontotemporal Dementia: a head-to-head comparison of neuroimaging techniques
Abstract
Frontotemporal Dementia (FTD) is a neurodegenerative disorder characterized by extensive atrophy in the frontal and temporal lobes of the brain as well as high cerebrovascular burden. While anatomical Magnetic Resonance Imaging (MRI) is well established for quantifying brain atrophy in FTD, the variability in (pre-)processing methods limits the generalizability and comparability of findings. This study systematically compared the robustness and sensitivity of multiple widely used neuroimaging metrics, namely Deformation-Based Morphometry (DBM), Voxel-Based Morphometry (VBM), Cortical Thickness (CT), and segmentation-based grey matter Volumes, in detecting atrophy across FTD subtypes. We processed 732 T1-weighted MRI scans from 156 participants with FTD and 139 healthy controls from the Frontotemporal Lobar Degeneration Neuroimaging Initiative using our in-house pipeline PELICAN (Dadar et al., 2025) for volumetric measures and FreeSurfer version 7 (Fischl, 2012) for CT and grey matter segmentations. Visual quality control using consistent quality control images at each step of the pipelines revealed significantly higher failure rates for CT (38.52%) and FreeSurfer segmentations (23.63%) relative to PELICAN’s volumetric measures (2.04% DBM, 3.05% VBM). Failure rates differed between FTD subtypes and were related to pathological burden. Particularly for FreeSurfer, errors occurred predominantly in regions with high prevalence of atrophy and White Matter Hyperintensities. In PELICAN, the addition of a FTD-specific template as an intermediate step during nonlinear registration decreased the failure rates in this step in the FTD population. We then applied linear regression models to assess each metric’s sensitivity in detecting cross-sectional differences between FTD groups controls as well as linear mixed-effects models to determine which method is most sensitive to longitudinal anatomical changes. While CT yielded effect sizes comparable to VBM and DBM when analyzing the same subset of successfully processed scans, VBM and DBM demonstrated enhanced power to detect effects due to lower failure rates and higher participant retention in the full sample. Overall, we demonstrate that image processing methodology and pipeline selection profoundly influences effect sizes and statistical power to detect meaningful between-group differences or longitudinal changes. Volumetric measures (DBM and VBM) yielded sufficiently robust pipeline outcomes to maintain adequate statistical power for capturing atrophy patterns after quality control procedures.
Related articles
Related articles are currently not available for this article.