Abstract
In this study we assessed the accuracy of dual X-ray absorptiometry (DXA) compared to magnetic resonance imaging (MRI) for evaluating body composition. Using data from 32,961 participants in the UK Biobank, including 1,928 re-scanned participants after about two and a half years, we examined cross-sectional and longitudinal agreements in DXA and MRI measurements within android and gynoid regions. Our results showed that DXA reliably captured fat measurements but overestimated lean mass compared to MRI, particularly in android regions for men (4.10 vs. 1.74 kg) and women (2.92 vs. 1.10 kg). Longitudinal MRI data revealed a 4-5% muscle and lean mass decrease, undetected by DXA, which showed lean mass increases in women at the follow-up visit. Although DXA is practical for population-level fat assessments, MRI remains the preferred method for detailed and precise longitudinal body composition analysis.
Introduction
Detailed body composition assessment, including the measurement of adipose tissue and muscle content and distribution, provides a valuable role for understanding trends in health and disease prevalence in large-scale population studies. They allow the development of targeted interventions to promote healthy ageing and mitigate the impact of conditions associated with an adverse body composition1,2. Additionally, these measurements are crucial for assessing the effects of lifestyle factors, such as diet and exercise, on disease mitigation and overall well-being. Longitudinal body composition monitoring is particularly significant in ageing populations since fat distribution and muscle mass changes are associated with sarcopenia and reduced functional capacity3.
Many methodologies have been developed to measure body composition, from indirect and accessible techniques such as body mass index (BMI), waist-to-hip ratio or skinfold anthropometry, to more direct but more expensive and time-consuming imaging-based methods including dual X-ray absorptiometry (DXA), computed tomography (CT) and magnetic resonance imaging (MRI) which have the advantage of assessing the distribution of adipose tissue and lean mass (or muscle).
DXA, initially developed for the assessment of bone mineral density (BMD), using low-energy x-ray acquisitions, was subsequently upgraded for the automatic derivation of overall adipose tissue and lean mass, as well as from relevant anatomical areas, including the android and gynoid regions4,5. Despite the planar nature of the acquired images, DXA provides estimates for the volume or mass of the body in three dimensions, offering a more comprehensive understanding of body composition compared with indirect methods. DXA is characterised by rapid processing, low levels of ionising radiation and relatively low cost, making it widely used for individuals and large-scale population studies. However, DXA has limitations, unlike CT and MRI, it cannot directly measure specific individual adipose tissue depots. DXA also provides an estimate of “lean tissue”, but it is important to note that so-called lean tissue includes other non-fat tissues besides muscle. Therefore, beyond appendicular measures DXA may not offer specificity for muscle tissue. Despite these limitations, DXA remains a widely used tool in body composition assessments, offering a balance of accuracy and practicality for various clinical and research applications6.
In contrast, MRI is generally regarded as the gold standard for assessing body composition. Its non-invasive and non-ionising three-dimensional approach allows volumetric assessment of organs, adipose tissue, and muscles, from the largest tissues down to the smallest organ substructures with millimetre precision. While MRI provides unparalleled detail and accuracy, its widespread use has been limited by cost, accessibility, and the time and expertise required for image acquisition and quantitative analysis. A trade-off between DXA and MRI depends on the specific research or clinical goals, balancing factors such as precision, cost-effectiveness, and practicality in the given context.
Although DXA is widely accepted for assessing body composition,7, and most studies demonstrate strong and significant correlations between MRI and DXA measurements8,9, discrepancies between the two methods have frequently been noted. DXA is reported to overestimate subcutaneous adipose tissue (SAT)10 and underestimate visceral adipose tissue compared to MRI, especially in individuals with low levels of visceral adipose tissue (VAT)10. Interestingly, Neeland et al noted that in addition to underestimating VAT at low VAT levels, DXA also modestly overestimated VAT at higher VAT levels. Similarly, although good correlations have been reported between muscle mass measurements by DXA vs MRI and CT10–12, the agreement between measurements is poor, with DXA overestimating lean or muscle tissue12,13,14. A potential reason for the lack of agreement may relate to the few direct comparisons using the DXA manufacturer’s predefined regions of interest against MRI equivalents.
Similar issues have been observed comparing DXA against MRI for measuring body composition in longitudinal studies. Here a confounding factor is the sample size of longitudinal comparison studies, which tend to be much smaller than those used in some cross-sectional studies. Studies focussed on body fat content in adults and children have demonstrated that DXA substantially underestimated longitudinal changes in VAT15, with no association between longitudinal changes in DXA vs MRI-VAT16. For lean mass and muscle, some studies in adults have reported comparable age-related loss of muscle mass by DXA and MRI13, while others have suggested that compared with MRI DXA is only sensitive enough to measure large changes in lean mass17. Similarly, studies in children are conflicting; Bridge et al reported age-related changes in both fat and muscle mass to be similar in children for both MRI and DXA18, while Lee and Kuk reported that DXA overestimated fat loss and muscle gain compared with MRI in a weight loss intervention (N=39)19.
To overcome these discrepancies approaches including deriving equations to convert DXA values to MRI equivalents20, or constructing linear regression models21 have been proposed. Whether these strategies would enhance agreement between methods in longitudinal studies remains uncertain. Thus, a more comprehensive study comparing DXA and MRI, using anatomically equivalent anatomical regions and a larger cohort of subjects, is clearly needed.
This study aimed to leverage the extensive UK Biobank data to evaluate the agreement between DXA and MRI measurements of adipose tissue, lean mass, and muscle mass, using comparable DXA-defined android and gynoid regions in the MRI analyses. Additionally, the longitudinal data from this large cohort allowed us to assess the effectiveness of DXA, relative to MRI, in tracking changes in body composition over time.
Methods
Image Acquisitions
MRI scans were acquired using a Siemens Aera 1.5T scanner (Syngo MR D13) (Siemens, Erlangen, Germany), with full details of the UK Biobank abdominal MRI protocol reported previously22. Our analysis used the neck-to-knee chemical-shift encoded MRI, commonly known as Dixon MRI. All data were processed and segmented using automated methods23, with the outputs arising from this work already returned and available from the UK Biobank (Category 158).
DXA scans were acquired using a GE iDXA (GE-Lunar, Madison, WI). Full details of the UK Biobank DXA protocol have been previously reported22. Fields available from the UK Biobank DXA scanner outputs include: category 124, “Body composition by DXA”1, specifically: Android fat mass (field 23245), Android lean mass (23246), Android tissue fat percentage (23247), Visceral adipose tissue volume (23289), Gynoid fat mass (23262), Gynoid lean mass (23263), and Gynoid tissue fat percentage (23264). The only DXA reported volume (in litres) is for VAT, all other measurements are provided as a percentage or mass (kg).
Study Design and Cohort
DXA and MRI scans were acquired from every participant on the same day with no more than 2-4 hours between scans. Longitudinal data was acquired from a subcohort (n=3,088) after an average time interval of 2.3 ± 0.1 years, utilising the same scanners and protocols for each imaging modality.
Dixon MRI data for 41,707 participants that was acquired at the baseline imaging visit between 2014 and 2020 with data comprising imaging, health-related diagnoses and biological measurements were available. Participants with missing body composition measurements by DXA were excluded from the study. More specifically, android region measurements for 10,815 participants were missing VAT volume measured by DXA from the baseline imaging visit and for 1,011 participants from the follow-up imaging visit. For the gynoid region, 10,674 participants were missing lean and fat mass measured by DXA from the baseline imaging visit and 978 participants from the follow-up imaging visit. Out of 3,091 UK Biobank participants who underwent a follow-up imaging visit24, 2,906 participants had complete imaging data at both visits and were included in the study. Overall, after these exclusions, a dataset of 32,787 (30,892 from baseline imaging visit and 1,895 from follow-up imaging visit) and 32,961 (31,033 from baseline imaging visit and 1,928 from follow-up imaging visit) participants with android and gynoid measurements, respectively, were included in the final analysis.
Data Analysis
To generate a like-for-like comparison between DXA and MRI measurements, we reproduced, on the MRI scans, the DXA regions as defined in the software user manual2, which states that “the android region of interest (ROI) has its lower boundary at the ‘pelvis cut’ and its upper boundary to lie at 20% of the distance between the pelvis and the ‘neck cut’. The gynoid ROI is to be twice the height of the android ROI, and the upper boundary to be below the pelvis cut line, 1.5 times the height of the android ROI”. Therefore, we defined the android and gynoid regions in the MRI utilising the same anatomical landmarks (Figure 1), with the segmented pelvic bones used as the “pelvic cut” and the top of the MRI acquisition as the “neck cut”23.
Given the ability of MRI to differentiate between various tissues within any anatomical area of interest, we categorised “lean tissue” in the DXA-android region to include all non-fat tissues, such as muscle, kidney, and liver, as assessed by MRI. In the gynoid region, lean tissue was defined as all non-adipose and non-bone marrow tissues. To gain further insight, we conducted a detailed comparison between the “lean tissue” identified by DXA and the specific muscle volume measured by MRI. This comparison aimed to highlight any discrepancies between the broader classification of lean tissue in DXA and the more precise muscle measurements provided by MRI, helping us better understand the gradations between these two modalities in body composition assessment.
For the DXA measures that are given in mass, we converted MRI volumes to mass, using density values. Given the lack of consensus in the literature regarding the density of muscle vs lean body mass, we used 1.06g/ml for both25–27, 1.04g/ml for kidney28,29, and 1.05g/ml for liver30,31. Similarly, we took into account the fact that adipose tissue volume measured by MRI and fat mass measured by DXA are not equivalent. Indeed, it is accepted that around 80% of adipose tissue is fat32, and we adjusted adipose tissue volume measurements accordingly. Adipose tissue volume measured by MRI was adjusted to DXA fat mass using a fat density of 0.92g/ml33,34.
Statistical Analysis
Descriptive characteristics are presented as means and standard deviations (SD) using the R software environment35. The Wilcoxon rank-sum test was used to compare means between groups, and the Wilcoxon signed-rank test was used to compare groups with paired observations. A linear regression model was fit to extract the regression equation between DXA and MRI measures. After counting the hypothesis tests, the Bonferroni-corrected threshold for statistical significance was 0.05/53 = 0.00094.
Results
Baseline characteristics and summary statistics for the whole cohort are provided in supplementary Table S1. Among the 31,033 participants who attended the baseline imaging visit, 96.2% were White, and 47.5% were male. The participants had an average age of 64.1 ± 7.8 years and BMI of 26.5 ± 4.4 kg/m2.
Table 1 provides measurements obtained from DXA and MRI in the android and gynoid regions at the baseline imaging visit. These measurements include muscle mass, lean mass, fat tissue mass, and body fat percentage, separated by sex. The VAT volume was further assessed in the android region. Supplementary figures S1-S7, show the relationship between MRI and DXA measures for all participants at the baseline imaging visit.
The results show that DXA overestimates lean mass in the android region compared to MRI by approximately 1.95kg in men and 1.43kg in women (Table 1). Similarly, in the gynoid region, DXA overestimates MRI lean mass by 2.22kg in men and 1.84kg in women. The percent variance in DXA fat mass measures explained by the MRI fat mass measures (or R2 value) was 95.1% in men and 95.9% in women. Similar results were found for gynoid fat mass measurements. VAT also showed a good fit with R2 of 0.9 in both men and women. Similar results were observed in the follow-up imaging visit (Supplementary Table S2).
Table 2 provides a comparison of the longitudinal observations for MRI and DXA measurements, captured approximately 2.3 ± 0.1 years apart between the baseline and follow-up imaging visits. We observed a reduction in muscle and lean mass as measured by MRI in both android and gynoid regions for both sexes, with changes ranging from -4.5% to -3.9% in the angroid region and -5.2% to -4.1% in the gynoid region. In contrast, DXA lean mass in women showed a significant increase of 2.6%. This could be due to differences in anatomy captured within the android region caused by variations in breath-hold during imaging. Figure S8 shows an example of lean mass as measured by MRI in the android region including abdominal organs and muscle, at baseline and follow-up imaging for one participant, illustrating a 12.4% decrease in lean tissue when all organs are included, but only a 1% decrease when the liver and kidneys are omitted. No further significant changes were detected in lean mass measurements from DXA. We found no significant changes in any other tissue measurements from MRI or DXA.
Discussion
In this study, we compared MRI measurements of muscle, lean and adipose tissue mass, and body fat percentage in the android and gynoid regions—areas typically assessed in DXA scans—to DXA measurement estimates in these regions. We assessed the linear relationship between the MRI and DXA-derived body composition measurements in the android and gynoid regions, and we assessed the longitudinal changes in body composition measures by DXA and MRI during follow-up imaging visits. Our primary aim was to evaluate the agreement between MRI and DXA measurements in anatomically equivalent regions and to determine their precision in detecting longitudinal changes.
Android and gynoid regions are constructs specifically defined in DXA scans. Our study replicated these constructs in our MRI measurements to allow a direct comparison between these two imaging methods. DXA showed a good agreement against MRI for adipose tissue measures, with a strong correlation between MRI and DXA fat mass measures, aligning with previous literature36. However, in our cross-sectional analysis we found that DXA overestimated lean mass when compared to MRI in both android and gynoid regions. Although previous studies comparing MRI-derived muscle against DXA-derived lean mass, found a strong correlation between the two17,37,38, others comparing CT and DXA, have reported that DXA overestimates muscle, with larger differences in subjects with higher body weights, suggesting DXA may be less accurate in that population7.
Longitudinal assessment of body composition is vitally important, particularly in ageing, where a decline in skeletal muscle mass, and loss of muscle strength39 contribute to sarcopenia and progression toward frailty40. We found that over the relatively short period of 2.5 years, both muscle and lean mass, as measured by MRI, decreased by 4% to 5% in both android and gynoid regions. This decrease in both muscle and lean mass after two years, likely reflects both a loss of muscle and an increase in intramuscular fat content in that period, both characteristic of the ageing process. The apparent lack of change observed by DXA may reflect its inability to differentiate between these compartments. Previous longitudinal studies in older adults have reported that MRI and DXA detected a similar percentage loss of thigh muscle mass over a five-year period13, with similar agreement reported for changes in leg skeletal muscle mass following two years exercise training38. However, several interventional studies have suggested that compared with MRI and other gold standard methods, DXA is unable to accurately detect longitudinal changes in lean muscle17,41. This inability is clearly reflected in the current longitudinal study.
Unexpectedly, our study found a significant increase in DXA-measured lean mass in the android region among women at follow-up. The reason for this increase, or why it was observed only in women, remains unclear. Prior research has shown that DXA measurements of lean soft tissue mass in the trunk can differ from MRI measurements of muscle in the same region38. This difference may result from DXA’s limited capacity to distinguish muscle from other lean tissues with differing proteins, water, non-fat lipids, carbohydrates, and soft tissue minerals composition6. Additionally, changes in fluid retention could further complicate these measurements, potentially contributing to the inconsistencies observed.
Our study was not without limitations, particularly regarding the variability in the neck placement for the MRI scan, which is essential for our android ROI height definition. However, the impact of this variation is mitigated somewhat since the android ROI height is set only to be one-fifth of the distance between the neck and the pelvic cut. For longitudinal datasets, we fixed the height of the ROIs to match baseline images in order to remove any variability from neck positioning. Furthermore, although DXA appendicular lean mass is a more commonly used measure of muscle42, we are unable to directly compare these measures in our study since the UK Biobank neck-to-knee MRI acquisition does not include the arms and lower legs. Finally, the interval between baseline and follow-up measurements in the longitudinal aspect of this study was only two and a half years, and the changes detected by MRI although consistent were modest24. Longer intervals between scans or interventions designed to produce significant weight change may result in more significant changes in lean mass which may be more reliably detected by DXA.
Conclusion
Our findings in this large-scale study underscore the limitations and strengths of DXA for body composition analysis. While DXA provides reliable cross-sectional estimates for both fat and VAT, it overestimates lean mass and has considerable limitations for the assessment of longitudinal changes; an essential consideration for studies into ageing. The more detailed body composition assessments and precise measurements from MRI are therefore more appropriate for understanding health in longitudinal population studies.
Author Contributions
J.D.B., E.L.T., M.T and N.B. conceived the study. All authors designed the study. M.T., B.W. and N.B implemented the methods. N.B. performed the data analysis and M.T. performed the statistical analysis. All authors drafted the manuscript. All authors read and approved the manuscript.
Declarations
Participant data from the UK Biobank cohort was obtained through UK Biobank Access Application number 44584. The UK Biobank has approval from the North West Multi-Centre Research Ethics Committee (REC reference: 11/NW/0382), with informed consent obtained from all participants. Researchers may apply to use the UKBB data resource by submitting a health-related research proposal in the public interest. Additional information may be found on the UK Biobank researchers and resource catalogue pages (http://www.ukbiobank.ac.uk). All methods were performed in accordance with the relevant guidelines and regulations as presented by the appropriate authorities, including the Declaration of Helsinki.
Data Availability
Our research was conducted using UK Biobank data. Under the standard UK Biobank data sharing agreement, we (and other researchers) cannot directly share raw data obtained or derived from the UK Biobank. However, under this agreement, all of the data generated, and methodologies used in this paper are returned by us to the UK Biobank, where they will be fully available. Access is obtained directly from the UK Biobank to all bona fide researchers upon submitting a health-related research proposal to the UK Biobank https://www.ukbiobank.ac.uk.
Acknowledgements
This study was carried out using UK Biobank Application number 44584, and we thank the participants in the UK Biobank imaging study. This study was funded by Calico Life Sciences LLC.
Footnotes
2 X-ray bone densitometer with enCORE v17 software user manual https://www.gehealthcare.com/support/manuals