Abstract
Objective This study investigated the association between body composition and pulmonary nodule malignancy and growth.
Methods A dataset of subjects with indeterminate pulmonary nodules (IPNs) was created from an internal (n=216) and external (n=162) cohort. Five different body tissues were automatically segmented and quantified from baseline and follow-up chest low-dose computed tomography (LDCT) scans using artificial intelligence (AI) algorithms. Logistic Regression (LR) analyses, t-tests, and Person correlation analyses were performed to study the association between body tissues and nodule malignancy, as well as nodule changes such as density, size, and shape. Gender differences were investigated. The area under the receiver operating characteristic curve (ROC-AUC) was used to assess classifier performance. Average feature importance was evaluated using several machine learning models. Causal relationships were analyzed and visualized using a novel directed graph method.
Results Univariate analysis revealed a significant association between Skeletal muscle density and nodule malignancy in both genders (p<0.001). The multivariate model based on body composition yielded AUCs of 0.77 (95% CI: 0.71 – 0.84) and 0.63 (95% CI: 0.54 – 0.72) on the internal and external datasets, respectively. The composite model based on body composition and nodule features yielded AUCs of 0.87 (95% CI: 0.82 – 0.91) and 0.62 (95% CI: 0.53 – 0.72) on the internal and external datasets, respectively. Skeletal muscle and intermuscular adipose tissue features were highly ranked among tissue features, with skeletal muscle density retaining its highest rank even after adjusting for clinical and nodule features. The causal graph identified two nodule features and skeletal muscle density as directly linked to nodule malignancy. Skeletal muscle density and intramuscular adipose tissue density were identified as nodule growth indicators in both genders.
Conclusions Body composition can serve as a potential biomarker for assessing nodule malignancy and evaluating nodule growth in both genders.
Summary Statement We found that body composition were critical indicators for discriminating malignant nodules from benign ones and for evaluating the nodule growth in both males and females.
Key Results
Univariate analysis revealed a significant association between body composition and nodule malignancy in both males and females. Multivariate analysis further demonstrated the predictive ability of body composition features.
Feature importance analysis and causal graph analysis identified skeletal muscle density as one of the leading features associated with nodule malignancy.
Skeletal muscle density and intramuscular adipose tissue density were identified as nodule growth indicators in both males and females.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was funded by the National Institutes of Health (NIH) (R01CA237277, U01CA271888, P30CA047904, and U01CA152662) and UPMC Hillman Developmental Pilot Program
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics committee/IRB of University of Pittsburgh gave ethical approval for this work (IRB 011171).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced in the present study are available upon reasonable request to the authors
Abbreviations
- AI
- artificial intelligence
- BC
- body composition
- CI
- confidence interval
- LDCT
- low-dose computed tomography
- LASSO
- least absolute shrinkage and selection operator
- LR
- logistic regression
- PI
- permutation importance
- ROC-AUC
- the area under the receiver operating characteristic curve
- RF
- random forest
- SVM
- support vector machine