Abstract
Background Emphysema is a common pulmonary pathology known to be associated with increased risk of lung cancer and lung biopsy complications. Prevailing quantitation method of calculating voxel-wise percentage of low attenuation area (LAA) of lung tissue from CT scans is prone to noise and error due overcounting of single voxel LAA and incomplete segmentation of airways.
Purpose We aim to develop an accurate algorithm to quantitatively measure emphysema and classify its severity..
Methods and Materials Two chest CT datasets were obtained from two tertiary hospitals as training and external validation datasets. Exclusion criteria included any patients whose emphysema extent was not specified by the accompanying report. The training dataset included 722 patients, and the validation dataset included 1006 patients. Following lung segmentation and airways removal, we applied convolution of the segmented lung with averaging kernels of different sizes in 2D and 3D. Cutoffs between “none,” “mild to moderate,” and “severe” emphysema were determined via weighted logistic regression on the training dataset, and the categorical emphysema extent was obtained for each patient. The main measure for evaluating model performance was area under the curve (AUC) of the receiver operating characteristic (ROC) on the training dataset and accuracy of classification on both the training and the validation dataset. The 1×1×1 kernel, which is equivalent to the traditional LAA score, was used for comparison to other kernels for performance evaluation.
Results The best model used a 3D 3×3×3 kernel for average filtering with airways post processing and achieved a mean AUC of 0.782 and 0.985 for “none”-versus-rest and “severe”-versus-rest classifications respectively. It achieved a 0.676 and 0.757 multiclass classification accuracy on the training and validation dataset respectively.
Conclusions and Relevance We present an automated pipeline that can achieve accurate emphysema quantification and severity classification. We showed that convolving the segmented lung with a 3D 3×3×3 kernel and post-processing to remove airways can reliably quantify emphysema.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study did not receive any funding.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
IRB of University of California San Francisco (UCSF) gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
Data produced by this study contains confidential information about patients and is not publicly available. Codes accompanying this study are available at https://github.com/bdrad/Emphysema_Quantification