Abstract
Background and Purpose Increased demand for head and neck cancer (HNC) radiotherapy data for algorithmic development has prompted an escalation of head and neck image dataset sharing. Medical images must comply with data protection requirements so that data re-use is enabled without disclosing direct or indirect identifiers of study participants. Defacing, i.e., the removal of facial features, is often considered an acceptable compromise between data protection and re-usability. While existing defacing tools have been developed by the neuroimaging community, their acceptability for preserving data that is crucial to radiotherapy applications have not been previously explored. Therefore, we systematically investigated the impact of currently available defacing algorithms on HNC organs at risk (OARs) used for radiotherapy planning.
Materials and Methods We utilized a publicly available dataset of T2-weighted magnetic resonance imaging (MRI) scans for 55 HNC patients with eight previously segmented OARs (right/left submandibular glands, right/left parotid glands, right/left level II neck lymph nodes, right/left level III neck lymph nodes). Four publicly available defacing algorithms were investigated: mri_deface, defacer, fsl_deface, and pydeface. We implemented a 5-fold cross-validation 3D U-net based OAR auto-segmentation model to perform two main experiments: 1.) comparing original and defaced data for training when evaluated on original data; 2.) using original data for training and comparing model evaluation on original and defaced data. These models were primarily investigated using the Dice similarity coefficient (DSC). The mean surface distance (MSD), and mean Hausdorff distance at 95% (MHD95) were also explored.
Results Of the studied algorithms, mri_deface and defacer were unable to produce any usable images for evaluation, while fsl_deface and pydeface were unable to remove the face for 18 and 24% of subjects, respectively. All the remaining subjects had voxels removed from all OARs. Composite OAR DSC using defaced data for model training and evaluation yielded statistically worse mean DSC performance (p ≤ 0.05) for both fsl_deface (training = 0.770, evaluation = 0.709) and pydeface (training = 0.631, evaluation = 0.453) compared to the original data (0.780). Significance testing demonstrated that the DSC and MHD95 were significantly better (p ≤ 0.05) for the model trained with the original data for both submandibular glands and for the right parotid gland compared to the model trained with fsl_deface data. Similarly, the MSD was significantly better (p ≤ 0.05) for the model trained with the original data for the left submandibular gland and both parotid glands. When trained with pydeface data, the model had significantly worse performance (p ≤ 1e-5) compared to the model trained with the original data for all structures and metrics.
Conclusion We demonstrate the choice of defacing algorithm has a significant impact on HNC OAR auto-segmentation model development and deployment. Our work incentivizes the further development of HNC radiotherapy-specific defacing methods.
Competing Interest Statement
C.D.F. has received direct industry grant support, speaking honoraria, and travel funding from Elekta AB. The other authors have no conflicts of interest to disclose.
Funding Statement
This work is supported by the International Laboratory of Social Neurobiology ICN HSE RF Government grant ag. No. 075 15 2019 1930 (to E.G.). This work was also supported by the National Institutes of Health (NIH)/National Cancer Institute (NCI) through a Cancer Center Support Grant (CCSG; P30CA016672-44). M.A.N. is supported by an NIH grant (R01DE028290-01). K.A.W. is supported by a training fellowship from The University of Texas Health Science Center at Houston Center for Clinical and Translational Sciences TL1 Program (TL1TR003169), the American Legion Auxiliary Fellowship in Cancer Research, and an NIH/National Institute for Dental and Craniofacial Research (NIDCR) F31 fellowship (1 F31DE031502-01). C.D.F. received funding from the NIH/NIDCR (1R01DE025248-01/R56DE025248); an NIH/NIDCR Academic-Industrial Partnership Award (R01DE028290); the National Science Foundation (NSF), Division of Mathematical Sciences, Joint NIH/NSF Initiative on Quantitative Approaches to Biomedical Big Data (QuBBD) Grant (NSF 1557679); the NIH Big Data to Knowledge (BD2K) Program of the NCI Early Stage Development of Technologies in Biomedical Computing, Informatics, and Big Data Science Award (1R01CA214825); the NCI Early Phase Clinical Trials in Imaging and Image-Guided Interventions Program (1R01CA218148); an NIH/NCI Pilot Research Program Award from the UT MD Anderson CCSG Radiation Oncology and Cancer Imaging Program (P30CA016672); an NIH/NCI Head and Neck Specialized Programs of Research Excellence (SPORE) Developmental Research Program Award (P50CA097007); and the National Institute of Biomedical Imaging and Bioengineering (NIBIB) Research Education Program (R25EB025787).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study involves only openly avalailbe human data, which can be obtained from: https://wiki.cancerimagingarchive.net/display/Public/AAPM+RT-MAC+Grand+Challenge+2019. As per the corresponding description of this dataset: “Images of patients were retrospectively retrieved under an Institutional Review Board approved protocol and waiver of informed consent (RCR08-0300).”
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Funding Acknowledgements: This work is supported by the International Laboratory of Social Neurobiology ICN HSE RF Government grant ag. No. 075–15-2019–1930 (to E.G.). This work was also supported by the National Institutes of Health (NIH)/National Cancer Institute (NCI) through a Cancer Center Support Grant (CCSG; P30CA016672-44). M.A.N. is supported by an NIH grant (R01DE028290-01). K.A.W. is supported by a training fellowship from The University of Texas Health Science Center at Houston Center for Clinical and Translational Sciences TL1 Program (TL1TR003169), the American Legion Auxiliary Fellowship in Cancer Research, and an NIH/National Institute for Dental and Craniofacial Research (NIDCR) F31 fellowship (1 F31DE031502-01). C.D.F. received funding from the NIH/NIDCR (1R01DE025248-01/R56DE025248); an NIH/NIDCR Academic-Industrial Partnership Award (R01DE028290); the National Science Foundation (NSF), Division of Mathematical Sciences, Joint NIH/NSF Initiative on Quantitative Approaches to Biomedical Big Data (QuBBD) Grant (NSF 1557679); the NIH Big Data to Knowledge (BD2K) Program of the NCI Early Stage Development of Technologies in Biomedical Computing, Informatics, and Big Data Science Award (1R01CA214825); the NCI Early Phase Clinical Trials in Imaging and Image-Guided Interventions Program (1R01CA218148); an NIH/NCI Pilot Research Program Award from the UT MD Anderson CCSG Radiation Oncology and Cancer Imaging Program (P30CA016672); an NIH/NCI Head and Neck Specialized Programs of Research Excellence (SPORE) Developmental Research Program Award (P50CA097007); and the National Institute of Biomedical Imaging and Bioengineering (NIBIB) Research Education Program (R25EB025787).
Data Availability
All data produced in the present study are available upon reasonable request to the authors.