PT - JOURNAL ARTICLE
AU - Chu, Phuong Thi Minh
AU - Ha, Tram Pham Bich
AU - Vu, Ngoc Minh
AU - Ha, Hoang
AU - Doan, Thu Minh
TI - The application of deep learning in lung cancerous lesion detection
AID - 10.1101/2024.04.12.24305708
DP - 2024 Jan 01
TA - medRxiv
PG - 2024.04.12.24305708
4099 - http://medrxiv.org/content/early/2024/04/15/2024.04.12.24305708.short
4100 - http://medrxiv.org/content/early/2024/04/15/2024.04.12.24305708.full
AB - Background: Lung cancer is characterized by rapid metastasis and a high death rate, which makes early detection critical to combating the disease. This study addresses the urgent need for early lung cancer detection using deep learning models applied to computed tomography (CT) images.
Methods: Our study introduced a unique non-cancer pneumonia dataset, a publicly available large-scale collection of high-quality pneumonia CT scans with detailed descriptions. We used this dataset to fine-tune nine pretrained models (DenseNet121, MobileNetV2, InceptionV3, InceptionResNetV2, ResNet50, ResNet101, VGG16, VGG19, and Xception) for the classification of lung cancer and pneumonia.
Results: ResNet50 demonstrated the highest accuracy and sensitivity (97.7% and 100%, respectively), while InceptionV3 excelled in precision (97.9%) and specificity (98.0%). The study also highlighted the contribution of the gradient-weighted class activation mapping (Grad-CAM) technique in examining the effectiveness of the model-training process by visualizing the features learned across different layers. Grad-CAM revealed that, among the best-performing models, InceptionV3 successfully identified cancerous lesions in CT scans.
Our findings demonstrated the potential of deep learning models in early lung cancer screening and in improving the accuracy of the diagnostic procedure.
Data availability: The pneumonia CT scan dataset used in this study was extracted from peer-reviewed publications and can be accessed at https://github.com/ReiCHU31/CT-pneumonia-dataset
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: This study did not receive any funding.
Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: https://doi.org/10.7937/TCIA.2020.NNC2-0461
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes.
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes.
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes.
The lung cancer CT image dataset was obtained from 101 randomly selected patients in a large-scale open-access CT and PET/CT library containing approximately 251,135 scans of 355 lung cancer patients [23, 24].
Meanwhile, the pneumonia dataset is a newly established library compiled from peer-reviewed scientific publications and is available at https://github.com/ReiCHU31/CT-pneumonia-dataset. This dataset must be used responsibly and solely for research purposes. Ethical approval is not required. The results will be shared through several channels, including peer-reviewed publications, conference presentations, and communication with other segments of healthcare and society.
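
The four figures quoted in the Results (accuracy, sensitivity, precision, specificity) are all derived from a binary confusion matrix. The sketch below shows one common way to compute them. It assumes that lung cancer is treated as the positive class and pneumonia as the negative class, which the record does not state explicitly. The names `ConfusionCounts` and `screening_metrics` and the counts in the usage example are hypothetical and are not taken from the study.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts from a binary classifier (assumed: positive = lung cancer)."""
    tp: int  # cancer scans predicted as cancer
    fp: int  # pneumonia scans predicted as cancer
    tn: int  # pneumonia scans predicted as pneumonia
    fn: int  # cancer scans predicted as pneumonia


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def screening_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Compute the four metrics named in the abstract from raw counts."""
    total = c.tp + c.fp + c.tn + c.fn
    return {
        "accuracy": _ratio(c.tp + c.tn, total),
        "sensitivity": _ratio(c.tp, c.tp + c.fn),  # recall on the cancer class
        "precision": _ratio(c.tp, c.tp + c.fp),
        "specificity": _ratio(c.tn, c.tn + c.fp),
    }


if __name__ == "__main__":
    # Hypothetical counts for illustration only, not the study's results.
    example = ConfusionCounts(tp=50, fp=2, tn=48, fn=0)
    for name, value in screening_metrics(example).items():
        print(f"{name}: {value:.3f}")
```

A sensitivity of 100% means no cancer scan in the test set was classified as pneumonia (`fn == 0`), while precision and specificity both fall as pneumonia scans are misclassified as cancer (`fp` grows). This is why one model can lead on sensitivity while another leads on precision and specificity.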