RT Journal Article SR Electronic T1 Machine learning for prediction of immunotherapy efficacy in non-small cell lung cancer from simple clinical and biological data JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.11.30.21267064 DO 10.1101/2021.11.30.21267064 A1 Benzekry, Sébastien A1 Grangeon, Mathieu A1 Karlsen, Mélanie A1 Alexa, Maria A1 Bicalho-Frazeto, Isabella A1 Chaleat, Solène A1 Tomasini, Pascale A1 Barbolosi, Dominique A1 Barlesi, Fabrice A1 Greillier, Laurent YR 2021 UL http://medrxiv.org/content/early/2021/12/02/2021.11.30.21267064.abstract AB Background Immune checkpoint inhibitors (ICIs) are now a therapeutic standard in advanced non-small cell lung cancer (NSCLC), but strong predictive markers for ICIs efficacy are still lacking. We evaluated machine learning models built on simple clinical and biological data to individually predict response to ICIs.Methods Patients with metastatic NSCLC who received ICI in second line or later were included. We collected clinical and hematological data and studied the association of this data with disease control rate (DCR), progression free survival (PFS) and overall survival (OS). Multiple machine learning (ML) algorithms were assessed for their ability to predict response.Results Overall, 298 patients were enrolled. The overall response rate and DCR were 15.3 % and 53%, respectively. Median PFS and OS were 3.3 and 11.4 months, respectively. In multivariable analysis, DCR was significantly associated with performance status (PS) and hemoglobin level (OR 0.58, p<0.0001; OR 1.8, p<0.001). These variables were also associated with PFS and OS and ranked top in random forest-based feature importance. Neutrophils-to-lymphocytes ratio was also associated with DCR, PFS and OS. The best ML algorithm was a random forest. It could predict DCR with satisfactory efficacy based on these three variables. Ten-fold cross-validated performances were: accuracy 0.68 ± 0.04, sensitivity 0.58 ± 0.08; specificity 0.78 ± 0.06; positive predictive value 0.70 ± 0.08; negative predictive value 0.68 ± 0.06; AUC 0.74 ± 0.03.Conclusion Combination of simple clinical and biological data could accurately predict disease control rate at the individual level.Highlights- Machine learning applied to a large set of NSCLC patients could predict efficacy of immunotherapy with a 69% accuracy using simple routine data- Hemoglobin levels and performance status were the strongest predictors and significantly associated with DCR, PFS and OS- Neutrophils-to-lymphocyte ratio was also associated with outcome- Benchmark of 8 machine learning modelsCompeting Interest StatementThe authors have declared no competing interest.Funding StatementThis work is part of the QUANTIC project funded by ITMO Cancer AVIESAN and French Institut National du Cancer (grant #19CM148-00)Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study protocol and retrospective data collection were approved by the Institutional Review Board of the French Society of Respiratory Diseases, under reference number: CEPRO 2019- 007.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesData used in this study is anonymized yet sensitive health patient data. It cannot be made public for legal reasons