PT - JOURNAL ARTICLE AU - Berman, Adam G. AU - Fitzgerald, Rebecca C. AU - Markowetz, Florian TI - A composite biomarker for esophageal cancer risk from automated analysis of a non-endoscopic device AID - 10.1101/2021.08.20.21262366 DP - 2021 Jan 01 TA - medRxiv PG - 2021.08.20.21262366 4099 - http://medrxiv.org/content/early/2021/08/25/2021.08.20.21262366.short 4100 - http://medrxiv.org/content/early/2021/08/25/2021.08.20.21262366.full AB - Barrett’s esophagus containing intestinal metaplasia predisposes to cancer, yet the majority of cases are undiagnosed. The length of a Barrett’s segment is a key indicator of cancer risk, but measuring it has so far relied on endoscopy, which is expensive and invasive. Cytosponge-TFF3 is a minimally-invasive test that identifies intestinal metaplasia for endoscopic confirmation. We report a machine learning technique to quantify the extent of intestinal metaplasia and predict Barrett’s segment length from whole-slide image tile counts automatically generated from Cytosponge-TFF3 histology slides. Utilizing data from 529 patients, our segment length prediction model achieves an average validation fold accuracy of 0.84. Applying this algorithm to an independent test set of 162 patients from a screening trial shows a precision of 0.90 for identifying short-segment disease. This advance will enable higher-risk patients to be prioritized for endoscopy while saving more than half of Cytosponge-TFF3-positive patients from endoscopy in the screening setting.Competing Interest StatementR.C.F is named on patents related to Cytosponge-TFF3 that have been licensed by the Medical Research Council to Covidien (now Medtronic). RCF is a founder and shareholder of Cyted Ltd. FM is a founder, director and shareholder of Tailor Bio.Clinical TrialISRCTN12730505, ISRCTN68382401Clinical Protocols https://github.com/markowetzlab/barretts-segment-length-predictor Funding StatementThis research was supported by Cancer Research UK (FM: C14303/A17197), a Medical Research Council programme grant (RCF: RG84369) and Cambridge University Hospitals NHS Foundation Trust. BEST2 was funded by Cancer Research UK (12088 and 16893). We would also like to thank the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014) and the Experimental Cancer Medicine Centre for their support. AGB acknowledges support from a Gates Cambridge Scholarship from the Bill & Melinda Gates Foundation. FM is a Royal Society Wolfson Research Merit Award holder.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:BEST2: Ethics approval was obtained from the East of England - Cambridge Central Research Ethics Committee (number 10/H0308/71) and registered in the UK Clinical Research Network Study Portfolio (9461). BEST3: Ethics approval was obtained from the East of England - Cambridge Central Research Ethics Committee (number 16/EE/0546).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data and fully documented code to perform all of the analyses of this paper can be found at this public repository in the form of two illustrative Jupyter notebooks: https://github.com/markowetzlab/barretts-segment-length-predictor https://github.com/markowetzlab/barretts-segment-length-predictor