Abstract
Despite their promise, circulating tumor DNA (ctDNA)-based assays for multi-cancer early detection face challenges in test performance, due mostly to the limited abundance of ctDNA and its inherent variability. To address these challenges, published assays to date demanded a very high-depth sequencing, resulting in an elevated price of test. Herein, we developed a multimodal assay called SPOT-MAS (Screening for the Presence Of Tumor by Methylation And Size) to simultaneously profile methylomics, fragmentomics, copy number, and end motifs in a single workflow using targeted and shallow genome-wide sequencing (∼0.55X) of cell-free DNA. We applied SPOT-MAS to 738 nonmetastatic patients with breast, colorectal, gastric, lung and liver cancer, and 1,550 healthy controls. We then employed machine learning to extract multiple cancer and tissue-specific signatures for detecting and locating cancer. SPOT-MAS successfully detected the five cancer types with a sensitivity of 72.4% at 97.0% specificity. The sensitivities for detecting early-stage cancers were 62.3% and 73.9% for stage I and II, respectively, increasing to 88.3% for nonmetastatic stage IIIA. For tumor-of-origin, our assay achieved an accuracy of 0.7. Our study demonstrates comparable performance to other ctDNA-based assays while requiring significantly lower sequencing depth, making it economically feasible for population-wide screening.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The study was funded by Gene Solutions
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study was approved by the Ethics Committee of the Medic Medical Center, University of Medicine and Pharmacy and Medical Genetics Institute, Ho Chi Minh city, Vietnam.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
Section on Discussion updated to clarify
Data Availability
Sequencing data will be deposited in a public portal database (NCBI SRA) upon acceptance and are available on request from the corresponding author, LST. The data are not publicly available due to ethical restrictions.
List of abbreviations
- MCED
- Multi-cancer early detection
- TOO
- tissue of origin
- cfDNA
- Circulating cell-free DNA
- ctDNA
- Circulating tumor DNA
- SPOT-MAS
- Screening for the Presence Of Tumor by DNA Methylation And Size
- AUC
- Area Under the Curve
- TM
- Targeted methylation
- GWM
- Genome-wide methylation
- CNA
- Copy number aberration
- FLEN
- Fragment length
- EM
- End motif
- LB
- Liquid Biopsy
- LR
- Logistic regression
- RF
- Random Forest
- XGB
- Extreme Gradient Boosting
- DNN
- Deep neural network
- GCNN
- Graph Convolutional Neural Network
- CRC
- Colorectal cancer
- ROC
- Receiver Operating Characteristic