Abstract
The Pancreatic Expression Database (PED) is a powerful resource dedicated to the mining and analysis of pancreatic -omics datasets. Here, we demonstrate the biological interpretations that are possible because of vital updates that have transformed PED into a dynamic analytics hub accommodating an extensive range of publicly available datasets. PED now hosts clinical and molecular datasets from four primary sources (Cancer Genome Atlas, International Cancer Genome Consortium, Cancer Cell Line Encyclopaedia and Genomics Evidence Neoplasia Information Exchange) that together form the foundation of omics profiling of pancreatic malignancies and related lesions (n=7,760 specimens). Several user-friendly analytical tools to explore and integrate the molecular data derived from these primary specimens and cell lines are now available. Crucially, PED is integrated as the data access point for Pancreatic Cancer Research Fund Tissue Bank – the only national pancreatic cancer biobank in the UK. This will pioneer a new era of biobanking to promote collaborative studies and effective sharing of multi-modal molecular, histopathology and imaging data from biobank samples (>60,000 specimens from >3,400 cases and controls; 2,037 H&E images from 349 donors) and accelerate validation of in silico findings in patient-derived material. These updates place PED at the analytical forefront of pancreatic biomarker-based research, providing the user community with a distinct resource to facilitate hypothesis-testing on public data, validate novel research findings, and access curated, high-quality patient tissues for translational research. To demonstrate the practical utility of PED, we investigate somatic variants associated with established transcriptomic subtypes and disease prognosis: several patient-specific variants are clinically actionable and may be leveraged for precision medicine.
Competing Interest Statement
H.M.K. received research grant for conducting this trial (Celgene: institutional) and educational grant support for attending or organizing conferences (Celgene, Baxalta, Mylan, Medtronic, Oncosil: institutional) which are unrelated to this work. No other authors have interests to declare.
Funding Statement
The PCRFTB is funded by PCRF. Organoid generation is supported by Barts Charity. HMK and CC acknowledge support of NIHR Barts BRC. This work was supported by Barts Charity (grant code MGU0504) and Barts NIHR BRC (grant code BTXH1A1R), part of the Precision Medicine programme.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
TCGA data can be accessed at https://portal.gdc.cancer.gov/. Original ICGC data can now be accessed at https://ega-archive.org/dacs/EGAC00001000010. GENIE data are available from https://www.aacr.org/professionals/research/aacr-project-genie/aacr-project-genie-data/.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
PCRFTB: Mo Abu Hilal (Southampton), Bilal Al Sarireh (Swansea), Somaiah Aroori (Plymouth), Ali Arshad (Southampton), Satyajit Bhattacharya (The London Clinic, London), Brian Davidson (Royal Free, London), John Isherwood (Leicester), Deep Malde (Leicester), Stuart Robinson (Newcastle), Zahir Soonawalla (Oxford).
Data Availability
N/A - all data presented in the study are from publicly available resources (TCGA, ICGC, GENIE, CCLE).
https://portal.gdc.cancer.gov/
https://sites.broadinstitute.org/ccle/datasets
https://www.aacr.org/professionals/research/aacr-project-genie/aacr-project-genie-data/