Abstract
Genome-wide DNA methylation aberrations are pervasive and associated with clinicopathological features across pituitary tumors (PT) subtypes. The feasibility to detect CpG methylation abnormalities in circulating cell-free DNA (cfDNA) has been reported in central nervous system tumors other than PT. Here, we aimed to profile and identify methylome-based signatures in the serum of patients harboring PT (n =13). Our analysis indicated that serum cfDNA methylome from patients with PT are distinct from the counterparts in patients with other tumors (gliomas, meningiomas, colorectal carcinomas, n =134) and nontumor conditions (n = 4). Furthermore, the serum methylome patterns across PT was associated with functional status and adenohypophyseal cell lineage PT subtypes, recapitulating epigenetic features reported in PT-tissue. A machine learning algorithm using serum PT-specific signatures generated a score that distinguished PT from non-PT conditions with 100% accuracy in our validation set. These preliminary results underpin the potential clinical application of a liquid biopsy-based DNA methylation profiling as a noninvasive approach to identify clinically relevant epigenetic markers that can be used in the management of PT.
Introduction
Liquid biopsy (LB) is a method used to detect molecular elements (e.g., DNA, RNA, etc.) shed by tumors in biofluids (blood, cerebrospinal fluid etc). Circulating cell-free DNA (cfDNA), specifically the tumor DNA fraction (ctDNA), is thought to originate from cellular death (apoptosis and necrosis) or secretion from live cells, especially from proliferative tissues or tumors 1–5. Blood-based LB has emerged as a reliable and a minimally invasive approach to identify clinically relevant molecular biomarkers from several tumor origins, including from central nervous system (CNS)neoplasms 5–8.
In contrast to CNS tumors that are shielded by the blood-brain barrier, the pituitary gland presents an anatomical structure that facilitates the spillage of tumor cellular material into the bloodstream, i.e. a fenestrated pituitary portal system and/or an access to the cavernous system. This structural advantage creates an opportunity to profile tumor-specific molecular features of material released from these tumors potentially suitable for clinicopathological application 9,10. Indeed, the feasibility to detect and sequence somatic gene variants in ctDNA has recently been reported in PT 1; however, the detection sensitivity of this approach was low in these tumors 1. The paucity of genetic alterations in the pathogenesis of PT such as recurrent somatic mutations may have contributed to these results11–15. In contrast, genome-wide methylation abnormalities detected in the tissue are knowingly pervasive across PT subtypes 13,16–23.Additionally,DNA methylome patterns are tissue- and tumor-specific providing an opportunity to predict the tissue of origin of the tumor through DNA methylation profiling 5,20,24,25. In fact, many studies showed that specific methylome patterns detected in the tissue distinguished PT from other CNS tumors and defined discrete methylation subtypes among different CNS tumors 20,26,27. Additionally, methylation markers presented diagnostic, prognostic and predictive applications in CNS tumors 20,26,27. The feasibility of detecting these tissue- or tumor-specific methylation signatures using a liquid biopsy approach is an emerging field that has not been reported in PT to date.
In this study, we profiled the serum cfDNA methylome derived from patients with PT or other tumors and nontumor conditions. We identify unique methylation signatures in the serum associated with clinicopathological features specific to PT. This proof-of concept study paves the way for the potential clinical application of a liquid biopsy as a noninvasive approach to identify and assess relevant epigenetic markers that may be useful in the management of patients with PT.
Results
Characterization of pituitary cell-free DNA methylome cfDNA quantification
Total extracted serum cfDNA quantity, normalized to the genomic size (ng/ml, see Methods), were not significantly different from controls (mean±SD, 59.3±134.2 vs 5±5.0 ng/ml, respectively; p=.14) or in relation to functional or invasion status in PT (Supplemental Figure 1A).
Deconvolution
The deconvolution of the serum cfDNA methylome showed that patients harboring PT had higher proportion of bulk pituitary gland signatures compared to the control serum and other CNS conditions (3%higher, p =.05)(Supplemental Figure 1B; Supplemental Table 2).
Methylome analysis
The genome-wide mean methylation landscape of the serum cfDNA from patients with PT and non-PT conditions (gliomas, meningiomas, colorectal carcinomas and nontumor conditions) showed that PT segregated into a hypomethylated and a hypermethylated cluster; the latter, shared similar CpG methylation degree with the serum methylome from patients with glioma, meningioma, colorectal cancer and nontumor controls (Figure 1A and 1B).
Conducting a supervised analysis between PT and non-PT serum specimens and selecting probes that shared similarities with the matching PT tissue (Supplemental Figure 1D, left), we identified 46 differentially methylated probes (DMP), namely Pituitary Tumors-specific Epigenetic-Liquid Biopsy (PeLB) probes, that significantly distinguished both groups (Figure 1B)and distinguished two subgroups across PT (hyper and hypomethylated) (Figure 1C),
The two methylation clusters were associated with distinct clinicopathological status, i.e. the hypermethylated cluster was predominantly composed of nonfunctioning, mainly encompassed by SF1 lineage and null cell tumors and the hypomethylated with functioning PT mostly comprised of Pit1-lineage tumors (Figure 1C). As an exception, one lactotroph adenoma/Pit1-lineage segregated with nonfunctioning PT, despite being clinically classified as functioning, and a functioning Tpit1-lineage tumor clustered with nonfunctioning tumors. These results recapitulate the findings in their matching tissue (Supplemental Figure 1E).
cfDNA methylome from patients with PT pituitary-specific epigenetic signatures distinct from other pathological conditions
Overlapped with tumor tissues, PeLB probes clustered with PT tissue and significantly separated PT from other CNS tumor-tissue, confirming PeLB-specificity to PT in an independent cohort (Figure 1D). Taking this feature into account, we developed and cross-validated a score derived from a machine learning (ML) model (repeated 5000 times), namely the PeLB score, to predict whether a serum specimen originates from a patient with PT or a non-PT condition (Figures 1E–F). Pituitary-derived serum methylome samples carried the highest values of PeLB score (71–99%), whereas the serum of non-PT tumors carried the lowest values (0–45%) (Figure F). The evaluation of the model in the validation sample set showed that the model performed with an accuracy of 100%, taking into account a 50% PeLB cutoff.
We also defined serum-based methylation signatures (n=70) accounting for the functional/lineage status of PT (nonfunctioning vs functioning PT) (p <.01, differential mean methylation >.2, FDR <.26), we named functioning-PeLB (Func-PeLB)(Figures 2A, Supplemental Figure 1D, right). Harnessing the methylome from matching tissue and publicly available data reporting on the functional status of PT, we observed that a subset of the Func-PeLB probes (overlapped with the 450K platform, used to profile the tissue-methylome of those samples) (n=22 probes) (Supplemental Figure 1D, right), also discriminated the two functional groups at the tissue and respective serum levels (Figure 2B–D)
The CpG probes that distinguished the methylation clusters either in tissue (n= 5000) or serum (6000) were most frequently located in open sea regions (67% and 61%, respectively) and gene bodies (61 and 55%, respectively) (Figure 1B,Supplemental Table 2).
Discussion
Methylome-derived signatures define molecular subtypes that are useful for the diagnosis and prognostication across many tumors 13,16–18,20,21,26–29. Additionally, genome-wide DNA methylation patterns are cell-specific either in healthy or tumor specimens5,18,20,24,25,30–32. The ability to detect methylation signatures and tumor-specific abnormalities by the profiling of circulating cell-free DNA (cfDNA) in biofluids (liquid biopsy), such as blood, has been useful for the early detection and surveillance of malignant neoplasms 5,33–35. In relation to CNS tumors, our group has recently reported on the feasibility to identify methylation-based markers in serum-derived cfDNA for the diagnosis and prognostication of gliomas and meningiomas 36. Herein, we show that, similar to malignant and other CNS tumors, PT releases tumor-related information in the blood that allows the identification of clinically relevant methylation signatures specific to patients with PT, namely PeLB probes (Figure 1A–B, Supplemental Figure 1D).Capitalizing on the specificity of these probes, we used a machine learning approach to generate the PeLB score (Figure 1D–E) to predict the presence of a pituitary tumor using liquid biopsy. We showed that PeLB score performed with a 100% accuracy to predict that serum was derived from patients with PT in our validation cohort (Figure 1F). These results remain to be confirmed in an independent cohort of PT-derived, currently unavailable.
In addition, distinct serum DNA methylation landscape, specifically PeLB probes, defined two methylation groups that recapitulated the clinicopathological findings displayed in their matching tissue as reported in other studies 13,16–18,20,21(Figure 1C–D,Supplemental Figure 1E). These serum-derived clusters showed that the hypermethylated group was enriched by nonfunctioning PT mainly originated from SF1 and Tpit cell lineages and the hypomethylated set mainly composed of functioning PT mostly originated from Pit-1 cell lineages (Figure 1C, Supplemental Figure 1D). We narrowed down to a subset of PeLB probes (Func-PeLB) that preserved the distinction between both clusters in tumor-tissue specimens as well (Figure 2C, Supplemental Figure 1D). Altogether, these results suggest that PT releases DNA methylation markers in the serum that reflect clinicopathological features such as functional status and adenohypophyseal lineage of these tumors. Confirmation of these findings in a larger and more comprehensive cohort lay the groundwork to the application of PeLB probes as an objective approach to classify PT according to cell-lineage as recommended by the 2017 WHO37.
Considering the prognostic value reported in glioma or meningiomas, we surveyed serum-methylation markers specific to the invasion status of PT. Corroborating the findings reported in the tissue, we found slight serum differences between invasive and noninvasive groups (data not shown)13,16,17,21. However, the association of tissue- or serum-derived methylation groups with the criteria that better predict PT with higher risk to progress or recur remains to be elucidated 13,18,38–43.
The application of PeLB score is not intended to replace the standard approaches to diagnose and classify PT which, in most of the cases, is satisfactorily performed by clinical features, hormonal assessment in the blood/urine and on the imaging of the pituitary gland 44. However, these results provide evidence that serum cfDNA constitutes a reliable source of clinically relevant tumor-specific epigenetic signatures in PT as observed in other CNS tumors 36. Potentially, the specificity of PeLB probes could be helpful to distinguish PT from other rare primary or secondary sellar tumors whose diagnosis by morphologic and immunohistochemical approaches may be challenging, unavailable and/or inconclusive (e.g. craniopharyngioma variants, lymphoma, metastasis etc)5,34,45,46.
In conclusion, our results indicate that similar to malignant tumors, PT releases circulating tumor DNA that present specific methylation patterns, recapitulating molecular features detected in PT-tissue (e.g. adenohypophyseal lineage-related). Serum from patients with PT provides tumor-specific methylation signatures that allow the classification of samples into PT subtypes or non-PT groups. Finally, our preliminary results underpin the potential application of methylation profile in the serum-based liquid biopsy as a noninvasive approach to assess clinically relevant epigenetic features useful for clinical purposes in the management of patients (e.g. aggressiveness markers, actionable markers to guide future clinical trials to treat aggressive, resistant or recurrent PT etc).
Methods
Patients
– We conducted a retrospective analysis of a cohort comprised of archival serum and paired tissue (fresh-frozen) from 13 patients who underwent transsphenoidal surgery for the resection of invasive (n=5) or noninvasive (n=8) macroadenomas of different functional status and histological subtypes (9 nonfunctioning: 4 gonadotroph and 5 null cell and 4 functioning: 2 lactotroph, 1 corticotroph and 1 mixed GH/PRL/TSH) (Table 1).Criteria for invasiveness was based on Knosp grades 3–4 (n=4) or invasion of clivus (n=1)47,48. MRI assessment for size, and invasiveness classification was blindly and independently performed by two physicians from the Henry Ford Health System (HFHS)(TA, KPA). HFHS Pathologists provided a comprehensive pathology report on adenohypophyseal immunostaining, necrosis and quantification of markers of proliferation (Ki-67, mitotic counts, p53). Control serum was obtained from patients without PT (three epileptic patients and one with a nontumor condition). Control pituitary tissue was obtained from non-neoplastic pituitary harvested at autopsy (FFPE). We also generated serum methylome data from patients with glioma (n=114), meningiomas (n=6) and other CNS conditions (brain metastasis, 1 brain colloid cyst, 6 brain radiation necrosis) (Supplemental Table 2) The project was approved by the HFHS Institutional Review Board (IRB#10963) and patients consented to have their specimens used for research purposes. Publicly available methylome data from colorectal carcinoma was retrieved (CRC, n=2 pooled samples)49.
Serum collection and processing
For the specimens originated from the HFHS, peripheral blood (15 mL) was drawn from each subject at the time of surgery before the tumor excision (transphenoidal).
Serum sample was separated within 1 hour from collection by centrifugation at 1,300 x g for 10 minutes at 20°C; aliquoted into up to five 2 mL cryovials and stored at –80°C until processing. The methods for the publicly available data is described in their respective manuscripts 49
DNA isolation, quantification,and DNA methylation data generation
Tissue and serum DNA were extracted from 2.2–9.3mL aliquots of serum using the Quick-cfDNA Serum & Plasma Kit according to the manufacturer’s protocol (Zymo Research – catalog # D4076). DNA concentration was measured with Qubit (Thermo Fisher Scientific) /or with 4200 TapeStation (Agilent Technologies). The concentration of cfDNA in the serum was calculated by dividing the total amount of cfDNA extracted by the amount of serum used for extraction. We then converted the concentration of cfDNA in the serum (ng/mL) into haploid genome equivalents/mL by multiplying by a factor of 303 (assuming the mass of a haploid genome 3.3 pg) 50.
The extracted DNA (30–300 ng) was bisulfite-converted (Zymo EZ DNA methylation Kit; Zymo Research) and profiled using an Illumina Human EPIC array (HM850K), at the USC Epigenome Center, Keck School of Medicine, University of Southern California, Los Angeles, California. The raw DNA methylation data reported in this paper has been deposited to Mendeley Data at https://data.mendeley.com/datasets/cgrz6zztfg.
DNA methylation pre-processing
Methylation array data was processed with the minfi package in R. The raw signal intensities were extracted from the *.IDAT files and corrected for background fluorescence intensities and red-green dye-bias using the function preprocess Noob as described by Triche et al., 2013 51. The beta-values were calculated as (M/(M+U)), in which M and U refer to the (pre-processed) mean methylated and unmethylated probe signal intensities, respectively. Measurements in which the fluorescent intensity was not statistically significant above background signal (detection p value > 10−16) were removed from the data set. Before the analysis, we filtered out probes that were designed for sequences with known polymorphisms or probes with poor mapping quality (complete list of masked probes provided by Zhou et al.52) and the X and Y chromosomes.
Deconvolution
We applied a previously described methodology 50 to deconvolute the relative contribution of cell types to a given sample 50. We included methylation signatures from cell lines, immune cells (B-cell, CD4T, CD8T, natural killer cells and white blood cells (monocytes, neutrophils) and vascular endothelial cells 50(Supplemental Table 2) For lack of information related to methylation signatures from individual cells that comprise the pituitary gland, we generated genome-wide methylation signatures from bulk non- neoplastic pituitaries obtained from cadavers (unpublished data) and followed the steps for defining the signatures as previously described 50. Briefly, we selected the 100 most specific hypermethylated and hypomethylated CpG probes for each cell/tissue type of interest. Using this signature, we applied a non-negative least squares method to deconvolute our serum and tissue cohort using the standalone program provided by Moss and colleagues 50. We then normalized the percentages generated by the standalone program for each cell type/PT-tissue from 0 to 100 by serum.
DNA methylation exploratory analysis (unsupervised analysis)
In order to evaluate the DNA methylation profile in the serum from patients with distinct tumor types and non-neoplastic brain diseases, we performed a genome-wide Principal Component Analysis (PCA) across the samples (N=147) using the function prcomp(version 3.6.0). Consensus clustering was determined by k-means clustering of euclidean distance from the ConsensusClusterPlus(version 1.48.0) package.
Supervised analysis
We also performed an epigenome-wide differential analysis across the serum from 10 patients with PT and 105 with non-PT conditions patients (4 non-tumor, 114 glioma, 3 meningioma, 1 brain metastasis carcinoma, 1 colloid cyst, and 4 from other CNS necrotic tumors). We used the Wilcoxon rank-sum test to identify differentially methylated probes between two different pairs: PT vs non-PT and functioning vs nonfunctioning PT.
For the comparison between PT and non-PT, probes were considered differentially methylated when the false discovery rate (FDR) was less than .001 and absolute value of the difference of a pair of probe mean methylation between each group was greater than 20%. To identify DMP in the serum that were tissue-specific, we calculated the differences in DNA methylation between the matching serum and tissue, by patient. We then selected probes with less than 5% difference between tissue and serum and considered them tissue-specific.
To validate their PT-specificity, we overlapped PeLB probes with the DNA methylome of an independent cohort consisting of pituitary-, glioma- and meningioma-tissue(Figure 1D).
For the comparison between functioning and nonfunctioning, probes were considered differentially methylated when the p-value was less than .01 and absolute value of the difference of probe mean methylation between each group was greater than 20%.
Random Forest
We used a random forest machine-learning (ML) model for binary classification of the specimens with the aim to classify available cfDNA methylation (from serum) derived from patients with PT and non-PT (other neoplastic or non-neoplastic conditions: meningioma, glioma and colorectal carcinoma and nontumor). We first randomly allocated 20% of all samples for the validation set (n = 3 PT; n = 29 Non-PT) only analyzed for the assessment of the prediction model accuracy. The remainder serum specimens were used for the feature extraction or training of the random forest model. For developing the model we randomly partitioned the remainder samples into a training (n = 8 PT; n =84 Non-PT) and testing set (n = 2 PT; n = 21 Non-PT). We used the function train (package caret version 6.0.82) in CRAN, with 5000 trees, and 10 fold cross validation to generate our model. When testing the model, we used an output of 50% probability as a cut-off for classification.
Based on this result, we adopted the default PeLB score cutoff value of 50 to determine whether a patient had PT. We evaluated the performance of the prediction by applying the ML model on the validation set.
Probe annotation
CpG probes were mapped to their CpG genomic location as CpG islands (CGI), shores, shelves, and open sea regions as previously defined 52–55.
Statistical analysis
All processing and statistical analyses were done in R (3.6.1). Wilcoxon rank-sum test and multiple testing adjustments (e.g. FDR) were used to identify differentially methylated probes (DMP) as stated in the previous sections.
Data Availability
The data will be available under the accession code GSEXXXX. All the other data supporting the findings of this study are available within the article and supplemental information and from the corresponding author upon reasonable request.
Funding
This work was supported by the Henry Ford Health System, Department of Neurosurgery, and the Hermelin Brain Tumor Center. MSM and MC are supported by the São Paulo Research Foundation (FAPESP), Brazil (#16/11039–3; #17/10357-4,#14/03989–6); AVC and KPA by Henry Ford Hospital (A30935, A30957; GME 202199); LMP, HN, AD, MW, and AM by the National Institutes of Health (R01CA222146), HN, TSS, TMM, LMP, and AD are supported by the Department of Defense (CA170278).
Author contributions
Overall concept and coordination of the study: AVC, JR, HN, KPA; retrieval of publicly available molecular and clinical data: KPA, MW, AVC; Bioinformatic and statistical analyses: MW, TSS, TMM, MSM, HN and input from LMP; HFHF cohort: pathology review AM, DC; molecular data generation: TMM, AD; the manuscript was written by AVC, HN, MW and intellectual contribution from JS, TM, SK, TW. All authors contributed to the revision of the manuscript.
Data availability
The data is available under the accession code GSEXXXX. All the other data supporting the findings of this study are available within the article and supplemental information and from the corresponding author upon reasonable request.
Competing interests
The authors declare to have no competing interests.
Footnotes
These authors contributed equally as first authors: Michael Wells, Karam P. Asmaro,Thais S. Sabedot, Tathiane M. Malta, and Maritza S. Mosella. These authors contributed equally as senior authors: Houtan Noushmehr, Ana Valeria Castro
Contributor information
Ana Valeria Castro, Email: acastro1{at}hfhs.org
Houtan Noushmehr, Email:hnoush1{at}hfhs.org contributed to the revision of the manuscript. the findings of this study are available within the article and supplemental information and from the corresponding author upon reasonable request.
Acknowledgements
The authors are grateful to the HFHS patients who consented to the usage of PT for research purposes. We thank Nancy Takacs and Heather Mengel for their administrative support; Kevin Nelson for the collection, handling and maintenance of the tumor bank at the Hermelin Brain Tumor Center; Andrea Transou for tumor pathology processing; Laura A. Hasselbach for DNA extraction; Daniel Weisenberger and team at USC Epigenome Center for assistance with DNA methylation profiling (HFHS support);Susan MacPhee for proofreading the manuscript.