Abstract
Alterations in DNA methylation patterns are a frequent finding in cancer. Methylation aberrations can drive tumorigenic pathways and serve as potential biomarkers. The role of epigenetic alterations in thyroid cancer is still poorly understood Here, we analyzed methylome data of a total of 810 thyroid samples (n=256 for discovery and n=554 for validation), including benign and malignant follicular cell-derived thyroid neoplasms, as well as normal thyroid tissue. In the discovery phase, we employed an unsupervised machine-learning method to search for methylation patterns. We found evidence supporting the existence of three distinct methylation subtypes: a normal-like, a hypermethylated follicular-like, and a hypomethylated papillary-like cluster. Follicular adenomas, follicular carcinomas, oncocytic adenomas, oncocytic carcinomas, and NIFTP samples were grouped within the follicular-like cluster, indicating that these pathologies shared numerous epigenetic alterations, with a predominance of hypermethylation events. Conversely, classic papillary thyroid carcinomas (PTC) and tall cell PTC formed a separate subtype characterized by the predominance of hypomethylated positions. Interestingly, follicular variant papillary thyroid carcinomas (FVPTC) were as likely to be classified as follicular-like or PTC-like during the discovery phase, indicating a heterogeneous group likely to be formed by at least two distinct diseases. In the validation phase, we found that FVPTC with follicular-like methylation patterns were enriched for RAS mutations. In contrast, FVPTC with PTC-like methylation patterns were enriched for BRAF and RET alterations. Our data provide novel insights into the epigenetic alterations of thyroid tumors. Since the classification method relies on a fully unsupervised machine learning approach for subtype discovery, our results offer a robust background to support the classification of thyroid neoplasms based on methylation patterns.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was financed in part by the Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior - Brasil (CAPES) - Finance Code 001. This work received financial support from Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq/AWS No 032/2019, 440005/2020-5). V.R.M. received a PhD scholarship from Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior (CAPES) (88887.354162/2019-00). A.L.M. is a recipient of scholarship (No 316323/2021-7) and research grants (No 408344/2021-0) from the Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq), and Programa de Apoio a Nucleos de Excelencia (PRONEX/FAPERGS; No 16/2551-0000486-2). The funder had no role in the design of the study; the collection, analysis, and interpretation of the data; the writing of the manuscript; or the decision to submit the manuscript for publication.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
All the data used in our study are publicly available at Gene Expression Omnibus (GEO)(https://www.ncbi.nlm.nih.gov/geo/) or GDC Data Portal (https://portal.gdc.cancer.gov/).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Competing Interest Statement: VRM, MRM, ALM, and IMG declare no competing interests concerning the work described.
Funding Statement: This study was financed by grants from the government research agencies Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), and Fundação de Amparo à Pesquisa do Estado do Rio Grande do Sul (FAPERGS).
Data Availability: All data used in our study is publicly available at Gene expression. Omnibus (GEO). All R packages employed in our study are also freely available at CRAN or Bioconductor.
Ethics approval statement: The present study was approved by the institutional ethical committee.
Data Availability
All the data used in our study are publicly available at Gene Expression Omnibus (GEO) or GDC Data Portal. All R packages employed in our study are also freely available at CRAN or Bioconductor.