PT - JOURNAL ARTICLE AU - Waller, Rosalie Griffin AU - Hanson, Heidi A. AU - Avery, Brian J. AU - Madsen, Michael J. AU - Sborov, Douglas W. AU - Camp, Nicola J. TI - Deep transcriptome profiling of multiple myeloma with quantitative measures using the SPECTRA approach AID - 10.1101/2020.10.06.20206714 DP - 2022 Jan 01 TA - medRxiv PG - 2020.10.06.20206714 4099 - http://medrxiv.org/content/early/2022/06/16/2020.10.06.20206714.short 4100 - http://medrxiv.org/content/early/2022/06/16/2020.10.06.20206714.full AB - SPECTRA is a new data framework to describe variation in a transcriptome as a set of unsupervised quantitative variables. Spectra variables provide a deep dive into the transcriptome, representing both large and small sources of variance, and are ideal for modeling alongside other variables for any outcome of interest. Each spectrum can also be considered a phenotypic trait, providing new avenues for disease characterization or to explore disease risk. We applied the SPECTRA approach to multiple myeloma (MM), the second most common blood cancer. Using RNA sequencing from malignant CD138+ cells, we derived 39 spectra in 767 patients from the MMRF CoMMpass study. We included spectra in prediction models for clinical endpoints, compared to established expression-based risk scores, and used descriptive modeling to identify associations with patient characteristics. Spectra-based risk scores added predictive value beyond established clinical risk factors and other expression-based risk scores for overall survival, progression-free survival, and time to first-line treatment failure. Significant spectra in models may provide mechanistic insight via gene set enrichment based on their gene weights. Gene set enrichment in CD138+ spectrum S5, which was significant for all prognostic endpoints, indicated enrichment for genes in the unfolded protein response, a mechanism targeted by proteasome inhibitors, common first line agents in MM treatment. We also identified significant associations between CD138+ spectra and tumor cytogenetics, race, gender, and age at diagnosis. The SPECTRA approach provides measures of transcriptome variation to deeply profile tumors with greater flexibility to model clinical outcomes and characteristics.AUTHOR SUMMARY Complex diseases, including cancer, are highly heterogeneous, and large molecular datasets are increasingly part of describing an individual’s unique experience. Gene expression is particularly attractive because it captures genetic, epigenetic, and environmental consequences. Transcriptome studies are gaining momentum in genomic epidemiology, and the need to incorporate these data in multivariable models alongside other risk factors brings demands for new approaches. The SPECTRA approach is a new intrinsic quantitative data framework for transcriptomes. A tissue is described by a set of quantitative measures (or ‘spectra’ variables) to deeply profile gene expression in a tissue. Spectra variables are independent and offer flexibility for use in predictive or descriptive modeling. We applied the SPECTRA approach to multiple myeloma, the second most common blood cancer. A set of 39 spectra variables were derived to represent the myeloma tumors. Outcome modeling provided SPECTRA-based risk scores that added predictive value for clinical outcomes beyond established risk factors.Competing Interest StatementThe authors have declared no competing interest.Funding StatementResearch reported in this publication was supported by the National Cancer Institute (Award Numbers F99CA234943, K00CA234943, and P30CA042014-29S9), the National Center for Advancing Translational Sciences (Award Number UL1TR002538), the National Library of Medicine (Award Number T15LM007124) of the National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Publicly available RNAseq data were used in this project from the Multiple Myeloma Research Foundation (MMRF) CoMMpass trial (NCT145429). To be eligible, a patient must have read, understood and signed informed consent. RNAseq data was downloaded directly from the MMRF Researcher Gateway (https://research.themmrf.org) after creating an account. Permission was also obtained to access these data via dbGaP under accessions phs000348.v2.p1 and phs000748.v4.p3.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll data is publicly available through the MMRF Researcher Gateway (https://research.themmrf.org). https://portal.gdc.cancer.gov/projects/MMRF-COMMPASS