Abstract
Metabolic biomarker data quantified by nuclear magnetic resonance (NMR) spectroscopy for 249 circulating metabolites, lipids, and lipoprotein sub-fractions has recently become available in UK Biobank for approximately 121,657 participants. Here, we describe procedures for quality control and removal of technical variation for this biomarker data. We show that technical and biological effects with linear effects on individual biomarkers can combine in a non-linear fashion on (the 61) composite biomarkers and (81) biomarker ratios, and thus composite biomarkers and ratios should be re-derived after removal of unwanted variation to avoid introducing such cryptic effects. We make available an R package, ukbnmr, for extracting the metabolic biomarker data from UK Biobank and removing the unwanted technical variation. We also make available code for re-deriving the 61 composite biomarkers and 81 ratios, and for further derivation of 76 additional biomarker ratios of potential biological significance. Finally, we demonstrate that our removal of technical variation leads to increased signal for genetic and epidemiological studies of the NMR metabolic biomarkers in UK Biobank.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
S.C.R was funded by the National Institute for Health Research (NIHR) Cambridge BRC (BRC-1215-20014). P.S. was supported by a Rutherford Fund Fellowship from the Medical Research Council grant MR/S003746/1. S.A.L. is supported by a Canadian Institutes of Health Research postdoctoral fellowship (MFE-171279). T.B. was funded by the NIHR Blood and Transplant Research Unit in Donor Health and Genomics (NIHR BTRU-2014-10024). L.P. and S.K. were funded by a BHF Programme Grant (RG/18/13/33946). J.D. holds a British Heart Foundation Professorship and a NIHR Senior Investigator Award. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics approval was obtained from the North West Multi-Center Research Ethics Committee. The current analysis was approved under UK Biobank Projects 30418 and 7439
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All data described are available through UK Biobank subject to approval from the UK Biobank access committee. See https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access for further details. Extensive diagnostic plot showing the impact of technical variation and its removal on all biomarkers can be obtained from FigShare at doi: 10.6084/m9.figshare.16671208.