Abstract
Genome-wide association studies (GWAS) in predominately European-ancestry (EUR) populations have identified numerous genetic variants associated with adiposity-related traits. An emerging challenge is the limited transferability of genetic scores constructed based on GWAS results from one ancestry for trait prediction in other ancestries. We performed trans-ancestry meta-analysis (TAMA) for eight adiposity-related traits using genetic data from 96,124 East Asian (EAS) and 443,359 EUR individuals. We identified >1400 genomic regions significantly associated with one or more traits. Despite EAS comprising only ∼20% of the study population, genetic scores constructed from the trans-ancestry (TA) results accounted for between 30% and 79% more variation in the adiposity traits in EAS compared with scores derived from the EUR GWAS alone. Furthermore, TA scores also modestly improved variance explained in African/African American, Hispanic and South Asian populations. Our findings highlight the utility of TAMA for increasing variance explained by genetic scores across populations of different ancestries.
Competing Interest Statement
MMcC and AM were employees of the University of Oxford when this work was conducted but are currently employees of Genentech and holders of Roche stock. WG is a current employee of Novo Nordisk but was at the University of Oxford when the work was conducted. MVH. was at the University of Oxford when this work was conducted and was supported by a British Heart Foundation Intermediate Clinical Research Fellowship (FS/18/23/33512). MVH is currently employed by 23andMe and holds stock in 23andMe. The spouse of CJW is employed at Regeneron Pharmaceuticals.
Funding Statement
The chief acknowledgment is to the CKB and UKB participants. We also thank the CKB project staff, and the China National Centre for Disease Control and Prevention (CDC) and its regional offices for assisting with the fieldwork The CKB baseline survey and the first re-survey were supported by the Kadoorie Charitable Foundation in Hong Kong. The long-term follow-up has been supported by Wellcome grants to Oxford University (212946/Z/18/Z, 202922/Z/16/Z, 104085/Z/14/Z, 088158/Z/09/Z) and grants from the National Key Research and Development Program of China (2016YFC0900500, 2016YFC0900501, 2016YFC0900504, 2016YFC1303904) and from the National Natural Science Foundation of China (91843302). DNA extraction and genotyping was supported by grants from GlaxoSmithKline and the UK Medical Research Council (MC-PC-13049, MC-PC-14135). The UK Medical Research Council (MC_UU_00017/1,MC_UU_12026/2 MC_U137686851), Cancer Research UK (C16077/A29186; C500/A16896) and the British Heart Foundation (CH/1996001/9454) provide core funding to the Clinical Trial Service Unit and Epidemiological Studies Unit at Oxford University for the project. ZFH was partly funded by a grant from The Wellcome Trust (H5R00150). The computational aspects of this research were supported by the Wellcome Trust Core Award Grant Number 203141/Z/16/Z and the NIHR Oxford BRC. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health. YFL. is supported by the National Health Research Institutes (NP-110-PP-09; NP-111-PP-09) and the Ministry of Science and Technology (110-2314-B-400-028-MY3) of Taiwan. We thank the National Core Facility for Biopharmaceuticals (NCFB, MOST 106-2319-B-492-002) and National Center for High-performance Computing (NCHC) of National Applied Research Laboratories (NARLabs) of Taiwan for providing computational and storage resources. The Mexico City Prospective Study has received funding from the Mexican Health Ministry, the National Council of Science and Technology for Mexico, the Wellcome Trust, Cancer Research UK, British Heart Foundation and the UK Medical Research Council. GH is funded by The Wellcome Trust and Royal Society (208806/Z/17/Z). The UK Medical Research Council and Wellcome (217065/Z/19/Z and WT088806) and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors and RGW and GH will serve as guarantor for the contents of this paper. The BioBank Japan Project was supported by AMED under Grant Number JP21km0605001. LB and BB receive support from the K.G. Jebsen Center for Genetic Epidemiology funded by Stiftelsen Kristian Gerhard Jebsen; Faculty of Medicine and Health Sciences, NTNU; The Liaison Committee for education, research and innovation in Central Norway; and the Joint Research Committee between St Olavs Hospital and the Faculty of Medicine and Health Sciences, NTNU. The Population Architecture Using Genomics and Epidemiology (PAGE) program is funded by the National Human Genome Research Institute (NHGRI) with co-funding from the National Institute on Minority Health and Health Disparities (NIMHD). The contents of this paper are solely the responsibility of the authors and do not necessarily represent the official views of the NIH. The PAGE consortium thanks the staff and participants of all PAGE studies for their important contributions. We thank Rasheeda Williams and Margaret Ginoza for providing assistance with program coordination. The complete list of PAGE members can be found at http://www.pagestudy.org. Assistance with data management, data integration, data dissemination, genotype imputation, ancestry deconvolution, population genetics, analysis pipelines, and general study coordination was provided by the PAGE Coordinating Center (NIH U01HG007419). Genotyping services were provided by the Center for Inherited Disease Research (CIDR). CIDR is fully funded through a federal contract from the National Institutes of Health to The Johns Hopkins University, contract number HHSN268201200008I. Genotype data quality control and quality assurance services were provided by the Genetic Analysis Center in the Biostatistics Department of the University of Washington, through support provided by the CIDR contract. The data and materials included in this report result from collaboration between the following studies and organizations: BioMe Biobank: Samples and data of The Charles Bronfman Institute for Personalized Medicine (IPM) BioMe Biobank used in this study were provided by The Charles Bronfman Institute for Personalized Medicine at the Icahn School of Medicine at Mount Sinai (New York). Phenotype data collection was supported by The Andrea and Charles Bronfman Philanthropies. Funding support for the Population Architecture Using Genomics and Epidemiology (PAGE) IPM BioMe Biobank study was provided through the National Human Genome Research Institute (NIH U01HG007417). HCHS/SOL: Primary funding support to Dr. North and colleagues is provided by U01HG007416. Additional support was provided via R01DK101855 and 15GRNT25880008. The Hispanic Community Health Study/Study of Latinos was carried out as a collaborative study supported by contracts from the National Heart, Lung, and Blood Institute (NHLBI) to the University of North Carolina (N01-HC65233), University of Miami (N01-HC65234), Albert Einstein College of Medicine (N01-HC65235), Northwestern University (N01-HC65236), and San Diego State University (N01-HC65237). The following Institutes/Centers/Offices contribute to the HCHS/SOL through a transfer of funds to the NHLBI: National Institute on Minority Health and Health Disparities, National Institute on Deafness and Other Communication Disorders, National Institute of Dental and Craniofacial Research, National Institute of Diabetes and Digestive and Kidney Diseases, National Institute of Neurological Disorders and Stroke, NIH Institution-Office of Dietary Supplements. MEC: The Multiethnic Cohort study (MEC) characterization of epidemiological architecture is funded through the NHGRI Population Architecture Using Genomics and Epidemiology (PAGE) program (NIH U01 HG007397). The MEC study is funded through the National Cancer Institute U01 CA164973. PAGE Global Reference Panel: The Stanford Global Reference Panel was created by Stanford-contributed samples and comprises multiple datasets from multiple researchers across the world designed to provide a resource for any researchers interested in diverse population data on the Multi-Ethnic Global Array (MEGA), funded by the NHGRI PAGE program (NIH U01HG007419). The authors thank the researchers and research participants who made this dataset available to the community. The specific datasets are: Mexico: Samples of indigenous origin in Oaxaca were kindly provided by Drs. Karla Sandoval Mendoza, Samuel Canizales Quinteros, and Victor Acuñ a Alonzo. Peru: Individuals from a primarily Quechuan and Aymaran-speaking community in Puno, Peru were kindly provided by Drs. Julie Baker and Carlos Bustamante, with funding support from the Burroughs Welcome Fund. Rapa Nui (Easter Island): Samples were kindly provided by Drs. Karla Sandoval Mendoza and Andres Moreno Estrada with funding from the Charles Rosenkranz Prize for Health Care Research in Developing Countries. South Africa: Samples of KhoeSan individuals from the Khomani and Nama communities were kindly provided by Drs. Brenna Henn and Christopher Gignoux with funding from the Morrison Institute for Population and Resource Studies. Honduras and Colombia: Samples from communities in Honduras and Colombia were kindly provided by Dr. Kathleen Barnes (University of Colorado, Denver), Edwin Herraro-Paz (Universidad Católica de Honduras, San Pedro Sula, Honduras), Alvaro Mayorga (Universidad Católica de Honduras, San Pedro Sula, Honduras), Luis Caraballo (University of Cartagena), Javier Marrugo (university of Cartagena) Additional global samples: The following datasets are open access and available through the lab website of Carlos Bustamante (https://bustamantelab.stanford.edu/). The Human Genome Diversity Panel (HGDP-CEPH) is a group of cell lines maintained by the Centre dÉtude du Polymorphisme Humain, Fondation Jean Dausset (Paris, France) comprising 52 diverse populations across the world (Africa, Near East, Europe, South Asia, Central Asia, East Asia, Oceania and the Americas). Additional information on these datasets can be found on the CEPH website (http://www.cephb.fr/en/hgdp_panel.php), or originally at http://www.ncbi.nlm.nih.gov/pubmed/11954565 and http://www.ncbi.nlm.nih.gov/pubmed/12493913, with numerous subsequent publications. Samples were filtered to include the H952 unrelated individuals as published here: http://www.ncbi.nlm.nih.gov/pubmed/17044859. Also available on the Bustamante Lab website is genotype data for the Maasai from Kinyawa, Kenya (MKK) samples maintained by the Coriell Institute for Medical Research (https://catalog.coriell.org/1/NHGRI/Collections/HapMap-Collections/Maasai-in-Kinyawa-Kenya-MKK) and genotyped as part of the International HapMap Project Phase 3(http://hapmap.ncbi.nlm.nih.gov/, http://www.sanger.ac.uk/resources/downloads/human/hapmap3.html). We have genotyped a subset of unrelated individuals using the filters recommended in http://www.ncbi.nlm.nih.gov/pubmed/20869033. WHI: Funding support for the Exonic variants and their relation to complex traits in minorities of the WHI study is provided through the NHGRI PAGE program (NIH U01HG007376). The WHI program is funded by the National Heart, Lung, and Blood Institute, National Institutes of Health, U.S. Department of Health and Human Services through contracts HHSN268201100046C, HHSN268201100001C, HHSN268201100002C, HHSN268201100003C, HHSN268201100004C, and HHSN271201100004C. The authors thank the WHI investigators and staff for their dedication, and the study participants for making the program possible. A listing of WHI investigators can be found at: https://www.whi.org/researchers/Documents%20%20Write%20a%20Paper/WHI%20Investigator%20Short%20List.pdf
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The China Kadoorie Biobank (CKB) complies with all the required ethical standards for medical research on human subjects. Ethical approvals were granted and have been maintained by the relevant institutional ethical research committees in the UK and China. Informed consent was obtained from all participants included in the study.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
CKB phenotype data are available under the CKB Open Access Data Policy to bona fide researchers. Sharing of genotyping data is currently constrained by the Administrative Regulations on Human Genetic Resources of the People’s Republic of China. Access to these and certain other data is available through collaboration with CKB researchers. Full details of the CKB Data Sharing Policy are available at www.ckbiobank.org. The UKB genotype and phenotype data were accessed under project numbers 50474 and 9161 (http://www.ukbiobank.ac.uk/). UKB data can be accessed upon request once a research project has been submitted and approved by the UKB committee.
Abbreviations
- AFR/AA
- African/African American
- BBJ
- Biobank Japan
- BMI
- body mass index
- bp
- base pair
- CKB
- China Kadoorie Biobank
- CHNS
- China Health and Nutrition Survey
- DEG
- differentially expressed genes
- EAS
- East Asian
- EUR
- European
- Fat%
- body fat percentage
- FDR
- false discovery rate
- GCTA
- Genome-wide Complex Trait Analysis
- GWAS
- genome-wide association study
- HE-RE
- Han-Eskin random effects
- HIP
- hip circumference
- HIPadjBMI
- HIP adjusted for BMI
- HIS
- Hispanic
- kb
- kilobase pair
- LD
- linkage disequilibrium
- LDSC
- LD score
- MAF
- minor allele frequency
- MR
- Mendelian randomisation
- PGS
- polygenic score
- SAS
- South Asian
- TA
- trans-ancestry
- TAMA
- trans-ancestry meta-analysis
- TWB
- Taiwan Biobank
- WC
- waist circumference
- WCadjBMI
- WC adjusted for body mass index
- UKB
- UK Biobank
- WHR
- waist-hip ratio
- WHRadjBMI
- WHR adjusted for BMI