Global Biobank Meta-analysis Initiative: powering genetic discovery across human diseases
Summary
Biobanks are being established across the world to understand the genetic, environmental, and epidemiological basis of human diseases, with the goal of better prevention and treatment. Genome-wide association studies (GWAS) have been very successful at mapping genomic loci for a wide range of human diseases and traits but, in general, lack appropriate representation of diverse ancestries: most biobanks and preceding GWAS are composed of individuals of European ancestries. Here, we introduce the Global Biobank Meta-analysis Initiative (GBMI), a collaborative network of 19 biobanks from 4 continents representing more than 2.1 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWAS generated using harmonized genotypes and phenotypes from member biobanks. GBMI brings together results from GWAS analyses across 6 main ancestry groups: approximately 33,000 individuals of African ancestry, either from Africa or from admixed-ancestry diaspora (AFR), 18,000 admixed American (AMR), 31,000 Central and South Asian (CSA), 341,000 East Asian (EAS), 1.4 million European (EUR), and 1,600 Middle Eastern (MID) individuals. In this flagship project, we generated GWASs across 14 exemplar diseases and endpoints, including both common diseases and less prevalent diseases that were previously understudied. Using the genetic association results, we validate that GWASs conducted in biobanks worldwide can be successfully integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics between biobanks. We demonstrate the value of this collaborative effort to improve GWAS power for diseases, increase representation, benefit understudied diseases, and improve risk prediction. Incorporating gene and protein expression data also enables the nomination of disease genes and drug candidates and provides insight into the underlying biology of the studied traits.
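To illustrate what meta-analyzing GWAS summary statistics involves, the sketch below combines per-biobank effect estimates for a single variant using fixed-effect inverse-variance weighting. This is a minimal example under stated assumptions: the summary above does not specify the meta-analysis method, inverse-variance weighting is simply one common choice, and the function name `ivw_meta` and the input values are hypothetical.

```python
import math


def ivw_meta(betas, ses):
    """Fixed-effect inverse-variance-weighted meta-analysis for one variant.

    betas: per-biobank effect estimates (e.g. log odds ratios)
    ses:   matching standard errors
    Hypothetical helper for illustration; not taken from the GBMI pipeline.
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("betas and ses must be non-empty and of equal length")
    if any(se <= 0 for se in ses):
        raise ValueError("standard errors must be positive")

    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in ses]
    total_weight = sum(weights)

    # Pooled effect is the weighted mean; its variance is 1 / sum of weights.
    beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    se = math.sqrt(1.0 / total_weight)

    # Two-sided p-value from a standard normal test statistic.
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, z, p


if __name__ == "__main__":
    # Made-up summary statistics from three hypothetical biobanks.
    beta, se, z, p = ivw_meta([0.12, 0.08, 0.15], [0.05, 0.04, 0.10])
    print(f"beta={beta:.4f} se={se:.4f} z={z:.2f} p={p:.3g}")
```

Because the pooled standard error shrinks as studies are added, a variant that does not reach significance in any single biobank can do so in the combined analysis, which is the sense in which pooling improves power.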
Competing Interest Statement
M.J.D. is a founder of Maze Therapeutics. B.M.N. is a member of the scientific advisory board at Deep Genomics and a consultant for Camp4 Therapeutics, Takeda Pharmaceutical, and Biogen. The spouse of C.J.W. works at Regeneron Pharmaceuticals. C.Y.C. is employed by Biogen. C.R.G. owns stock in 23andMe, Inc. T.R.G. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. E.E.K. has received speaker fees from Regeneron, Illumina, and 23andMe, and is a member of the advisory board for Galateo Bio. R.E.M. has received speaker fees from Illumina and is a scientific advisor to the Epigenetic Clock Development Foundation. G.D.S. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. K.S. and U.T. are employed by deCODE Genetics/Amgen Inc. J.Z. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization.
Funding Statement
Please refer to Supplementary Note for information regarding individual studies involved in the Global Biobank Meta-analysis Initiative.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Please refer to Supplementary Note for information regarding individual studies involved in the Global Biobank Meta-analysis Initiative.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
* These authors jointly supervised this work.
Data Availability
The all-biobank meta-analysis results and plots for the 14 endpoints (including ancestry-specific, cross-ancestry, and sex-stratified meta-analyses) can be downloaded from https://www.globalbiobankmeta.org/resources and browsed in the PheWeb browser at http://results.globalbiobankmeta.org. Custom scripts used for quality control, meta-analysis, and summary of results are available at https://github.com/globalbiobankmeta. The optimized trans-ancestry and single-ancestry polygenic score weights will be deposited in the PGS Catalog (https://www.pgscatalog.org/).