Polygenic Risk Prediction using Gradient Boosted Trees Captures Non-Linear Genetic Effects and Allele Interactions in Complex Phenotypes
Michael Elgart, Genevieve Lyons, Santiago Romero-Brufau, Nuzulul Kurniansyah, Jennifer A. Brody, Xiuqing Guo, Henry J Lin, Laura Raffield, Yan Gao, Han Chen, Paul de Vries, Donald M. Lloyd-Jones, Leslie A Lange, Gina M Peloso, Myriam Fornage, Jerome I Rotter, Stephen S Rich, Alanna C Morrison, Bruce M Psaty, Daniel Levy, Susan Redline, the NHLBI’s Trans-Omics in Precision Medicine (TOPMed) Consortium, Tamar Sofer
doi: https://doi.org/10.1101/2021.07.09.21260288
Michael Elgart
1Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
2Department of Medicine, Harvard Medical School, Boston, MA, USA
Genevieve Lyons
1Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
3Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Santiago Romero-Brufau
3Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
4Department of Medicine, Mayo Clinic, Rochester, Minnesota, USA
Nuzulul Kurniansyah
1Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
Jennifer A. Brody
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, Washington
Xiuqing Guo
6The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Henry J Lin
6The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Laura Raffield
7Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
Yan Gao
8The Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, USA
Han Chen
9Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
10Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Paul de Vries
9Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Donald M. Lloyd-Jones
11Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
Leslie A Lange
12Department of Medicine, University of Colorado Denver, Anschutz Medical Campus, Aurora, CO, USA
Gina M Peloso
13Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Myriam Fornage
9Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
14Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX
Jerome I Rotter
6The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Stephen S Rich
15Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA, USA
Alanna C Morrison
9Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Bruce M Psaty
16Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services, University of Washington, Seattle, Washington
Daniel Levy
17The Population Sciences Branch of the National Heart, Lung and Blood Institute, Bethesda, MD, United States
18The Framingham Heart Study, Framingham, MA, United States
Susan Redline
1Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
2Department of Medicine, Harvard Medical School, Boston, MA, USA
Tamar Sofer
1Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
2Department of Medicine, Harvard Medical School, Boston, MA, USA
3Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Data Availability
TOPMed freeze 8 WGS data are available by application to dbGaP under the study-specific accessions: FHS: phs000974.v4.p3, JHS: phs000964.v1.p1, MESA: phs001211.v3.p2, CARDIA: phs001612.v1.p1, CFS: phs000954.v3.p2, CHS: phs001368.v2.p1, HCHS/SOL: phs001395.v1.p1, ARIC: phs001416.v2.p1. Study phenotypes are available from dbGaP under the parent study accessions: FHS: phs000007.v32.p13, JHS: phs000286.v6.p2, MESA: phs000209.v13.p3, CARDIA: phs000285.v3.p2, CFS: phs000284.v2.p1, CHS: phs000287.v7.p1, HCHS/SOL: phs000810.v1.p1, ARIC: phs000090.v7.p1.
Posted July 16, 2021.
medRxiv 2021.07.09.21260288; doi: https://doi.org/10.1101/2021.07.09.21260288