Abstract
The prevalence of most complex diseases varies across human populations, and a combination of socioeconomic and biological factors drives these differences. Likewise, divergent evolutionary histories can lead to different genetic architectures of disease, where allele frequencies and linkage disequilibrium patterns at disease-associated loci differ across global populations. However, it is presently unknown how much natural selection contributes to the health inequities of complex polygenic diseases. Here, we focus on ten hereditary diseases with the largest global disease burden in terms of mortality rates (e.g., coronary artery disease, stroke, type 2 diabetes, and lung cancer). Leveraging multiple GWAS and polygenic risk scores for each disease, we examine signatures of selection acting on sets of disease-associated variants. First, on a species level, we find that genomic regions associated with complex diseases are enriched for signatures of background selection. Second, tests of polygenic adaptation incorporating demographic histories of continental super-populations indicate that most complex diseases are primarily governed by neutral evolution. Third, we focus on a finer scale, testing for recent positive selection on a population level. We find that even though some disease-associated loci have undergone recent selection (extreme values of integrated haplotype scores), sets of disease-associated loci are not enriched for selection when compared to baseline distributions of control SNPs. Collectively, we find that recent natural selection has had a negligible role in driving differences in the genetic risk of complex diseases between human populations. These patterns are consistent with the late age of onset of many complex diseases.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported by an NIH MIRA grant to Joseph Lachance (R35GM133727).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
This revised version of our manuscript now includes additional tests of polygenic selection (B-statistics and iHS). It also includes a new method of applying these tests of selection to sets of disease-associated SNPs. We have also enhanced by manuscript by repeating our scans of selection for multiple ascertainment schemes (i.e., we tested whether our conclusions are robust to ancestry of each source GWAS).
Data Availability
The GWAS summary statistics used in this paper are publicly available. Details about specific studies can be found in Table 1.