Transferability and accuracy of electronic health record-based predictors compared to polygenic scores

Kira E. Detrois; Tuomo Hartonen; Maris Teder-Laving; Bradley Jermy; Kristi Läll; Zhiyu Yang; Estonian Biobank research team, FinnGen; Reedik Mägi; Samuli Ripatti; Andrea Ganna

doi:10.1101/2024.10.08.24315073

Abstract

Electronic health record (EHR)-based phenotype risk scores (PheRS) leverage individuals’ health trajectories to infer disease risk. Similarly, polygenic scores (PGS) use genetic information to estimate disease risk. While PGS generalizability has been previously studied, less is known about PheRS transferability across healthcare systems and whether PheRS provide complementary risk information to PGS.

We trained PheRS to predict the onset of 13 common diseases with high health burden in a total of 845,929 individuals (age 32-70) from 3 biobank-based studies from Finland (FinnGen), the UK (UKB) and Estonia (EstB). The PheRS were based on elastic-net models, incorporating up to 242 diagnoses captured in the EHR up to 10 years before baseline. Individuals were followed up for a maximum of 8 years, during which disease incidence was observed. PGS were calculated for each disease using recent publicly available results from genome-wide association studies.

All 13 PheRS were significantly associated with the diseases of interest. The PheRS trained in different biobanks utilized partially distinct diagnoses, reflecting differences in medical code usage across the countries. Even with the large variability in the prevalence of various diagnoses, most PheRS trained in the UKB or EstB transferred well to FinnGen without re-training. PheRS and PGS were only moderately correlated (Pearson’s r ranging from 0.00 to 0.08), and models including both PheRS and PGS improved onset prediction compared to PGS alone for 8/13 diseases. PheRS was able to identify a subset of individuals at high-risk better than PGS for 8/13 disease.

Our results indicate that EHR-based risk scores and PGS capture largely independent information and provide additive benefits for disease risk prediction. Furthermore, for many diseases the PheRS models transfer well between different EHRs. Given the large availability of EHR, PheRS can provide a complementary tool to PGS for risk stratification.

Introduction

With the advent of large-scale genetic studies and the widespread availability of electronic health record (EHR) data, it is possible to combine these resources to more efficiently predict the risk of a wide range of diseases^1–3. Disease risk estimation can guide the efficient allocation of screening, preventative interventions, and treatments in the early stages of diseases. Two lines of research have emerged in the past years. Some researchers have focused on machine learning approaches for EHR data ^2,4 and showed some promising results in deriving EHR-based predictors for pancreatic cancer⁵ and cardiovascular disease^6–8, among others. Many studies have focused on genetic data. Polygenic scores (PGSs) use combined information from a person’s genome to estimate their genetic risk of developing a specific disease or trait. Numerous studies have examined the predictive ability of PGS across multiple diseases, and there is an extensive discussion about their clinical and public health value^9–24.

EHR and PGS-based prediction models have different strengths and limitations. EHRs allow access to a vast variety of data, including but not limited to disease diagnosis history, laboratory measurements, free text reports, and various socio-economic information²⁵. However, EHRs are also known to be noisy^1,2, and the models are expected to suffer from poor generalizability because of differences in data availability, as well as in clinical and recording practices across healthcare systems^{2,3,5,25–28}. So far, most research has been conducted on a single EHR with limited work on validating the models in different EHR systems and countries^2,27,29,30. Recent studies, however, show promising results when validating EHR-based predictors in a different healthcare system in the US and UK. For example, an EHR-based prediction model trained in a US study (BioMe) outperformed conventional clinical guidelines in predicting coronary artery disease (CAD) susceptibility, and the results could be externally replicated in the UK Biobank (UKB)^7,8. A similar recent study successfully transferred an EHR-based model trained in the BioMe study for the prediction of autoimmune diseases to All of US, another US-based study. Another systematic effort to train deep learning-based prediction models on the UKB EHR data for 1.568 diseases showed that when transferring these models to the All of Us study, 1.347 (85.9%) of the models improved disease onset prediction over a baseline model with age and sex³¹.

PGS are less likely to suffer from measurement errors compared to EHR-based models, however, they are known to be poorly transferable across ancestries, thus risking increasing health disparities^14,32. PGS are also not routinely measured in the healthcare setting, although some healthcare systems have piloted programs to return PGS to individuals^33,34. Further, as PGS keep improving through larger and more representative genome-wide association studies, there is a growing interest in the integration of other predictors and risk factors to better capture the disease risk of individuals. Some studies have been recently published integrating, for example, proteomics^35,36 or metabolomics-based risk scores^37,38 with PGS. Compared to omics, EHR data has the advantage that it is already routinely electronically collected in many countries and does not require invasive and often relatively expensive additional measurements³. Importantly, there is a gap in our understanding of how PGS complement both established clinical risk factors and EHR-based risk scores. Numerous studies have investigated the additive value of PGS with clinical risk factors for a subset of diseases, including type 2 diabetes (T2D) and CAD^9,16,39. For EHR-based risk scores and many other diseases the added benefit of PGS for disease onset prediction and risk stratification remains understudied.

In this study, we aimed to directly compare, within and across studies, the predictive performance and transferability of EHR-based scores vs. PGS using a longitudinal prospective design. We conducted this comparison across 13 common diseases and 3 large biobank-based studies with high-quality EHR: UK Biobank⁴⁰ (UKB, United Kingdom), FinnGen⁴¹ (Finland) and Estonian Biobank (EstB, Estonia)³⁴. We created the EHR-based scores using the PheRS (Phenotype Risk Score) framework^42,43 with PheRS derived from longitudinal diagnostic codes translated into consistent disease diagnoses using phecodes⁴⁴.

Results

Study overview

We included 845,929 individuals (Supplement Table 1) aged 32 to 70 on 01/01/2011 (Figure 1A). These individuals belong to 3 biobank-based studies (FinnGen, UKB, EstB) linked with national registers or EHRs. The individuals gathered a total of 293,019 new diagnoses during an 8-year prediction period (01/01/2011 – 31/12/2018) across 13 common and high-burden diseases: prostate cancer, breast cancer, colorectal cancer, lung cancer, type 2 diabetes (T2D), atrial fibrillation (AF), major depression (MDD), coronary heart disease (CHD), hip osteoarthritis (hip OA), knee osteoarthritis (knee OA), asthma, gout, and epilepsy. We observed the highest number of events for knee OA (N=43,767) and the lowest for lung cancer (N=4,796, Figure 1C, Supplement Table 2).

Figure 1: Study overview.

Panel A Outline of the study design. A separate study is conducted for each of the 13 diseases in the three biobank-based studies. Each study consists of an observation and a prediction period, separated by a 2-year washout period. Each disease’s case and control definitions were based on diagnoses acquired in the prediction period (1/1/2011 – 31/12/2018). We removed all individuals diagnosed before our baseline (1/1/2011) and only considered adults aged 32-70 in 2011 (see Methods for more details). Panel B: We compared the PGSs with PheRSs – trained on phecodes recorded during the observation period (1/1/1999 – 31/12/2008). The PGS were based on recent publicly available GWAS summary statistics using MegaPRS. Ultimately, each individual was assigned 13 different PGS and PheRS scores describing their risk of getting a disease diagnosis during the prediction period. We trained the PheRS on 50% of individuals separately in the three studies (FinnGen, UKB, EstB). In each study, we then used the other half of the population as a test set where we used the scores as predictors in Cox-proportional hazards models⁴⁵ (Cox-PH). Panel C: Number of new diagnoses for each disease during the prediction period (1/1/2011 – 31/12/2018) for each of the 13 diseases in the three cohorts (green: EstB, yellow: FinnGen, brown: UKB). This figure was created with the help of BioRender.com.

Construction of PGS and PheRS

We constructed the PGS and PheRS separately for each disease (Figure 1B). PGS were previously derived by the INTERVENE consortium⁴⁶. PheRS were based on phecodes⁴⁷ recorded during a 10-year observation period (01/01/1999 – 31/12/2009; Figure 1A), separated from the prediction period (01/01/2011 – 31/12/2018) by a 2-year washout period. In total, we considered 242 phecodes with a prevalence of at least 1% in any study. Each PheRS model was trained separately to predict disease occurrence in the prediction period using 50% of the individuals in each study. We used an elastic net model, a type of regularized regression method that combines the properties of both Ridge (L2) and Lasso (L1) regression⁴³. The effect of age, sex and the ten first genetic principal components (PCs) were regressed out from both PheRS and PGS to make the scores comparable. A more detailed description of the PheRS construction can be found in the Methods. Disease prevalence during the prediction period (01/01/2011 – 31/12/2018, Figure 1C) varied substantially across the 3 studies. For example, we found a higher prevalence of knee OA in the EstB (12.3%, N=14,180) compared to FinnGen (4.8%, N=12,874) and the UKB (3.6%, N=16,713), while T2D diagnoses show a lower prevalence both in the EstB (3.6%, N=104,161) and UKB (3.6%, N=16,850) compared to FinnGen (6.8%, N=18,099; Supplement Table 2).

PheRS were significantly associated with all 13 diseases

We evaluated the association between PheRS and 13 diseases independently from age and sex using Cox proportional hazard models (Cox-PH) on a test set in each study. All PheRS were significantly associated (p<0.05) with higher disease risk (Figure 2A, Supplement Table 3) with the largest association for gout (meta-analyzed hazard ratio (HR) per 1 standard deviation (SD) of PheRS=1.55; 95% confidence interval (CI): 1.43-1.67), T2D (HR=1.47; 95% CI: 1.36-1.59), and lung cancer (HR=1.47; 95% CI: 1.39-1.55). Further, adding the PheRS to a baseline model with age and sex significantly (p<0.05; two-tailed p-values based on the z-scores of the c-index differences) improved the predictive accuracy (c-index) in all three studies for 7/13 diseases: asthma, MDD, T2D, knee OA, hip OA, gout, and AF (Figure 2B, Supplement Figure 1A-1, Supplement Table 4). The improvement persisted for 4 of these diseases (asthma, MDD, T2D, knee OA) in all of the three studies when compared to a baseline including additionally highest achieved education level and the Charlson comorbidity index^48,49 (CCI; Supplement Figure 1A-2). Overall, integrating education and CCI only led to minor improvements in the model’s discriminative ability (Supplement Figures 2&3).

Figure 2: PheRS performance across studies.

Panel A: Association between PheRS and disease onset during the prediction period independent of age and sex. The HRs and 95% CIs in each study – FinnGen: yellow, UKB: brown, EstB: green – and meta-analyzed results (red). The HRs are shown for an increase of the PheRS by 1 standard deviation (SD) after regressing out age and sex. Panel B: Increase in predictive accuracy when adding the PheRS to a baseline model with age and sex. The meta-analyzed c-indices and 95% CIs of the baseline model (x-axis) compared to a model with added PheRS (y-axis).

We found that, in FinnGen, all PheRS were correlated, mostly positively, with the total number of phecodes an individual had recorded (Persons’ r ranging from 0.77 for asthma to –0.43 for breast cancer; Supplement Figure 1C). To further test whether this meant that the PheRS are more predictive in older individuals who have had more time to accumulate diagnoses in their EHR, we stratified the FinnGen test set to a younger group aged 32-51 and an older group aged 52-70 years. However, unexpectedly, we found a significantly stronger association of the PheRS in the younger age group for 4/13 disease and only for breast cancer was the relative risk in the older group significantly larger than for the younger group, while no differences were observed in the remaining diseases (Supplement Figure 4).

PheRS transfer well between studies

We examined PheRS transferability by comparing, in FinnGen, externally– and internally-trained PheRS. Externally-trained PheRS were trained on the training set of the UKB and EstB study and tested on the same test set as the FinnGen internally-trained PheRS. Externally-trained PheRS were moderate to strongly correlated with internally-trained PheRS (average Pearsons’ r=0.45, range –0.09-0.74; Figure 3A, Supplement Table 5). Not surprisingly, PheRS that were poor predictors of the disease were also poorly correlated between their internally-trained and externally-trained versions (i.e. colorectal and breast cancer). Most externally-trained PheRS were significantly associated with disease risk in FinnGen (Figure 3B) and showed significant improvements in c-index over age and sex (Supplement Figure 5). In some instances, as in the case of the PheRS models for MDD and CHD trained in the UKB and the gout models trained in the UKB and EstB, externally-trained PheRS c-index improvements were not significantly different from those achieved by the FinnGen internally-trained PheRS. Nonetheless, we observed that most PheRS disease associations were significantly lower with the externally-trained PheRS (Figure 3B).

Figure 3: PheRS validation in FinnGen.

Panel A: Correlation (Persons’ r) between the internally-trained PheRS in FinnGen and the externally-trained PheRS, tested in FinnGen. Externally-trained PheRS were trained in 50% of individuals in UKB (y-axis) and EstB (x-axis). Panel B: Association of FinnGen internally-trained PheRS with each disease compared to the externally-trained models. HRs and 95% CIs of the FinnGen-trained PheRS (x-axis) vs. the externally-trained PheRS (y-axis), with EstB on the left and UKB on the right. The HRs are shown for a 1-SD increase of the PheRS after regressing out age and sex.

Phecode importance varies across studies

Despite good PheRS transferability, we found marked differences in the prevalence of different phecodes between the studies (Supplement Table 6). When considering codes with a prevalence of at least 1%, only 20% of phecodes (N=49) could be observed in all three studies (Figure 4A). These differences can be partially explained by different types of diagnostic information from the EHR available in each study. For example, the inclusion of primary care diagnoses in the EstB study leads to a higher number of phecodes, with 32% (N=77) unique to that study (Figure 4A+B). The FinnGen and UKB studies, on the other hand, only utilized diagnoses from secondary care.

Figure 4: Phecode prevalence and coefficients in each study.

Panel A: A Venn diagram showing the number of phecodes present in each of the three studies and shared between all combinations of the three studies. We only considered phecodes with a prevalence >1% in each study. Yellow color indicates FinnGen-specific codes, brown UKB-specific codes, green EstB-specific codes and black codes present in all three studies. The same color coding applies to panels B and D. Panel B: Phecode prevalences for selected example codes in the three studies. The black dashed line indicates a prevalence of 1%. Panel C: Median of PheRS coefficients over the 13 diseases in each study. Only coefficients used by at least 7/13 models in the studies are shown (see Methods for phecode exclusion rules in the PheRS models). Different colors and the y-axis labels indicate different phecode categories. Black dashed lines correspond to coefficient values of 0. Panel D: A detailed look at all the PheRS coefficients for major depression (MDD) in the three studies. Black color marks common phecodes in the MDD PheRS models across the studies, while other colors indicate biobank-specific codes (yellow=FinnGen, brown=UKB, green=EstB). PheRS coefficients are standardized to 0 mean and 1 standard deviation for each model separately for easier comparison of coefficient importance across the studies.

The set of phecodes unique to each study included important predictors for many of the diseases. To highlight one example in each study, we found neuralgia (code 766) to be among the top 20 predictors for hip OA, CHD, and MDD in the EstB. In FinnGen, schizophrenia (code 295) was an important predictor in T2D, lung cancer, and epilepsy; and in the UKB, tobacco use disorder (code 318) was among the most important predictors for hip OA, T2D, lung cancer, CHD, and MDD. However, other predictors such as hypertension (code 401), overweight (code 278), alcohol abuse (code 317), and peripheral nerve disorders (code 351) were prevalent diagnoses in all three studies and showed a large consistent effect across diseases (Figure 4B+C, Supplement Table 7).

We took a closer look at the top predictors in the individual PheRS models. Figure 4D shows the shared and study-specific predictors in the PheRS models for major depression (MDD). We found that the top predictors in each PheRS captured three main categories: substance abuse, sleep disorders and pain-related problems. The most consistent phecode related to substance abuse in all three studies was alcohol abuse (code 317; FinnGen rank 3, UKB rank 2, and EstB rank 4), while other diagnoses such as tobacco use disorder (code 318) were only captured in the UKB study (rank 3). The most important predictors related to pain disorders in FinnGen were intervertebral disc disorders (code 722, rank 4) and migraine (code 340, rank 5), while in the UKB it was back pain (code 760, rank 4) and in the EstB peripheral nerve disorders (code 351, rank 9) and other headache syndromes (code 229, rank 10). Nevertheless, while the list of most important predictors varied, each of the PheRS models also captured other pain-related diagnoses with lower ranks. Supplement Figure 6 shows, for 6 additional diseases, how common and study-specific phecodes contribute to PheRS prediction.

PGS and PheRS are orthogonal predictors

Finally, we compared the PheRS and corresponding PGS associations. Both were significantly associated with all diseases in the meta-analysis (p<0.05). However, the magnitude of the associations varied across diseases. For 4 out of the 13 diseases (epilepsy, MDD, knee OA, and lung cancer), the PheRS showed a stronger association with the diseases than the PGS, and for 4/13 there was no significant difference (Supplement Figure 7C). However, when looking at the top 10% of most at-risk individuals compared to the 20% at average risk, the PheRS capture the risk better for 8/13 diseases (T2D, gout, lung cancer, asthma, MDD, epilepsy, hip OA, and knee OA; Figure 5A, Supplement Figure 6A). Moreover, PheRS provided additional information on top of PGS. Adding PGS to a model with PheRS, age, and sex led to significant improvements for 10/13 in FinnGen, 3/4 in the UKB, and 6/13 in the EstB (Supplementary Figure 7A). Similarly, adding the PheRS to a model with PGS, age, and sex significantly increased the c-index for 9/13 diseases in FinnGen, 2/4 diseases in the UKB, and 6/13 diseases in the EstB (Supplementary Figure 7B). The number of diseases with significant improvements due to adding the PheRS is similar to that achieved when adding the PheRS to age and sex (Supplement Figure 2A, see Methods and Supplementary Text for more details)

Figure 5: Comparison of PGS and PheRS.

Panel A: Association of PGS (x-axis) and PheRS (y-axis) scores with each disease. The meta-analyzed HRs (95% CI) for the top 10% at risk compared to the average 20% based on the scores after regression out age, sex, and the first 10 PCs. Panel B: Correlation (Persons’ r) between the PheRS and PGS scores separately in each study (FinnGen: yellow, UKB: brown, EstB: green). Due to sample overlap with the GWASs, PGS could only be calculated for 4 diseases in UKB (see Methods for details).

Overall, we found that the EHR data and genetic information capture largely orthogonal information as shown by the low correlation between the two scores (average Pearsons’ r=0.02, range 0.00-0.08, Figure 5B).

Discussion

In this study, we investigated the accuracy and transferability of EHR-based models (PheRS) in predicting the 8-year risk for 13 common diseases in three large biobank-based studies (FinnGen, EstB, and UKB) compared to PGS. Our results highlight the complementarity of PheRS and PGS for a range of diseases, suggesting that combining EHR and genetic data can be an advantageous strategy for the prediction of many common diseases. Both PheRS and PGS were derived to be independent from age and sex effects, thus providing orthogonal information to these two key risk factors. Furthermore, we were able to successfully validate in FinnGen the models trained in EstB and UKB, suggesting that the PheRS models capture relevant risk factors that are not only study– or healthcare system-specific.

While the performance of the PheRS models varied between diseases, the PheRS for asthma, MDD, T2D, knee OA, and gout, in particular, performed well across all three studies. In contrast, colorectal cancer, prostate cancer, and breast cancer PheRS models performed poorly, likely also due to the low case counts in our data. We expected to see low transferability of the PheRS between studies due to differences in clinical and disease coding practices in different countries and healthcare systems. Nonetheless, we found that the PheRS replicated well for many of the diseases although, as expected, most PheRS trained within-study performed better. The good transferability of PheRS was also surprising given the large variability in prevalence of phecodes we found across studies, with only 20% of them observed in all three studies. However, our results are in line with a few previous studies that show that it is possible to create predictors that are generalizable across healthcare systems^6–8,31.

Looking closer at the phecodes prevalent in each study and their importance in the PheRS models, we find that the PheRS models that transfer well from the UKB or EstB to FinnGen utilize both study-specific phecodes, and phecodes shared between the studies. In some cases, such as gout, a large part of the transferability of the models could already be explained by a few major risk factors such as hypertension, high BMI, and diabetes that are consistent in all three studies⁵⁰. In other cases, the relevance of each predictor was more intricate. For example, in UKB, one of the most important phecodes for major depression (MDD) was tobacco use disorder, but this code had very low prevalence in FinnGen. Instead, we found that the phecodes for alcohol abuse (code 317) was a prevalent and important predictor in all three studies. Both alcohol abuse and sleep disorders, another important and prevalent predictor in all the studies, are known complex comorbidities of MDD⁵¹. We hypothesize that many of the different phecodes captured a single underlying risk factor. For example, several different pain-related diagnoses were among the top predictors for MDD, each likely capturing underlying pain problems⁵², with the top predictors differing between models. The elastic net penalty allows a non-zero coefficient for many correlating phecodes, which alleviates the issue of the same underlying medical issue being coded differently in different EHRs. This suggests that leveraging similarity between diagnostic codes is an important aspect in creating transferable EHR-based predictors.

We kept the PheRS approach simple to demonstrate its feasibility. More complex models could further exploit the longitudinal nature of EHR information and utilize other data modalities available in the EHR-systems^5,30,53. Further, by using a 2-year washout period and excluding very closely related conditions from the predictors, we remained conservative in removing co-morbidities directly related to the disease. Without this buffer the performance of the models will likely increase and be more relevant in a clinical context⁶. To improve generalizability of the models we collapsed phecodes into the first three digits to reduce the effect of different diagnostic codes being used in different countries to describe the same underlying phenomenon⁵⁴. For example, in the EstB study phecodes hypertensive heart disease (code 401.21) and essential hypertension (code 401.1) were equally prevalent diagnoses capturing the risk factor hypertension (code 401), while in FinnGen hypertensive heart disease (code 401.21) had a prevalence of <1%. Other approaches could include mapping diagnostic codes to OMOP-concepts, which has been shown to facilitate EHR-based models that transfer between different countries³¹.

Importantly, while we did not exclude individuals based on their genetic ancestry, the UKB still consists mainly, and FinnGen and the EstB almost exclusively, of individuals of European ancestry. Thus, our study does not properly assess the important issue that individuals of different ethnicities face inequalities in healthcare access^55,56. Important open questions for future work, in addition to the generalizability of EHR-based scores for non-European genetic ancestries, include how to optimally model diagnostic codes for best generalizability as well as leveraging data from different and diverse cohorts with for example federated learning approaches⁵⁷.

To our knowledge, correlation between PGS and EHR-based scores have not been comprehensively studied. For CAD, Petrazzini et al. ⁷ found that the inclusion of PGS did not improve prediction compared to an EHR-based score, while Zhao et al. ⁶ found that the inclusion of genetic information significantly improved models with both EHR-based predictors and the gold standard model for CAD risk prediction (ACC/AHA) Pooled Cohort Risk Equations). For 8/13 diseases studied here, we observe a significant improvement in onset prediction when integrating PheRS on top of PGS. While for many of the cancers (colorectal cancer, prostate cancer, and breast cancer), the PGS were more informative, for diseases such as MDD, epilepsy, and knee OA the PheRS better captured the risk. Interestingly, PheRS were specifically better than PGS in capturing high-risk individuals. Individuals in the top 10% of PheRS had higher HR than those in the top 10% of PGS for 8/13 disease, probably reflecting those individuals with key co-morbidities. Further, we observe very low correlation between PGS and PheRS, indicating that these two data sources contain largely independent information that is predictive of disease onset. A few prior studies on the interaction between PGS and selected risk factors found no evidence for interaction^11,51.

Patient’s diagnostic history has always been a key piece of information for medical professionals when considering future treatment. As we are moving towards translating PGS to clinical use, it is worth considering integrating, in a comprehensive manner, also the information about an individual’s diagnosis history which in many countries is already collected in a centralized electronic manner. This would not be a large shift from current practice, as selected comorbidities are used in many clinical risk stratification algorithms, for example, QRisk⁵⁸ for evaluating risk of heart attack or stroke in 10 years, or QDiabetes for evaluating 10 year risk of T2D⁵⁹. A recent study (Steinfeld et al. ⁶⁰), showed that EHR-based models trained specifically to predict risk of five different cardiovascular events performed similarly or better than conventional risk scores (QRISK3, ASCVD and SCORE2)³¹. Similarly, Zhao et al.⁶ found that the machine learning models trained on longitudinal EHR data outperformed the gold standard risk model (ACC/AHA) Pooled Cohort Risk Equations) for the prediction of cardiovascular disease. These comparisons are interesting for diseases with established risk scores. However, for many of the diseases studied here there are no established risk algorithms, making an EHR-based risk stratification approach even more relevant.

In this study we show that, across many diseases and multiple studies with different underlying healthcare systems and EHRs, relatively simple elastic net-based risk scores that consider an individual’s previous diagnosis history can improve disease risk prediction when combined with PGS. Information already available from the EHR provides orthogonal information to PGS and could be a cost-effective approach for risk estimation.

Methods

Study setup

As outlined in Figure 1B, each study consisted of a 10-year observation (6-year for EstBB due to shorter follow-up) and an 8-year prediction period, separated by a 2-year washout period. Each disease’s case and control definitions were based on diagnoses acquired in the 8-year prediction period (2011/01/01 –> 2019/01/01). The ICD-codes used to define the cases for each disease were based on previous harmonization between FinnGen and the EstBB phenotypes by the INTERVENE consortium⁴⁶ (Supplement Table 9). We consider all individuals as controls that are not cases. We only considered adults aged 32-70 in 2011/01/01 and removed all individuals diagnosed with the disease before this time. The lower limit for age of inclusion was chosen due to the inclusion of education level in some of the models and determined based on the median age of obtaining a doctoral degree in the FinnGen dataset. Using this lower limit, most individuals included have finished their highest level of education. Further, we remove all individuals with a diagnosis outside the prediction period (2011/01/01 –> 2019/01/01) and those lost to follow-up before the start of the prediction period. The ICD-codes used to define the cases for each disease and the number of cases and controls in each study are listed in Supplement Tables 9+2.

We included 845,929 individuals (Supplement Table 1) from 3 biobank-based studies: FinnGen⁴¹, UKB⁴⁰ and EstB ³⁴ linked with national registers or EHRs. In FinnGen we used Data Freeze 10, which includes 412,090 individuals, of which 266,179 were aged 32-70 in 2011/01/01. The longitudinal ICD-code diagnoses used to define the phecodes and the case and control status for each disease were based on in– and outpatient hospital register information. The UKB study included 464,076 individuals aged 40-70, with the ICD-code diagnoses based on inpatient information. The EstB study included 199,868 individuals of which 115,674 were aged 32-70. Here we also had primary care data as well as self-reported diagnoses available. More details on the phenotype harmonization can be found from Jermy et al.⁴⁶ and the Supplement Methods.

Predictors

PGS

The PGS were previously computed by the INTERVENE consortium⁴⁶ and based on the recent publicly available genome-wide association study (GWAS) summary statistics, with minimal overlap with our study cohorts (Supplement Table 10) using MegaPRS ⁶¹ with the BLD-LDAK heritability model. For the Cox-PH models we removed individuals from the studies that were part of the GWAS on which the PGS were based. Due to the large overlap with the UKB individuals, we only had PGS for gout, epilepsy, breast and prostate cancer available in the UKB.

PheRS

For the EHR-based models, we trained elastic net models⁴³ on ICD-9 and ICD-10 diagnoses mapped to phecodes. The phecode mapping was based on the v1.2b1 of the phecode map^44,47 from https://phewascatalog.org/, with some manual additions. Since we only considered diagnoses during the observation period starting in 1999, all diagnoses were ICD-10 based in our data. To get the most complete mapping we removed all special characters from the ICD-code and then if we could not find a match in the phecode map, we shortened the code by one digit until it could either be mapped or had to be removed. The complete mapping used can be found from Supplement Table 11. As our target phenotypes were defined based on ICD-codes we exclude predictors part of the exclusion range of the phecodes separately for each phenotype (Supplement Table 12). We only considered phecodes with a prevalence at least 1% of the study population (Supplement Table 6).

We implemented the PheRS using the LogisticRegression function from scikit-learn (version 1.3.2)⁶². We included age (at the start of the prediction period 2011/01/01) and sex as predictors in the PheRS models because they are important predictors and otherwise the models would reconstruct predictors for age and sex using combinations of the phecode diagnoses, which would make interpretation of the phecode coefficient values challenging. Nonetheless, age and sex effect were then regressed out when evaluating the performances of the PheRS (see below). Models were penalized with the elastic net penalty. Predictors were coded as 1/0, where 1=”predictor observed during the observation window” and 0=”predictor not observed during the observation window”, for each disease separately. For training, 50% of the data was used, and this was further divided into training (85%) and hold-out test (15%) sets. Sizes of the training data sets are shown for each disease and study Supplement Table 2. L1 to L2 ratio hyperparameter of the elastic net models was optimized using grid search and 5-fold cross-validation over the range 0.05-0.95 (step size = 0.05), simultaneously with inverse of the regularization strength (C) over possible values: 1e-5, 5e-5, 1e-4, 5e-4, 1e-3, 5e-3, 1e-2, 5e-2, 1e-1, 5e-1, 1. Balanced class weights were used, based on class frequencies in the training data.

Model fitting was done using stochastic average gradient descent. Best L1 to L2 ratio was selected based on the average precision score using 5-fold cross validation on the training split. Missing values of predictors were imputed to the mean of the corresponding predictor in the study-specific training data and all predictors were standardized to zero mean and unit variance on the study-specific training data prior to model fitting. The code for training the PheRS models is available at: https://github.com/intervene-EU-H2020/INTERVENE_PheRS.

The PheRS models trained within the UKB or the EstBB data on 50% of individuals were used to make predictions in FinnGen test set as is without any retraining with FinnGen data. Standardization and imputation were performed based on the biobank-specific training data, meaning that e.g. when assessing the performance of the UKB-trained model in FinnGen, the FinnGen test set data was imputed and standardized based on the feature-specific means and standard deviations from the UKB.

Cox proportional-hazards models

Ultimately, each individual was assigned 13 different PGS and PheRS scores describing their risk of getting a disease diagnosis in the prediction period based on genetic or EHR-based information. To make the PheRS and PGS comparable we regressed out the effect of age, sex and the first 10 genetic PCs from all continuous scores using the residuals from a logistic regression with the score as outcome. Subsequently we scaled all predictors to have a mean of zero and standard deviation of 1. We then used these scores in separate Cox proportional-hazards models (Cox-PH), with the survival time defined as the time from 2011 until either diagnosis, censoring (end of follow-up), or the end of the prediction period.

Additionally, we considered the Charlson-Comorbidity Index (CCI)^48,49 – developed to account for the individual’s overall comorbidity burden – and individuals highest achieved education level in 2011 – an indicator of their socio-economic status. For the CCI we compared the top 10% of individuals with the highest CCI to the rest. The high-risk group included individuals with a CCI>=2 and a few younger ones with a CCI of 1. For the highest education level we mapped each study’s education coding to the 2011 International Standard Classification of Education (ISCED-11; Supplement Table 13) codes. We compared the risk of individuals with basic education (ISCED-11: 1-4) to those who achieved high education levels (ISCED-11: 5-7).

We used the survival⁶³ package in R for creating the Cox-PH models and the Hmisc⁶⁴ package to calculate the c-indices and 95% CIs. For a Cox-PH model with binary outcomes, the predicted survival times can be shown to be equal to the survival probability, so the c-index is equivalent to the area under the curve of the receiver operating characteristic curve (AUCROC)^65,66. The meta-analysis of the HRs and c-indices was performed using the metafor^67,68 package in R with a random effects model. We used two-tailed p-values based on the z-scores to compare the difference in HR magnitude and significant increases in the c-index.

Comparison of phecode coefficients between different PheRS models

The elastic net hyperparameters were separately optimized for each PheRS model. This means that the absolute magnitudes of the coefficients for phecodes are not comparable between different PheRS. However, the relative importances of phecodes can still be compared, i.e. whether for example the same phecodes are among the most important predictors in two different PheRS. To make visualization of the phecode importances in different PheRS clearer, we standardized the coefficients of each PheRS separately to a mean of 0 and a standard deviation of 1 for the display items. Further, in each study we ranked the phecodes in descending order by the PheRS coefficient values and assigned them ascending ranks. Thus, a lower rank indicates a higher PheRS coefficient in the model. Both the unscaled PheRS coefficients and ranks are Supplement Table 7.

Author contributions

KED and TH contributed equally. KED, TH and AG wrote the manuscript with significant input from all the other authors. TH and KED wrote the PheRS code, KED wrote the Cox model code, BJ contributed the PGS. KED and TH performed the analyses in FinnGen, TH, KED in the UK Biobank and MTL and KL in the Estonian Biobank. KED combined the results and created the figures together with TH. KED, TH and ZY preprocessed the FinnGen and the UK Biobank data, MTL and KL preprocessed the Estonian Biobank data. TH, BJ and AG developed the original idea. AG, SR and RM supervised the study.

Data and code availability

The code for PheRS model training is available at https://github.com/intervene-EU-H2020/INTERVENE_PheRS and for the Cox-PH models as well as the final analysis of results at https://github.com/intervene-EU-H2020/onset_prediction.

The individual-level data in these studies is protected for data privacy, access is regulated through the biobanks. The Finnish biobank data can be accessed through the Fingenious^® services (https://site.fingenious.fi/en/) managed by FINBB. Researchers interested in EstBB can request access at https://www.geenivaramu.ee/en/access-biobank and the UKB data are available through a procedure described at http://www.ukbiobank.ac. The GWAS data used in this study are available in the GWAS catalog database under accession codes listed in Supplement Table 10. The PGS scores generated in this study are available in the PGS Catalog under publication ID: PGP000618 and score IDs: PGS004869-PGS004886.

Ethics declarations

Patients and control subjects in FinnGen provided informed consent for biobank research, based on the Finnish Biobank Act. Alternatively, separate research cohorts, collected prior to the Finnish Biobank Act came into effect (in September 2013) and the start of FinnGen (August 2017), were collected based on study-specific consents and later transferred to the Finnish Biobanks after approval by Fimea (Finnish Medicines Agency), the National Supervisory Authority for Welfare and Health. Recruitment protocols followed the biobank protocols approved by Fimea. The Coordinating Ethics Committee of the Hospital District of Helsinki and Uusimaa (HUS) statement number for the FinnGen study is Nr HUS/990/2017. The FinnGen study is approved by Finnish Institute for Health and Welfare (permit numbers: THL/2031/6.02.00/2017, THL/1101/5.05.00/2017, THL/341/6.02.00/2018, THL/2222/6.02.00/2018, THL/283/6.02.00/2019, THL/1721/5.05.00/2019 and THL/1524/5.05.00/2020), digital and population data service agency (permit numbers: VRK43431/2017-3, VRK/6909/2018-3, VRK/4415/2019-3), the Social Insurance Institution (permit numbers: KELA 58/522/2017, KELA 131/522/2018, KELA 70/522/2019, KELA 98/522/2019, KELA 134/522/2019, KELA 138/522/2019, KELA 2/522/2020, KELA 16/522/2020), Findata permit numbers THL/2364/14.02/2020, THL/4055/14.06.00/2020, THL/3433/14.06.00/2020, THL/4432/14.06/2020, THL/5189/14.06/2020, THL/5894/14.06.00/2020, THL/6619/14.06.00/2020, THL/209/14.06.00/2021, THL/688/14.06.00/2021, THL/1284/14.06.00/2021, THL/1965/14.06.00/2021, THL/5546/14.02.00/2020, THL/2658/14.06.00/2021, THL/4235/14.06.00/2021, Statistics Finland (permit numbers: TK-53-1041-17 and TK/143/07.03.00/2020 (earlier TK-53-90-20) TK/1735/07.03.00/2021, TK/3112/07.03.00/2021) and Finnish Registry for Kidney Diseases permission/extract from the meeting minutes on 4th July 2019. The Biobank Access Decisions for FinnGen samples and data utilized in FinnGen Data Freeze 10 include: THL Biobank BB2017_55, BB2017_111, BB2018_19, BB_2018_34, BB_2018_67, BB2018_71, BB2019_7, BB2019_8, BB2019_26, BB2020_1, BB2021_65, Finnish Red Cross Blood Service Biobank 7.12.2017, Helsinki Biobank HUS/359/2017, HUS/248/2020, HUS/150/2022 § 12, §13, §14, §15, §16, §17, §18, and §23, Auria Biobank AB17-5154 and amendment #1 (August 17 2020) and amendments BB_2021-0140, BB_2021-0156 (August 26 2021, Feb 2 2022), BB_2021-0169, BB_2021-0179, BB_2021-0161, AB20-5926 and amendment #1 (April 23 2020)and it’s modification (Sep 22 2021), Biobank Borealis of Northern Finland_2017_1013, 2021_5010, 2021_5018, 2021_5015, 2021_5023, 2021_5017, 2022_6001, Biobank of Eastern Finland 1186/2018 and amendment 22 § /2020, 53§/2021, 13§/2022, 14§/2022, 15§/2022, Finnish Clinical Biobank Tampere MH0004 and amendments (21.02.2020 and 06.10.2020), §8/2021, §9/2022, §10/2022, §12/2022, §20/2022, §21/2022, §22/2022, §23/2022, Central Finland Biobank 1-2017, and Terveystalo Biobank STB 2018001 and amendment 25th Aug 2020, Finnish Hematological Registry and Clinical Biobank decision 18th June 2021, Arctic Biobank P0844: ARC_2021_1001.

Ethics approval for the UK Biobank study was obtained from the North West Centre for Research Ethics Committee (11/NW/0382). UK Biobank data used in this study were obtained under approved application 78537.

The activities of the EstBB are regulated by the Human Genes Research Act, which was adopted in 2000 specifically for the operations of the EstBB. Individual level data analysis in the EstBB was carried out under ethical approval 1.1-12/624 from the Estonian Committee on Bioethics and Human Research (Estonian Ministry of Social Affairs), using data according to release application S22, document number 6-7/GI/16259 from the EstBB.

Andrea Ganna is the founder of Real World Genetics Oy. Bradley Jermy became an employee of BioMarin after his part of this work was completed. Kristi Läll has participated as an analyst in a collaboration research project at the Institute of Genomics, University of Tartu, which was funded by Geneto OÜ. No other authors have competing interests to declare.

Acknowledgements

We want to acknowledge the participants and investigators of the FinnGen, UK Biobank and the Estonian Biobank studies.

This study has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 101016775. This Estonian Biobank study was funded by the European Union through the European Regional Development Fund Project No. 2014-2020.4.01.15-0012 GENTRANSMED. A.G. has received funding from the European Union’s Horizon 2020 research and innovation programme under grant no. 101016775, the European Research Council under the European Union’s Horizon 2020 research and innovation programme (grant number 945733) and from Academy of Finland fellowship grant no. 323116.

We want to acknowledge the participants and investigators of the FinnGen study listed in Supplement Figure 14. The FinnGen project is funded by two grants from Business Finland (HUS 4685/31/2016 and UH 4386/31/2016) and the following industry partners: AbbVie Inc., AstraZeneca UK Ltd, Biogen MA Inc., Bristol Myers Squibb (and Celgene Corporation & Celgene International II Sàrl), Genentech Inc., Merck Sharp & Dohme LCC, Pfizer Inc., GlaxoSmithKline Intellectual Property Development Ltd., Sanofi US Services Inc., Maze Therapeutics Inc., Janssen Biotech Inc, Novartis AG, and Boehringer Ingelheim International GmbH. Following biobanks are acknowledged for delivering biobank samples to FinnGen: Auria Biobank (www.auria.fi/biopankki), THL Biobank (www.thl.fi/biobank), Helsinki Biobank (www.helsinginbiopankki.fi), Biobank Borealis of Northern Finland (https://www.ppshp.fi/Tutkimus-ja-opetus/Biopankki/Pages/Biobank-Borealis-briefly-in-English.aspx), Finnish Clinical Biobank Tampere (www.tays.fi/en-US/Research_and_development/Finnish_Clinical_Biobank_Tampere), Biobank of Eastern Finland (www.ita-suomenbiopankki.fi/en), Central Finland Biobank (www.ksshp.fi/fi-FI/Potilaalle/Biopankki), Finnish Red Cross Blood Service Biobank (www.veripalvelu.fi/verenluovutus/biopankkitoiminta), Terveystalo Biobank (www.terveystalo.com/fi/Yritystietoa/Terveystalo-Biopankki/Biopankki/) and Arctic Biobank (https://www.oulu.fi/en/university/faculties-and-units/faculty-medicine/northern-finland-birth-cohorts-and-arctic-biobank). All Finnish Biobanks are members of BBMRI.fi infrastructure (https://www.bbmri-eric.eu/national-nodes/finland/). Finnish Biobank Cooperative –FINBB (https://finbb.fi/) is the coordinator of BBMRI-ERIC operations in Finland.

We thank participants and scientists involved in making the UK Biobank resource available (http://www.ukbiobank.ac.uk/). This research has been conducted using the UKB resource under approved application number 78537.

The EstBB research team received funding from the Estonian Research Council grant TT17 “Estonian Centre for Genomics”. Data analysis was carried out in part in the High-Performance Computing Center of the University of Tartu. K.L. and R.M. received funding from the Estonian Research Council grant PUT (PRG1911) and the Estonian Research Council grant TK (TK214).

We acknowledge CSC—IT Center for Science, Finland, for computational resources.

The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

References

1.↵
Johnson, K. B. et al. Precision Medicine, AI, and the Future of Personalized Health Care. Clin. Transl. Sci. 14, 86–93 (2021).
OpenUrl CrossRef Google Scholar
2.↵
Shickel, B., Tighe, P. J., Bihorac, A. & Rashidi, P. Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis. IEEE J. Biomed. Health Inform. 22, 1589–1604 (2018).
OpenUrl Google Scholar
3.↵
Tang, A. S. et al. Harnessing EHR data for health research. Nat. Med. 1–9 (2024) doi:10.1038/s41591-024-03074-8.
OpenUrl CrossRef Google Scholar
4.↵
Ayala Solares, J. R., et al. Deep learning for electronic health records: A comparative review of multiple deep neural architectures. J. Biomed. Inform. 101, 103337 (2020).
OpenUrl CrossRef PubMed Google Scholar
5.↵
Placido, D. et al. A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories. Nat. Med. 29, 1113–1122 (2023).
OpenUrl CrossRef PubMed Google Scholar
6.↵
Zhao, J. et al. Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction. Sci. Rep. 9, (2019).
Google Scholar
7.↵
Petrazzini, B. O. et al. Coronary Risk Estimation Based on Clinical Data in Electronic Health Records. J. Am. Coll. Cardiol. 79, 1155–1166 (2022).
OpenUrl PubMed Google Scholar
8.↵
Forrest, I. S. et al. Machine learning-based marker for coronary artery disease: derivation and validation in two longitudinal cohorts. Lancet Lond. Engl. 401, 215–225 (2023).
OpenUrl CrossRef Google Scholar
9.↵
Lambert, S. A., Abraham, G. & Inouye, M. Towards clinical utility of polygenic risk scores. Hum. Mol. Genet. 28, R133–R142 (2019).
OpenUrl CrossRef PubMed Google Scholar
10.
Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018).
OpenUrl CrossRef PubMed Google Scholar
11.↵
Lewis, C. M. & Vassos, E. Polygenic risk scores from research tools to clinical instruments. Genome Med. 12, 44 (2020).
OpenUrl PubMed Google Scholar
12.
Jiang, X., Holmes, C. & McVean, G. The impact of age on genetic risk for common diseases. PLOS Genet. 17, e1009723 (2021).
OpenUrl CrossRef Google Scholar
13.
Patel, A. P. & Khera, A. V. Advances and Applications of Polygenic Scores for Coronary Artery Disease. Annu. Rev. Med. 74, 141–154 (2023).
OpenUrl CrossRef Google Scholar
14.↵
Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584–591 (2019).
OpenUrl CrossRef PubMed Google Scholar
15.
Marston, N. A. et al. A polygenic risk score predicts atrial fibrillation in cardiovascular disease. Eur. Heart J. 44, 221–231 (2023).
OpenUrl Google Scholar
16.↵
Mars, N. et al. Polygenic and clinical risk scores and their impact on age at onset and prediction of cardiometabolic diseases and common cancers. Nat. Med. 26, 549–557 (2020).
OpenUrl PubMed Google Scholar
17.
Tamlander, M. et al. Integration of questionnaire-based risk factors improves polygenic risk scores for human coronary heart disease and type 2 diabetes. Commun. Biol. 5, 158 (2022).
OpenUrl Google Scholar
18.
Tamlander, M. et al. Genome-wide polygenic risk scores for colorectal cancer have implications for risk-based screening. Br. J. Cancer 130, 651–659 (2024).
OpenUrl Google Scholar
19.
Wong, C. K. et al. Polygenic risk scores for cardiovascular diseases and type 2 diabetes. PLOS ONE 17, e0278764 (2022).
OpenUrl CrossRef Google Scholar
20.
Siltari, A. et al. How Well do Polygenic Risk Scores Identify Men at High Risk for Prostate Cancer? Systematic Review and Meta-Analysis. Clin. Genitourin. Cancer 21, 316.e1–316.e11 (2023).
OpenUrl Google Scholar
21.
Martin, A. R., Daly, M. J., Robinson, E. B., Hyman, S. E. & Neale, B. M. Predicting Polygenic Risk of Psychiatric Disorders. Biol. Psychiatry 86, 97–109 (2019).
OpenUrl Google Scholar
22.
Roberts, E., Howell, S. & Evans, D. G. Polygenic risk scores and breast cancer risk prediction. The Breast 67, 71–77 (2023).
OpenUrl Google Scholar
23.
Knowles, J. W. & Ashley, E. A. Cardiovascular disease: The rise of the genetic risk score. PLOS Med. 15, e1002546 (2018).
OpenUrl CrossRef PubMed Google Scholar
24.↵
Torkamani, A., Wineinger, N. E. & Topol, E. J. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. 19, 581–590 (2018).
OpenUrl CrossRef PubMed Google Scholar
25.↵
Beesley, L. J. et al. The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities. Stat. Med. 39, 773–800 (2020).
OpenUrl Google Scholar
26.
Botsis, T., Hartvigsen, G., Chen, F. & Weng, C. Secondary Use of EHR: Data Quality Issues and Informatics Opportunities. Summit Transl. Bioinforma. 2010, 1–5 (2010).
OpenUrl Google Scholar
27.↵
Rajkomar, A. et al. Scalable and accurate deep learning with electronic health records. NPJ Digit. Med. 1, 18 (2018).
OpenUrl Google Scholar
28.↵
Shi, X., Li, X. & Cai, T. Spherical Regression Under Mismatch Corruption With Application to Automated Knowledge Translation. J. Am. Stat. Assoc. 116, 1953–1964 (2021).
OpenUrl Google Scholar
29.↵
Wornow, M. et al. The shaky foundations of large language models and foundation models for electronic health records. Npj Digit. Med. 6, 1–10 (2023).
OpenUrl CrossRef Google Scholar
30.↵
Xie, F. et al. Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies. J. Biomed. Inform. 126, 103980 (2022).
Google Scholar
31.↵
Steinfeldt, J. et al. Medical history predicts phenome-wide disease onset and enables the rapid response to emerging health threats. Nat. Commun. 15, 4257 (2024).
OpenUrl Google Scholar
32.↵
Mars, N. et al. Genome-wide risk prediction of common diseases across ancestries in one million people. Cell Genomics 2, 100118 (2022).
Google Scholar
33.↵
Sabatello, M. et al. Return of polygenic risk scores in research: Stakeholders’ views on the eMERGE-IV study. Hum. Genet. Genomics Adv. 5, (2024).
Google Scholar
34.↵
Leitsalu, L., et al. Cohort Profile: Estonian Biobank of the Estonian Genome Center, University of Tartu. Int. J. Epidemiol. 44, 1137–1147 (2015).
OpenUrl CrossRef PubMed Google Scholar
35.↵
Ritchie, S. C. et al. Integrative analysis of the plasma proteome and polygenic risk of cardiometabolic diseases. Nat. Metab. 3, 1476–1483 (2021).
OpenUrl Google Scholar
36.↵
Møller, P. L. et al. Combining Polygenic and Proteomic Risk Scores With Clinical Risk Factors to Improve Performance for Diagnosing Absence of Coronary Artery Disease in Patients With de novo Chest Pain. Circ. Genomic Precis. Med. 16, 442–451 (2023).
OpenUrl Google Scholar
37.↵
Nightingale Health Biobank Collaborative Group et al. Metabolomic and genomic prediction of common diseases in 477,706 participants in three national biobanks. Preprint at doi:10.1101/2023.06.09.23291213 (2023).
OpenUrl Abstract/FREE Full Text Google Scholar
38.↵
Aguilar, O. T., Chang, C., Bismuth, E. & Rivas, M. A. Integrative machine learning approaches for predicting disease risk using multi-omics data from the UK Biobank. 2024.04.16.589819 Preprint at doi:10.1101/2024.04.16.589819 (2024).
OpenUrl Abstract/FREE Full Text Google Scholar
39.↵
Mohsen, F., Al-Absi, H. R. H., Yousri, N. A., El Hajj, N. & Shah, Z. A scoping review of artificial intelligence-based methods for diabetes risk prediction. Npj Digit. Med. 6, 1–15 (2023).
OpenUrl CrossRef Google Scholar
40.↵
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
OpenUrl CrossRef PubMed Google Scholar
41.↵
Kurki, M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508–518 (2023).
OpenUrl CrossRef Google Scholar
42.↵
Bastarache, L. et al. Phenotype risk scores identify patients with unrecognized Mendelian disease patterns. Science 359, 1233–1239 (2018).
OpenUrl Abstract/FREE Full Text Google Scholar
43.↵
Lebovitch, D. S., Johnson, J. S., Dueñas, H. R. & Huckins, L. M. Phenotype Risk Scores: moving beyond ‘cases’ and ‘controls’ to classify psychiatric disease in hospital-based biobanks. 2021.01.25.21249615 Preprint at doi:10.1101/2021.01.25.21249615 (2021).
OpenUrl Abstract/FREE Full Text Google Scholar
44.↵
Wu, P. et al. Mapping ICD-10 and ICD-10-CM Codes to Phecodes: Workflow Development and Initial Evaluation. JMIR Med. Inform. 7, e14325 (2019).
OpenUrl CrossRef Google Scholar
45.↵
Cox, D. R. Regression Models and Life-Tables. J. R. Stat. Soc. Ser. B Methodol. 34, 187–202 (1972).
OpenUrl Google Scholar
46.↵
Jermy, B. et al. A unified framework for estimating country-specific cumulative incidence for 18 diseases stratified by polygenic risk. Nat. Commun. 15, 5007 (2024).
OpenUrl CrossRef Google Scholar
47.↵
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinforma. Oxf. Engl. 26, 1205–1210 (2010).
OpenUrl Google Scholar
48.↵
Deyo, R. A., Cherkin, D. C. & Ciol, M. A. Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases. J. Clin. Epidemiol. 45, 613–619 (1992).
OpenUrl CrossRef PubMed Web of Science Google Scholar
49.↵
Charlson, M. E., Pompei, P., Ales, K. L. & MacKenzie, C. R. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J. Chronic Dis. 40, 373–383 (1987).
OpenUrl CrossRef PubMed Web of Science Google Scholar
50.↵
Singh, J. A. & Gaffo, A. Gout epidemiology and comorbidities. Semin. Arthritis Rheum. 50, S11–S16 (2020).
OpenUrl CrossRef PubMed Google Scholar
51.↵
Zhao, Y. et al. The brain structure, immunometabolic and genetic mechanisms underlying the association between lifestyle and depression. Nat. Ment. Health 1, 736–750 (2023).
OpenUrl Google Scholar
52.↵
Frediani, F. & Villani, V. Migraine and depression. Neurol. Sci. 28, S161–S165 (2007).
OpenUrl CrossRef PubMed Web of Science Google Scholar
53.↵
Kline, A. et al. Multimodal machine learning in precision health: A scoping review. Npj Digit. Med. 5, 1–14 (2022).
OpenUrl Google Scholar
54.↵
Kiser, A. C. et al. Standard Vocabularies to Improve Machine Learning Model Transferability With Electronic Health Record Data: Retrospective Cohort Study Using Health Care–Associated Infection. JMIR Med. Inform. 10, e39057 (2022).
OpenUrl Google Scholar
55.↵
Fiscella, K. & Sanders, M. R. Racial and Ethnic Disparities in the Quality of Health Care. Annu. Rev. Public Health 37, 375–394 (2016).
OpenUrl CrossRef PubMed Google Scholar
56.↵
Mahajan, S. et al. Trends in Differences in Health Status and Health Care Access and Affordability by Race and Ethnicity in the United States, 1999-2018. JAMA 326, 637–648 (2021).
OpenUrl CrossRef PubMed Google Scholar
57.↵
Antunes, R. S., André da Costa, C., Küderle, A., Yari, I. A. & Eskofier, B. Federated Learning for Healthcare: Systematic Review and Architecture Proposal. ACM Trans Intell Syst Technol 13, 54:1–54:23 (2022).
OpenUrl Google Scholar
58.↵
Hippisley-Cox, J., Coupland, C. & Brindle, P. Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study. BMJ j2099 (2017) doi:10.1136/bmj.j2099.
OpenUrl Abstract/FREE Full Text Google Scholar
59.↵
Hippisley-Cox, J. & Coupland, C. Development and validation of QDiabetes-2018 risk prediction algorithm to estimate future risk of type 2 diabetes: cohort study. BMJ 359, j5019 (2017).
OpenUrl Abstract/FREE Full Text Google Scholar
60.↵
Goldstein, B. A., Navar, A. M., Pencina, M. J. & Ioannidis, J. P. A. Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review. J. Am. Med. Inform. Assoc. JAMIA 24, 198–208 (2017).
OpenUrl Google Scholar
61.↵
Zhang, Q., Privé, F., Vilhjálmsson, B. & Speed, D. Improved genetic prediction of complex traits from individual-level data or summary statistics. Nat. Commun. 12, 4192 (2021).
OpenUrl Google Scholar
62.↵
Pedregosa, F. et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. (2011).
Google Scholar
63.↵
Therneau, T. M., until 2009), T. L. (original S.->R port and R. maintainer, Elizabeth, A. & Cynthia, C. survival: Survival Analysis. (2024).
Google Scholar
64.↵
Jr, F. E. H. & functions), C. D. (contributed several functions and maintains latex. Hmisc: Harrell Miscellaneous. (2024).
Google Scholar
65.↵
Harrell Jr., F. E., Lee, K. L. & Mark, D. B. Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors. Stat. Med. 15, 361–387 (1996).
OpenUrl CrossRef PubMed Web of Science Google Scholar
66.↵
Pencina, M. J. & D’Agostino, R. B. Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation. Stat. Med. 23, 2109–2123 (2004).
OpenUrl CrossRef PubMed Web of Science Google Scholar
67.↵
Viechtbauer, W. Conducting Meta-Analyses in R with the metafor Package. J. Stat. Softw. 36, (2010).
Google Scholar
68.↵
Viechtbauer, W. metafor: Meta-Analysis Package for R. (2024).
Google Scholar

Posted October 08, 2024.

Download PDF

Author Declarations

Supplementary Material

Data/Code

Citation Tools

Get QR code

Tweet Widget

Subject Area

Genetic and Genomic Medicine

Reviews and Context

Comment

TRIP Peer Reviews

Community Reviews

Automated Services

Blogs/Media

Author Videos

Subject Areas

All Articles

Addiction Medicine (423)
Allergy and Immunology (746)
Anesthesia (219)
Cardiovascular Medicine (3235)
Dentistry and Oral Medicine (357)
Dermatology (270)
Emergency Medicine (476)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1151)
Epidemiology (13254)
Forensic Medicine (19)
Gastroenterology (891)
Genetic and Genomic Medicine (5066)
Geriatric Medicine (471)
Health Economics (773)
Health Informatics (3190)
Health Policy (1129)
Health Systems and Quality Improvement (1173)
Hematology (423)
HIV/AIDS (1005)
Infectious Diseases (except HIV/AIDS) (14545)
Intensive Care and Critical Care Medicine (902)
Medical Education (468)
Medical Ethics (126)
Nephrology (513)
Neurology (4814)
Nursing (254)
Nutrition (716)
Obstetrics and Gynecology (870)
Occupational and Environmental Health (782)
Oncology (2484)
Ophthalmology (703)
Orthopedics (279)
Otolaryngology (337)
Pain Medicine (320)
Palliative Medicine (89)
Pathology (531)
Pediatrics (1281)
Pharmacology and Therapeutics (542)
Primary Care Research (550)
Psychiatry and Clinical Psychology (4133)
Public and Global Health (7375)
Radiology and Imaging (1670)
Rehabilitation Medicine and Physical Therapy (993)
Respiratory Medicine (971)
Rheumatology (473)
Sexual and Reproductive Health (491)
Sports Medicine (416)
Surgery (534)
Toxicology (69)
Transplantation (233)
Urology (199)

Comments

medRxiv aims to provide a venue for anyone to comment on a medRxiv preprint. Comments are moderated for offensive or irrelevant content (this can take ~24 h). Please avoid duplicate submissions and read our Comment Policy before commenting. The content of a comment is not endorsed by medRxiv.

medRxiv aims to inform readers about online discussion of this preprint occurring elsewhere. The content at the links below is not endorsed by either medRxiv or the preprint's authors.

Community reviews for this article:

There are no community reviews for this paper.

Automated Evaluations

Certain services provide automated analysis of preprints. Analyses invited by the authors are displayed at the top of this tab. Those done independently of authors are shown underneath . None of these analyses is endorsed by medRxiv.

Automated Evaluations:

There are no automated evaluations for this paper.

[1] 1.↵
Johnson, K. B. et al. Precision Medicine, AI, and the Future of Personalized Health Care. Clin. Transl. Sci. 14, 86–93 (2021).
OpenUrl CrossRef Google Scholar

[2] 2.↵
Shickel, B., Tighe, P. J., Bihorac, A. & Rashidi, P. Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis. IEEE J. Biomed. Health Inform. 22, 1589–1604 (2018).
OpenUrl Google Scholar

[3] 3.↵
Tang, A. S. et al. Harnessing EHR data for health research. Nat. Med. 1–9 (2024) doi:10.1038/s41591-024-03074-8.
OpenUrl CrossRef Google Scholar

[4] 4.↵
Ayala Solares, J. R., et al. Deep learning for electronic health records: A comparative review of multiple deep neural architectures. J. Biomed. Inform. 101, 103337 (2020).
OpenUrl CrossRef PubMed Google Scholar

[5] 5.↵
Placido, D. et al. A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories. Nat. Med. 29, 1113–1122 (2023).
OpenUrl CrossRef PubMed Google Scholar

[6] 6.↵
Zhao, J. et al. Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction. Sci. Rep. 9, (2019).
Google Scholar

[7] 7.↵
Petrazzini, B. O. et al. Coronary Risk Estimation Based on Clinical Data in Electronic Health Records. J. Am. Coll. Cardiol. 79, 1155–1166 (2022).
OpenUrl PubMed Google Scholar

[8] 8.↵
Forrest, I. S. et al. Machine learning-based marker for coronary artery disease: derivation and validation in two longitudinal cohorts. Lancet Lond. Engl. 401, 215–225 (2023).
OpenUrl CrossRef Google Scholar

[9] 9.↵
Lambert, S. A., Abraham, G. & Inouye, M. Towards clinical utility of polygenic risk scores. Hum. Mol. Genet. 28, R133–R142 (2019).
OpenUrl CrossRef PubMed Google Scholar

[10] 10.
Khera, A. V. et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 50, 1219–1224 (2018).
OpenUrl CrossRef PubMed Google Scholar

[11] 11.↵
Lewis, C. M. & Vassos, E. Polygenic risk scores from research tools to clinical instruments. Genome Med. 12, 44 (2020).
OpenUrl PubMed Google Scholar

[12] 12.
Jiang, X., Holmes, C. & McVean, G. The impact of age on genetic risk for common diseases. PLOS Genet. 17, e1009723 (2021).
OpenUrl CrossRef Google Scholar

[13] 13.
Patel, A. P. & Khera, A. V. Advances and Applications of Polygenic Scores for Coronary Artery Disease. Annu. Rev. Med. 74, 141–154 (2023).
OpenUrl CrossRef Google Scholar

[14] 14.↵
Martin, A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584–591 (2019).
OpenUrl CrossRef PubMed Google Scholar

[15] 15.
Marston, N. A. et al. A polygenic risk score predicts atrial fibrillation in cardiovascular disease. Eur. Heart J. 44, 221–231 (2023).
OpenUrl Google Scholar

[16] 16.↵
Mars, N. et al. Polygenic and clinical risk scores and their impact on age at onset and prediction of cardiometabolic diseases and common cancers. Nat. Med. 26, 549–557 (2020).
OpenUrl PubMed Google Scholar

[17] 17.
Tamlander, M. et al. Integration of questionnaire-based risk factors improves polygenic risk scores for human coronary heart disease and type 2 diabetes. Commun. Biol. 5, 158 (2022).
OpenUrl Google Scholar

[18] 18.
Tamlander, M. et al. Genome-wide polygenic risk scores for colorectal cancer have implications for risk-based screening. Br. J. Cancer 130, 651–659 (2024).
OpenUrl Google Scholar

[19] 19.
Wong, C. K. et al. Polygenic risk scores for cardiovascular diseases and type 2 diabetes. PLOS ONE 17, e0278764 (2022).
OpenUrl CrossRef Google Scholar

[20] 20.
Siltari, A. et al. How Well do Polygenic Risk Scores Identify Men at High Risk for Prostate Cancer? Systematic Review and Meta-Analysis. Clin. Genitourin. Cancer 21, 316.e1–316.e11 (2023).
OpenUrl Google Scholar

[21] 21.
Martin, A. R., Daly, M. J., Robinson, E. B., Hyman, S. E. & Neale, B. M. Predicting Polygenic Risk of Psychiatric Disorders. Biol. Psychiatry 86, 97–109 (2019).
OpenUrl Google Scholar

[22] 22.
Roberts, E., Howell, S. & Evans, D. G. Polygenic risk scores and breast cancer risk prediction. The Breast 67, 71–77 (2023).
OpenUrl Google Scholar

[23] 23.
Knowles, J. W. & Ashley, E. A. Cardiovascular disease: The rise of the genetic risk score. PLOS Med. 15, e1002546 (2018).
OpenUrl CrossRef PubMed Google Scholar

[24] 24.↵
Torkamani, A., Wineinger, N. E. & Topol, E. J. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. 19, 581–590 (2018).
OpenUrl CrossRef PubMed Google Scholar

[25] 25.↵
Beesley, L. J. et al. The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities. Stat. Med. 39, 773–800 (2020).
OpenUrl Google Scholar

[26] 26.
Botsis, T., Hartvigsen, G., Chen, F. & Weng, C. Secondary Use of EHR: Data Quality Issues and Informatics Opportunities. Summit Transl. Bioinforma. 2010, 1–5 (2010).
OpenUrl Google Scholar

[27] 27.↵
Rajkomar, A. et al. Scalable and accurate deep learning with electronic health records. NPJ Digit. Med. 1, 18 (2018).
OpenUrl Google Scholar

[28] 28.↵
Shi, X., Li, X. & Cai, T. Spherical Regression Under Mismatch Corruption With Application to Automated Knowledge Translation. J. Am. Stat. Assoc. 116, 1953–1964 (2021).
OpenUrl Google Scholar

[29] 29.↵
Wornow, M. et al. The shaky foundations of large language models and foundation models for electronic health records. Npj Digit. Med. 6, 1–10 (2023).
OpenUrl CrossRef Google Scholar

[30] 30.↵
Xie, F. et al. Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies. J. Biomed. Inform. 126, 103980 (2022).
Google Scholar

[31] 31.↵
Steinfeldt, J. et al. Medical history predicts phenome-wide disease onset and enables the rapid response to emerging health threats. Nat. Commun. 15, 4257 (2024).
OpenUrl Google Scholar

[32] 32.↵
Mars, N. et al. Genome-wide risk prediction of common diseases across ancestries in one million people. Cell Genomics 2, 100118 (2022).
Google Scholar

[33] 33.↵
Sabatello, M. et al. Return of polygenic risk scores in research: Stakeholders’ views on the eMERGE-IV study. Hum. Genet. Genomics Adv. 5, (2024).
Google Scholar

[34] 34.↵
Leitsalu, L., et al. Cohort Profile: Estonian Biobank of the Estonian Genome Center, University of Tartu. Int. J. Epidemiol. 44, 1137–1147 (2015).
OpenUrl CrossRef PubMed Google Scholar

[35] 35.↵
Ritchie, S. C. et al. Integrative analysis of the plasma proteome and polygenic risk of cardiometabolic diseases. Nat. Metab. 3, 1476–1483 (2021).
OpenUrl Google Scholar

[36] 36.↵
Møller, P. L. et al. Combining Polygenic and Proteomic Risk Scores With Clinical Risk Factors to Improve Performance for Diagnosing Absence of Coronary Artery Disease in Patients With de novo Chest Pain. Circ. Genomic Precis. Med. 16, 442–451 (2023).
OpenUrl Google Scholar

[37] 37.↵
Nightingale Health Biobank Collaborative Group et al. Metabolomic and genomic prediction of common diseases in 477,706 participants in three national biobanks. Preprint at doi:10.1101/2023.06.09.23291213 (2023).
OpenUrl Abstract/FREE Full Text Google Scholar

[38] 38.↵
Aguilar, O. T., Chang, C., Bismuth, E. & Rivas, M. A. Integrative machine learning approaches for predicting disease risk using multi-omics data from the UK Biobank. 2024.04.16.589819 Preprint at doi:10.1101/2024.04.16.589819 (2024).
OpenUrl Abstract/FREE Full Text Google Scholar

[39] 39.↵
Mohsen, F., Al-Absi, H. R. H., Yousri, N. A., El Hajj, N. & Shah, Z. A scoping review of artificial intelligence-based methods for diabetes risk prediction. Npj Digit. Med. 6, 1–15 (2023).
OpenUrl CrossRef Google Scholar

[40] 40.↵
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
OpenUrl CrossRef PubMed Google Scholar

[41] 41.↵
Kurki, M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508–518 (2023).
OpenUrl CrossRef Google Scholar

[42] 42.↵
Bastarache, L. et al. Phenotype risk scores identify patients with unrecognized Mendelian disease patterns. Science 359, 1233–1239 (2018).
OpenUrl Abstract/FREE Full Text Google Scholar

[43] 43.↵
Lebovitch, D. S., Johnson, J. S., Dueñas, H. R. & Huckins, L. M. Phenotype Risk Scores: moving beyond ‘cases’ and ‘controls’ to classify psychiatric disease in hospital-based biobanks. 2021.01.25.21249615 Preprint at doi:10.1101/2021.01.25.21249615 (2021).
OpenUrl Abstract/FREE Full Text Google Scholar

[44] 44.↵
Wu, P. et al. Mapping ICD-10 and ICD-10-CM Codes to Phecodes: Workflow Development and Initial Evaluation. JMIR Med. Inform. 7, e14325 (2019).
OpenUrl CrossRef Google Scholar

[45] 45.↵
Cox, D. R. Regression Models and Life-Tables. J. R. Stat. Soc. Ser. B Methodol. 34, 187–202 (1972).
OpenUrl Google Scholar

[46] 46.↵
Jermy, B. et al. A unified framework for estimating country-specific cumulative incidence for 18 diseases stratified by polygenic risk. Nat. Commun. 15, 5007 (2024).
OpenUrl CrossRef Google Scholar

[47] 47.↵
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinforma. Oxf. Engl. 26, 1205–1210 (2010).
OpenUrl Google Scholar

[48] 48.↵
Deyo, R. A., Cherkin, D. C. & Ciol, M. A. Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases. J. Clin. Epidemiol. 45, 613–619 (1992).
OpenUrl CrossRef PubMed Web of Science Google Scholar

[49] 49.↵
Charlson, M. E., Pompei, P., Ales, K. L. & MacKenzie, C. R. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J. Chronic Dis. 40, 373–383 (1987).
OpenUrl CrossRef PubMed Web of Science Google Scholar

[50] 50.↵
Singh, J. A. & Gaffo, A. Gout epidemiology and comorbidities. Semin. Arthritis Rheum. 50, S11–S16 (2020).
OpenUrl CrossRef PubMed Google Scholar

[51] 51.↵
Zhao, Y. et al. The brain structure, immunometabolic and genetic mechanisms underlying the association between lifestyle and depression. Nat. Ment. Health 1, 736–750 (2023).
OpenUrl Google Scholar

[52] 52.↵
Frediani, F. & Villani, V. Migraine and depression. Neurol. Sci. 28, S161–S165 (2007).
OpenUrl CrossRef PubMed Web of Science Google Scholar

[53] 53.↵
Kline, A. et al. Multimodal machine learning in precision health: A scoping review. Npj Digit. Med. 5, 1–14 (2022).
OpenUrl Google Scholar

[54] 54.↵
Kiser, A. C. et al. Standard Vocabularies to Improve Machine Learning Model Transferability With Electronic Health Record Data: Retrospective Cohort Study Using Health Care–Associated Infection. JMIR Med. Inform. 10, e39057 (2022).
OpenUrl Google Scholar

[55] 55.↵
Fiscella, K. & Sanders, M. R. Racial and Ethnic Disparities in the Quality of Health Care. Annu. Rev. Public Health 37, 375–394 (2016).
OpenUrl CrossRef PubMed Google Scholar

[56] 56.↵
Mahajan, S. et al. Trends in Differences in Health Status and Health Care Access and Affordability by Race and Ethnicity in the United States, 1999-2018. JAMA 326, 637–648 (2021).
OpenUrl CrossRef PubMed Google Scholar

[57] 57.↵
Antunes, R. S., André da Costa, C., Küderle, A., Yari, I. A. & Eskofier, B. Federated Learning for Healthcare: Systematic Review and Architecture Proposal. ACM Trans Intell Syst Technol 13, 54:1–54:23 (2022).
OpenUrl Google Scholar

[58] 58.↵
Hippisley-Cox, J., Coupland, C. & Brindle, P. Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study. BMJ j2099 (2017) doi:10.1136/bmj.j2099.
OpenUrl Abstract/FREE Full Text Google Scholar

[59] 59.↵
Hippisley-Cox, J. & Coupland, C. Development and validation of QDiabetes-2018 risk prediction algorithm to estimate future risk of type 2 diabetes: cohort study. BMJ 359, j5019 (2017).
OpenUrl Abstract/FREE Full Text Google Scholar

[60] 60.↵
Goldstein, B. A., Navar, A. M., Pencina, M. J. & Ioannidis, J. P. A. Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review. J. Am. Med. Inform. Assoc. JAMIA 24, 198–208 (2017).
OpenUrl Google Scholar

[61] 61.↵
Zhang, Q., Privé, F., Vilhjálmsson, B. & Speed, D. Improved genetic prediction of complex traits from individual-level data or summary statistics. Nat. Commun. 12, 4192 (2021).
OpenUrl Google Scholar

[62] 62.↵
Pedregosa, F. et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. (2011).
Google Scholar

[63] 63.↵
Therneau, T. M., until 2009), T. L. (original S.->R port and R. maintainer, Elizabeth, A. & Cynthia, C. survival: Survival Analysis. (2024).
Google Scholar

[64] 64.↵
Jr, F. E. H. & functions), C. D. (contributed several functions and maintains latex. Hmisc: Harrell Miscellaneous. (2024).
Google Scholar

[65] 65.↵
Harrell Jr., F. E., Lee, K. L. & Mark, D. B. Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors. Stat. Med. 15, 361–387 (1996).
OpenUrl CrossRef PubMed Web of Science Google Scholar

[66] 66.↵
Pencina, M. J. & D’Agostino, R. B. Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation. Stat. Med. 23, 2109–2123 (2004).
OpenUrl CrossRef PubMed Web of Science Google Scholar

[67] 67.↵
Viechtbauer, W. Conducting Meta-Analyses in R with the metafor Package. J. Stat. Softw. 36, (2010).
Google Scholar

[68] 68.↵
Viechtbauer, W. metafor: Meta-Analysis Package for R. (2024).
Google Scholar

Transferability and accuracy of electronic health record-based predictors compared to polygenic scores

Abstract

Introduction