Abstract
Background The 2019 novel coronavirus (SARS-CoV-2) is reported to result in both respiratory and non-respiratory severe health outcomes, but quantitative assessment of the risk – while adjusting for underlying risk driven by comorbidities – is not yet established.
Methods A retrospective observational study using electronic health records of 9,344,021 individuals across the U.S. with at-least 1 year of clinical history and followed up throughout 2020.
Results 131,329 individuals were associated with SARS-CoV-2 infection by January 6, 2021 in three distinct surges. While the age and number of preexisting conditions had decreased throughout the pandemic, the characteristics of those who experienced severe health events did not.
During the second surge, between June 7 and November 18, 2020, 425,988 individuals in the base cohort were admitted to emergency rooms or hospitals. Among them, 15,486 were detected with SAR-CoV-2 within few days of admission. Significant adjusted odds ratios were observed between SARS-CoV-2 infection and the following severe health events: respiratory (4.38, 95% confidence interval 4.16– 4.62), bacterial pneumonia (3.25, 2.76–3.83), sepsis (1.71, 1.53–1.91), renal (1.69, 1.57–1.83), hematologic/immune (1.32, 1.20–1.45), neurological (1.23, 1.09–1.38).
Conclusions SARS-CoV-2 infection among hospitalized patients is associated with non-negligible increased risk of severe events including multiple non-respiratory ones. These associations, which complement recent studies, are persistent even after accounting for sources of selection and confounding bias, increasing the confidence they are not spurious.
Introduction
Coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has emerged in Wuhan, China in late 2019 1. The virus then rapidly spread worldwide, causing the first known case in the United States on January 20th, 2020 2, and subsequently infecting millions throughout the year 3.
SARS-CoV-2 is a respiratory virus, but its disease is not limited to the respiratory system. While most COVID-19 cases suffer from mild common-cold-like symptoms to severe respiratory disorders, a growing body of evidence suggests the disease can also cause multi-organ dysfunction – with patients manifesting cardiac, hepatic, renal, and neurological conditions. However, access to quantitative, risk-adjusted comparison of incidence of such events with non-infected populations is still scarce.
Time is a major confounding factor in emerging diseases, since care evolves over time and susceptible population may change. Therefore, there is also a need to characterize the pandemic over time. Prior studies either described clinical characteristics in a relatively small sample and a single time point 4,5,6, or used large-sample state-bureaus data that lack clinical background. Recent works started using electronic health records that allow both larger sample sizes and a clinical history to diagnose COVID-19 7, to describe adverse events 8 and to infer demographic risk factor 9.
In this work we use the IBM Explorys database to analyze detailed clinical and demographic data from multiple centers across the United States. Being large in both breadth and depth, the Explorys database provides a diverse sample with a nation-wide coverage of hospitals and other healthcare facilities that is also rich with detailed patients’ characteristics. It contains information on those detected with SARS-CoV-2 and a general baseline population. This longitudinal database is ideal to address our two main goals: a) to describe temporal changes of individuals detected with SARS-CoV-2 and b) to quantify adjusted associations between SARS-CoV-2 infection and severe health events in multiple organ systems.
Results
Cohort characteristics at baseline
We define a base population cohort to be composed of individuals who had some record in the database for both 2019 and 2020 (Figure 1). Intotal the base population cohort includes 9,344,021, out of which 131,329 have had a SARS-CoV-2 diagnosis before January 6, 2021 and are denoted the base detected cohort. The base detected cohort is, on average, 4 years older than the base population cohort and with lower variance (Table 1, Supp-Figure 1). Other demographic factors are similar between the two cohorts, except for the relative over-representation of (self-reported) African Americans and Hispanics among those detected with SARS-CoV-2, which is consistent with previous reports of racial disparities in the US 10,11. Clinical factors also exhibit bias, where individuals in the base detected cohort tend to have higher rates of various preexisting conditions such as cardiac disorders, diabetes, and neurological disorders (Table 1). Similarly, those detected with SARS-CoV-2 show higher incidence of severe health events previously reported to be presented by COVID-19 patients 12 (see Methods).
Temporal patterns of infection detection
We followed temporal patterns in the base detected cohort over the period of time between March 2020 and December 2020. The lower panel of Figure 2 shows the weekly number of diagnosed individuals, which exhibits three surges of SARS-CoV-2 infection detections: the first starting in mid-March; the second one starting in early June; and a third one starting mid-October.
The top panel of Figure 2, presents the proportion of the medical settings in which individuals had their first record of infection. Early in the outbreak, a larger proportion of detections occurred in inpatient or emergency room setting. However, since late March more cases were detected in an outpatient setting, and this trend is largely continuing throughout the year. Overall, a larger proportion of diagnosis records originated from hospitals relative to outpatient settings in the first surge. This could be attributed to several processes: First, as preparedness increased testing have become more available to a broader population. Second, the virus was being introduced into a younger, possibly less susceptible, population, causing milder symptoms (if any) that did not require inpatient care. Third, as awareness increased in the population, individuals with suspected exposure were instructed not to arrive in the ER without being tested first 13.
Temporal patterns of demographical and clinical characteristics
Figure 3 shows how the characteristics of individuals diagnosed throughout the pandemic change over time (grey dashed line). It further separates those detected with SARS-CoV-2 into two mutually exclusive groups – those who subsequently suffered a severe health event, and those who did not (red and blue lines, respectively). The characteristics presented are average age, proportion of sex, and average number of preexisting conditions, bound by a 95% confidence interval band.
Those suffering from severe health events are consistently older and with more preexisting conditions on average, consistent with both measures being positively correlated. Additionally, while the pooled averages of everyone detected show a decrease over time, the measures of the subgroup experiencing health events are more stable. The proportion of males detected with SARS-CoV-2 shows a gradual average increase over time, though not statistically significant. While the fraction of females among infected was higher than males, the fraction among those with co-occurring severe illness was lower and closer to the fraction of females in the general population (Table 1).
Association between infection and severe events in hospitalized patients
Availability of SARS-CoV-2 tests and non-random testing policies may induce spurious correlations between SARS-CoV-2 detection and severe health events. Therefore, to avoid testing bias, we further focused on individuals who were admitted to emergency room or inpatient hospitalization starting from the second surge, assuming most patients in these settings were likely to be tested.
The dynamics of the incidence for each of the events exhibits a sharp increase at the day of SARS-CoV-2 detection (Supp-Figure 2). This may hint that patients were either detected before encountering the provider addressing the severe event, or that the severe event triggered an encounter which in turn led to an unrelated SARS-CoV-2 detection. Therefore, to avoid such cases, in the following association analysis, follow-up starts at the fourth day of admission and continues for 30 days after admission, to increase the likelihood that the events occurred after SARS-CoV-2 infection and within the clinically relevant time frame following detection.
Out of individuals with hospital encounters between June 7th, 2020 and November 18th, 2020, 15,486 patients had a positive SARS-CoV-2 test in the 14 days prior to hospitalization or within 3 days of hospitalization (denoted hospital-detected, Figure 1) and 410,502 did not have any known positive SARS-CoV-2 tests prior to hospitalization or until at least 14 days after follow-up, which is 30 days after the encounter (denoted hospital-control, Figure 1). A comparison of the prevalence of pre-existing conditions for individuals in these two sub-groups shows a high similarity between the groups (Table 2, Supp-Figure 1), especially compared to the base population (Table 1).
We tested for sensitivity to the lag between admission and follow-up start time (Supp-Figure 3). All outcomes were stable across these parameters, except cardiovascular which showed increasing odds ratios for greater lags. Associations where also stable for all outcomes replacing the 14 days lag from detection to admission by 7, 21, and 28 days but not on the extreme zero days lag. We also tested for sensitivity to the exclusion of 10,190 control patients by reweighting the regression with the inverse propensity to be censored, and results are stable (Supp-Figure 4).
An analysis of the association between SARS-CoV-2 detection and severe health events (Figure 4) shows significant positive associations between detection and most severe health events among hospitalized patients. The odds ratios are adjusted for baseline characteristics extracted for the pre-admission period and until the third day of admission (see Methods, unadjusted results are in Supp-Figure 5). We used viral pneumonia as a positive control, exhibiting an adjusted odds ratio of 594 (95% CI 492-718) (not shown in figures).
Another, possibly more intuitive, way of presenting the results is using the absolute risk. For those hospitalized with no SARS-CoV-2 infection (hospital-control), this risk can be obtained directly from the data. The absolute risk for patients with SARS-CoV-2 (hospital-detected) is then calculated using the odds ratio that was obtained above (which is the ratio between the odds of the health event in the hospital-control and hospital-detected groups). Using this notation, for every 1000 patients in the hospital-control we observed 80 with severe respiratory events compared with 275 for every 1000 in the hospital-detected cohort. Similarly, these numbers are 16 vs. 27 cases of sepsis, 44 vs. 72 renal events, 33 vs. 43 hematologic events, 22 vs. 27 neurological events, and 3 vs. 8 bacterial pneumonia cases out of every 1000 patients in the hospital-control vs. the hospital-detected, respectively. The difference that was observed for cardiovascular events, of 31 vs. 34, was not statistically significant after multiple hypothesis correction (uncorrected P value of 0.05).
Changes in treatment strategies over time
To further examine the reduction in the incidence of severe health events between surges (Supp-Figure 2), we examine the changes in treatments assignment for COVID-19 over time. Since not all treatments are available in all clinical settings (e.g., convalescent plasma), we further focus our cohort on inpatient hospitalizations only. Since a patient can get more than one treatment, Figure 5 presents the proportion of patients receiving a treatment among all the patients diagnosed within that week.
We see prominent use in chloroquine and hydroxychloroquine starting in March, co-occurring with the first publication showing potential benefits 14, followed by a sharp decrease co-occurring with the U.S. FDA cautioning against using the drug for COVID-19 in April 15 and later fully revoking its Emergency Use Authorization (EUA) in mid-June 16.
Not unexpectedly, we see a slow increase in the use of remdesivir starting in May after the U.S. FDA issues an EUA for its use in severe COVID-19 17. Similarly, Dexamethasone use upticks in June, co-occurring with positive preliminary results from a randomized control trial 18,19. The trend for mechanical ventilation is less clear, and may reflect some lack of consensus around the practice 20.
Discussion
This study examined the electronic health records of over 9 million individuals across the United States throughout 2020 to track the COVID-19 pandemic. We identified a set of eight severe events in multiple organs and focused on two questions. First, we examined 131,329 SARS-CoV-2 infected individuals to show the characteristics among those who also suffer severe health events were stable throughout the pandemic. Second, we used a set of 425,988 hospitalized patients to quantify the increase in incidence of severe respiratory, neurological, blood, kidney, heart, sepsis, and bacterial pneumonia events associated with SARS-CoV-2 diagnosis.
The weekly count of cases of SARS-CoV-2 manifests in three surges (sharp increase in cases) occurring on March, June, and October (Figure 2) all consistent with contemporaneous public reporting. Starting from the second surge and onwards, a younger population with fewer preexisting conditions, on average, is being diagnosed (Figure 3) – probably due to increased testing. However, when stratifiedon those also exhibiting severe health events, the subgroup is consistently older and with more preexisting conditions. Their characteristics also vary less over time, which might suggest that there was no observable change in pathogenicity of the virus during this study (e.g., due to selective pressures as it became more prevalent 21). However, the timeframe of the study limits us from observing an effect of the newer variants emerging in late 2020 22.
The over-representation of infected females compared to males in the data does not indicate higher susceptibility to subsequent severe events. Rather, the proportion of diagnosed females suffering severe health events trends towards the rate observed in females in the entire data, suggesting that the risk for severe health events among males and females with SARS-CoV-2 infection is similar, and that there may have been some selection bias early in the pandemic 23. The sex disparity in our data may be attributed to over-utilization by females in specific healthcare services.
Despite that the COVID-19 outbreak impacted different regions across the U.S. at distinct times, data from various geo-diverse regions was aggregated for analysis and may introduce temporal variance. This aggregation also intermingles effects of non-pharmaceutical interventions taken in different places 24,25. However, the infection data extracted from the Explorys database follows similar trends to those observed in the entire US (Supp-Figure 7). This supports that the data, and its progression through time, is representative of the US population, and therefore the temporal changes we have outlined are generalizable to the US.
Incidence for all severe health events are higher in the first surge relative to the second surge (Supp- Figure 2). This could be contributed to any number of factors seen in the data: reduction in average age, reduction in average number of preexisting conditions (Figure 3), and changes in treatment strategies over time (Figure 5). Disentangling direct effects of background factors from treatments is an important subject for further research, and readers are advised not to draw causal conclusion from this apparent correspondence.
Our main analysis focused on 425,988 individuals in hospital encounters and has shown a positive association between SARS-CoV-2 detection and multiple non-pulmonary health events. As expected, the associations are smaller than the positive association of SARS-CoV-2 detection with pulmonary health events but are still statistically and practically significant. These results complement pathophysiological and small-scale studies as reviewed in Gupta et al. 12 with large-scale data-driven quantifications.
We focused on hospitalized patients from the second surge onwards. While this can’t account for admission hesitancy during a pandemic, it can help reduce two sources of bias we can’t control for in our data: non-pharmaceutical intervention policies in different geographic locations that affect infection susceptibility; and testing strategies that can favor at-risk individuals, since admitted patients are almost certain to be tested. Consequently, the baseline characteristics among those with and without diagnosis are quite similar between the two groups in the hospitalization cohort (Table 2), much more sothan in the base population (Table 1). Although it may introduce selection bias 26, this is likely to reduce confounding when measuring the statistical contribution of infection to severe events. Focusing on hospitalizations also mitigates gaps in the data because it increases the likelihood of a more complete longitudinal data set, since the Explorys database is focused on aggregating across specific health care systems rather than specific patients. Our focus on the second surge and onward allows covering a broader population as testing expanded to include a younger population with milder or no symptoms (Figure 3) which represents the overall population better. Additionally, as clinical care improved with time (higher incidence of severe health events during the first surge, see Supp-Figure 2), focusing on hospitalization after the second surge reduces the chance of positive associations being due to suboptimal clinical practices. Lastly, the timeframe of the study is prior to mass distribution of vaccines and therefore unaffected by them.
Since the outcomes are measured from electronic health records, they are less subjective than previous questionnaire-based studies on non-respiratory illness 27. On the other hand, the dataset is not fully linked to mortality records. This may cause underestimation of the risk due to censoring if, for example, hospitalized patients with SARS-CoV-2 have a higher propensity of death compared to others. However, we defined the severe health events to be severe enough so that it should be reasonable to assume that most hospital deaths are preceded by one of these events.
Focusing on hospitalizations and starting follow-up three days after admission improves our confidence that the severe events did not occur prior to infection. Positive associations between SARS-CoV-2 detection and severe events (Figure 4) can have several explanations. First, background comorbidities are considered risk-factors for COVID-19. Second, non-COVID medical conditions can trigger hospitalization, which in turn triggers testing for SARS-CoV-2. Third, direct viral pathogenic effects or indirect effects of the immune response may promote adverse events in different systems. Forth, there may be common factors that affect both susceptibility to COVID-19 as well as to other conditions. To disentangle some of the factors and isolate the contribution of infection, we adjusted for many risk factors and focused the analysis on detections that necessitate hospital admissions and that also occur sufficient time after hospital admission. However, as residual biases may be still prevalent, additional data source are needed to better clarify the cause of the inferred associations.
Conclusions
While we could not determine an association between severe liver events and SARS-CoV-2 infection (95% CI of adjusted-OR [0.37, 1.25]), and cardiovascular illness has borderline statistically significant (adjusted-OR 1.12, p=0.05 before adjusting to multiple hypotheses), we saw SARS-COV-2 significantly associated with increase in risk of severe blood, kidney, neurological, and respiratory events among hospitalized patients. While the absolute risk of non-respiratory events may not be large from an individual perspective (0.5 to 3 percentage point change), the severity of these events renders these results non-negligible. Regardless, it is significant from a healthcare perspective as it represents thousands of potential patients across the US. For severe respiratory events, the risk increase associated with SARS-CoV-2 is very high even in absolute terms (8% to 28%). Additionally, all outcomes that presented high increase in relative risk (22% to 69% more likely) should be of importance for the medical community for public health considerations.
This study used millions of electronic health records to characterize the temporal changes among individuals infected with SARS-CoV-2, compare them to a baseline population and quantify the association between infection and both respiratory and non-respiratory severe health events. We found the risk increase associated with SARS-CoV-2 infection to be significant both statistically and practically. The risk consistently higher for older population with preexisting health conditions throughout the pandemic.
We believe this study sheds more light on how we understand the pandemic quantitively and help the medical community grasp the risk of various manifestation of SARS-CoV-2.
Methods
Cohort development
We constructed a base population cohort of individuals that had an encounter during 2019, and who also had at least one year of history on January 1, 2020. Events were extracted up to January 6, 2021.
The hospital encounter cohort consists of individuals who were admitted to hospital or ER for any reason between June 7 and November 18. We excluded individuals who had an infection more than 14 days prior to hospitalization. We define the follow-up period as the 30 days starting at 4-days after hospitalization. The main analysis also excluded patients who had SARS-CoV-2 detected within the follow up time plus an additional 14 days to account for delayed diagnosis. See Figure 1 for a graphical representation of the inclusion/exclusion criteria.
SARS-CoV-2 infection
SARS-CoV-2 detection identified by both diagnostic and lab-test results codes. Specifically: (1) the official code ICD-10 code U07.1; (2) LOINC codes of SARS-Cov-2 tests with positive results; (3) Textual diagnosis descriptions such as “2019 novel coronavirus detected that were manually reviewed for accuracy; and (4) ICD, LOINC or text codes of other coronaviruses intersected with respiratory problems (see Data supplement A).
Outcomes
Following Gupta et al. 12, we extracted indictors for respiratory events as well as non-pulmonary manifestations including cardiovascular, hematologic/immune, renal, neurological, hepatic and sepsis. We identified severe events of these manifestations by medical expertise review and mapped them into diagnostic, lab, and procedure codes (Table 3).
Treatments
Treatment examined were anticoagulants usage, dexamethasone or other corticosteroids, hydroxychloroquine or chloroquine, remdesivir, cytokine inhibitors, lopinavir or ritonavir, convalescent plasma as well as ventilation and tracheostomy. These treatments were encoded either in ICD-9-CM, ICD-10-PCS, SNOMED, CPT procedure codes or RXCUI for drugs (see Data supplement B).
Preexisting conditions
Pre-existing conditions were extracted based on clinical classifications software (CCS) codes 30. These included known risk-factors for COVID-19 such as hypertension, diabetes type 2, heart conditions as well as other conditions, summing up to 44 comorbidities. Some CCS codes were grouped into a single code (e.g., different cancers were all treated as a single neoplasm category). CCS codes were then mapped into corresponding ICD-9 CM and SNOMED CT codes. A detailed mapping can be found in (see Data supplement C).
A person was considered diagnosed with a comorbidity if they either had diagnostic code anytime in 2019, or if they had a record of an unresolved problem (either self-reported or filled in by a provider) in their problem list with a corresponding comorbidity code as above.
Demographic variables
Demographic variables used are age (in years), sex (binary), insurance type (7 categories), and self-reported race (7 categories), ethnicity (4 categories), and smoking status (3 categories). To reduce missing data, and since demographic characteristics tend to vary less, they were extracted from records of up to 10 years before the 2020 index date.
Temporal analysis
Number of people were aggregated by week to avoid the observed dip in counts during weekends. Python 3.6 with Pandas 0.23.4 31 were used for data manipulation and plotting was done with Altair 4.1.0 32.
Association analysis
A preliminary exploration with Kaplan-Meyer plots of severe-events showed a large proportion of events. Therefore, to reduce the likelihood of considering encounter reasons as outcomes, we started the follow-up period 4 days after admission.
We applied multi-variable logistic regression for every severe event of interest in a follow-up period that begins at the fourth day after admission and last for 30 days. The exposure variable is an indicator of SARS-CoV-2 detection in the period that spans 14 days before and up to the third day after admission day. The three-day grace period was taken to account for diagnosis latency. Control variables included demographic and clinical features, and indicators of severe events in the preceding 30 days.
We performed sensitivity analysis for both design decisions: ranging the follow-up start time from zero to four days and ranging detection lag from zero to four weeks. Finally, we tested sensitivity to loss to follow-up using inverse probability of censoring weighting, fitting a logistic regression, using the same predictors, to predict late detection among the control patients.
Data Availability
Data supporting the findings of this study are available from IBM Explorys but were used under license for the current study and restrictions may apply to their availability.
Notes
Data sharing
Data supporting the findings of this study are available from IBM Explorys but were used under license for the current study and restrictions may apply to their availability.
Funding
None.
Contributors
Conceptualization: TEH, YS, EK, MN, AJ, MRZ. Data curation: MN, SR, MC, AP, Formal Analysis: TEH, EK. Methodology: TEH, EK, YS, MN. Software: TEH, EK, MN, SR. Supervision: AP, AJ, TP, MRZ, YS. Visualization: EK. Writing – original draft: TEH, EK, YS. Writing – review & editing: TEH, EK, YS, AP, AJ, TP, MN, SR, MC, MRZ.
Declaration of interests
TEH has nothing to disclose. EK has nothing to disclose. AP has nothing to disclose. MN has nothing to disclose. SR has nothing to disclose. MC has nothing to disclose. MRZ has nothing to disclose. TP has nothing to disclose. YS has nothing to disclose. AJ reports personal fees from IBM Corporation, outside the submitted work.
Research in context
Evidence before this study
SARS-CoV-2 is a respiratory virus. Nevertheless, since early in the pandemic, there have been reports of COVID-19 patients suffering from non-respiratory complications. SARS-CoV-2 has been found to affect multiple organ systems, such as the cardiovascular, hematologic, immune, hepatic, renal and neurological systems. These correlations where even hypothesized to be driven by underlying risk factors confounding the associations. Studies have hypothesized about the physiological mechanisms and biological pathways from which the virus causes extrapulmonary manifestations. However, to the best of our knowledge, a systematic, big-data-based quantification of the risk for severe events in such systems associated with the virus, has not-yet been established.
Added value of this study
We used 9,344,021 electronic health records from the IBM Explorys database to construct a cohort of hospitalized patients and a general cohort of individuals detected with SARS-CoV-2. We defined a set of severe health events in respiratory and non-respiratory organ systems using medical codes. We modelled the association between SARS-CoV-2 diagnosis and these severe outcomes to quantify the risk of infection. Associations, however, can rise spuriously since SARS-CoV-2 infection and severe events may be bi-directional and confounded by various factors. Therefore, we designed our inclusion/exclusion criteria to reduce selection bias and adjusted for multiple demographic factors and rich clinical history to reduce confounding bias between the infection and the outcomes. We found that among hospitalized patients with SARS-CoV-2 are expected to show 22% to 69% more severe extrapulmonary events than uninfected patients. We further examined how the risk changes over time for the general population to discover that people suffering from severe outcomes showed no observable change in demographic or clinical factors throughout most of the pandemic.
Implications of all the available evidence
The ability to quantify the risk is an imperative step towards optimal risk management. Physicians now have both the physiological mechanism and a quantitative estimation of the risk of extrapulmonary manifestation associated with SARS-CoV-2 infection. This can further allow hospitals to better adjust for expected cases and let policy maker better direct resources based on quantitative assessments.
Supplementary materials
Acknowledgment
The authors thank IBM Research for facilitating this research. Especially to Belinda Ann Perez and Tammy Allen from IBM Watson Health for their assistance in understanding the clinical codes; Ben Kolt from IBM Explorys for his help understanding the Explorys Database; To the Explorys IT team for their timely problem fixing; and to the Machine Learning for Healthcare team at IBM Research Haifa for their discussions and insights.