What factors increase the risk of complications in SARS-CoV-2 positive patients? A cohort study in a nationwide Israeli health organization

Chen Yanover; Barak Mizrahi; Nir Kalkstein; Karni Marcus; Pinchas Akiva; Yael Barer; Varda Shalev; Gabriel Chodick

doi:10.1101/2020.05.07.20091652

Abstract

Reliably identifying patients at increased risk for COVID-19 complications could guide clinical decisions, public health policies, and preparedness efforts. To date, the most globally accepted definitions of at-risk patients rely, primarily, on epidemiological characterization of hospitalized COVID-19 patients. However, such characterization overlooks, and fails to correct for, the prevalence of existing conditions in the wider SARS-CoV-2 positive population. Here, we analyze the complete medical records of all SARS-CoV-2 infected individuals (N=4,353) in a large Israeli health organization (representing a population of 2.3 million people), of whom 173 experienced moderate or severe symptoms of COVID-19, to identify the conditions that increase the risk of disease complications, in various age and sex strata. Our analysis suggests that cardiovascular and kidney diseases, obesity, and hypertension are significant risk factors for COVID-19 complications, as previously reported. Interestingly, it also indicates that depression (e.g., odds ratio, OR, for males 65 years or older: 2.94, 95% confidence intervals [1.55, 5.58]; P-value = 0.014) as well cognitive and neurological disorder (e.g., OR for individuals ≥ 65 year old: 2.65 [1.69, 4.17]; P-value < 0.001) are significant risk factors; and that smoking and background of respiratory diseases do not significantly increase the risk of complications. Adjusting existing risk definitions following these observations may improve their accuracy and impact the global pandemic containment efforts.

Introduction

As of May 7^th, 2020, close to four million people worldwide contracted severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and more than 265,000 people died of corona virus disease 2019 (COVID-19) complications. This pandemic poses grave challenges to patients, healthcare providers, and policy makers. Many of these challenges may be better addressed with timely stratification of patients to risk groups, based on their past and current medical characteristics. For example, reliably identifying patients at increased (or decreased) risk could guide clinical decisions (e.g., hospitalization vs home care), public health policies (e.g., risk-based quarantine), and preparedness efforts (e.g., expected medical equipment required).

Various algorithms for identifying patients at risk for COVID-19 (severe) complications have been proposed. The Centers for Disease Control and Prevention (CDC) identified individuals 65 years and older, living at nursing home or long-term care facility, or suffering from underlying medical conditions, particularly if not well controlled, as being at high risk for severe illness from COVID-19 [1]. Similarly, the European Centre for Disease Prevention and Control (ECDC) lists age above 70 years and some underlying conditions as risk factors for critical illness [2]. The United Kingdom National Health Service (NHS) included solid organ transplant recipients, patient with specific cancers or severe respiratory conditions, pregnant women with significant heart disease, and those with increased risk of infection (e.g., due to immunosuppression therapies) in the highest clinical COVID-19 risk group [3]. In April 2020 approximately 1.3 million people in this group were asked to "shield" by staying at home for a period of at least 12 weeks. In addition, patients over 70 years and those suffering from some underlying health conditions (e.g., chronic respiratory diseases, BMI ≥ 40, and pregnant women) were considered in a wider vulnerable group (also referred to as the "flu group"). Finally, a more quantitative risk model (derived from [4]) has been adopted by the Israeli Ministry of Health (MoH), assigning a point for each underlying condition from a predefined list, then considering age group and point count to identify high risk patients.

Initially, these algorithms have been derived from a quickly growing number of epidemiological characterization studies (e.g., [5,6]), which report the prevalence of various conditions in a population of interest, typically severe COVID-19 patients. These studies provide timely and important information; however, identifying risk factors calls for a comparative analysis, contrasting the prevalence of conditions in case and control populations. To date, only a handful of studies implemented such an approach, using, for example, the general population [7] or confirmed (and symptomatic) COVID-19 patient cohort [8]. Similar to these efforts, we analyze here the medical records of all SARS-CoV-2 positive patients in a nationwide health organization (covering 2.3 million individuals); compare the prevalence of existing conditions in complicated and non-complicated cohorts; and identify those conditions associated with COVID-19 complications in various age and sex strata. Our analysis highlights stratum-specific risk factors and may allow better identification of patients at risk, in different subpopulations.

Methods

Maccabi COVID-19 data

Maccabi Health Services (MHS) is a nationwide health plan (payer-provider), representing a quarter of the Israeli population. The MHS database contains longitudinal data on a stable population of over 2.3 million people since 1993 (with annual attrition rate lower than 1%). Data are automatically collected and include comprehensive laboratory data from a single central lab, full pharmacy prescription and purchase data, and extensive demographic information on each patient.

Studied cohorts

SARS-CoV-2 polymerase chain reaction testing in Israel uses both nasopharyngeal and saliva samples. Individuals with positive testing result (until April 22, 2020) are included in the SARS-CoV-2 positive cohort. Positive patients whose disease status, as updated by Israeli hospitals, deteriorated to moderate or severe (at any point in time), admitted to the intensive care unit, or died constitute the complicated COVID-19 cohort; the remaining SARS-CoV-2 positive patients (including asymptomatic, mild COVID-19 patients or those with unknown status) constitute the non-complicated COVID-19 cohort.

Existing conditions

Beside age and sex, we considered a set of existing conditions, comprising those included in the CDC, NHS, and Israeli MoH at-risk definitions, as well as a set of conditions showing significant association with flu and flu-like complications.

To identify each individual’s existing conditions, we used, where available, registries created and maintained by MHS. These registries are based on validated inclusion and exclusion criteria (considering coded diagnoses, treatment, labs, and imaging, as applicable). The registries are continuously and retrospectively (since 1998) updated based on each patient’s central medical record. Patients may be excluded from a registry when deemed misclassified by their primary physician. Linkage across registries and with other sources of information is performed via a unique national identification number. MHS registries used are: Cardiovascular diseases (including ischemic heart disease, congestive heart failure, peripheral vascular disease, cerebrovascular disease, and other cardiovascular diseases) [9], diabetes [10,11], hypertension [12], osteoporosis [13], chronic kidney disease [14], cognitive disorders, mental illness [15], cancer, immunosuppression, weight disorders (obesity, overweight and underweight), smoking, and nursing home. For other conditions, we relied on previously grouped lists of diagnosis codes (Read codes or International Classification of Diseases, ICD, codes, ninth revision) [16-18]: Deficiency anemia, Fluid and electrolyte disorders, chronic obstructive pulmonary disease (COPD), chronic pulmonary disease, neurological disorders, end stage renal disease, rheumatoid arthritis, paralysis, hip fracture, lymphoma, aspiration pneumonia, pleural effusion, respiratory failure, and alcohol consumption.

Statistical analysis

We extracted the prevalence of the studied conditions (excluding ones with less than 20 occurrences) in the non-complicated and complicated COVID-19 cohorts and measured the association between each condition and disease complications by computing the corresponding odds ratio and its estimated statistical significance (using Fisher’s exact test). We conducted the analysis separately in three age groups: 18-50 years, 50-65 years, and 65 years and older; as well as four (age, sex) strata: male or female, younger or older than 65 years. Finally, to account for multiple testing, we controlled for the false discovery rate using Benjamini and Hochberg’s method [19]. All analyses were performed using version 4.0.0 of the R programming language (R Project for Statistical Computing; R Foundation).

Results

Maccabi Health Services (MHS) SARS-CoV-2 positive cohort included 4,353 individuals of whom 173 deteriorated to moderate (N=87, 50%) or severe condition (N=45, 26%), admitted to intensive care unit (N=66, 38%, partly overlapping with other conditions), or died (N=21, 12%), and make up the complicated COVID-19 cohort. Overall, patients in the complicated COVID-19 cohort are older, suffer from more comorbidities, and are more predominantly male (Table 1). Moreover, the prevalence of COVID-19 complications increases with age, and more steeply for men than for women (Figure 1.); and the risk of COVID-19 complications in men under 70 years is significantly higher than in women (Table 2).

View this table:

Table 1.

Characteristics of the SARS-CoV-2 positive, complicated, and non-complicated COVID-19 patient cohorts.

Figure 1.

Age and complicated COVID-19. Prevalence of complicated COVID-19 (moderate or severe condition, y-axes) in different age groups (x-axis), shown separately for male (blue) and female (orange).

View this table:

Table 2.

Association of male sex and COVID-19 complications across age groups.

Comparing the prevalence of existing conditions in three age groups between the complicated and non-complicated COVID-19 cohorts, reveals multiple risk factors, including obesity for patient 18-50 years (OR: 11.09, 95% confidence intervals, CI: [4.15, 32.67]; P-value < 10^-4), Chronic kidney disease for patients 50-65 years (4.06 [1.89, 8.38]; P-value = 0.005); and neurological disorders (2.65 [1.69, 4.17]; P-value < 0.001) for patients 65 years or older (for a complete list, see Table 3 and Supplementary File 1).

View this table:

Table 3.

Most statistically significant conditions associated with increased risk of COVID-19 complications in age stratified patient groups.

Stratifying over age (below and above 65 years) and sex (Table 4 and Supplementary File 1), kidney diseases appear as a risk factor in all strata (e.g., OR 3.45 [1.57, 8.06]; P-value = 0.015 in women 65 years or more). Additional risk factors include hypertension in males under 65 years (4.56 [2.35, 8.55]; P-value < 0.001); neurological disorders in females 65 years or older (3.55 [1.68, 7.74]; P-value = 0.008); cognitive impairment (4.18 [1.81, 9.72]; P-value = 0.009) and depression (2.94 [1.55, 5.58]; P-value = 0.014) in males 65 years or older.

View this table:

Table 4.

Most statistically significant conditions associated with increased risk of COVID-19 complications in age and sex stratified patient groups. See Table 3 for column information.

Intriguingly, respiratory diseases and smoking, which contribute to common at-risk definitions (including those by the CDC [1] and the British NHS [3]), while typically more prevalent in complicated COVID-19 patients, are not identified as significant risk factors (e.g., chronic obstructive pulmonary disease in patients 65 years or more: OR 1.36 [0.75, 2.4]; P-value = 0.634); and see Supplementary File 1.

Discussion

We compared the prevalence of dozens of existing conditions in Israeli SARS-CoV-2 positive and complicated COVID-19 patient cohorts, to highlight those conditions associated with high risk of complications. A few other studies have employed a similar study design to identify risk factors for COVID-19 complications. For example, Ebinger et al [8] studied a cohort of symptomatic COVID-19 individuals (N=442) and examined the association of existing conditions with disease severity; and the OpenSAFELY Collaborative explored the risk of COVID-19-related hospital death among the general population (N>17M). We emphasize that cohort composition dictates the research question it can address: our analysis focuses on SARS-CoV-2 positive individuals, hence searches for risk factors of complications in patients who already contracted the virus (but are potentially asymptomatic), while studying the general population may combine risk factors for infection and COVID-19 severe outcome. Additionally, cohorts that consider only a subset of patients, defined based on disease outcome (e.g., symptomatic or hospitalized) or otherwise non-representative of the entire population (e.g., demographically skewed), may introduce biases to the analysis; instead, we study here all SARS-CoV-2 infected patients in a large, nationwide health organization.

Many conditions highlighted by our analysis have been previously reported [5,6,8] and are part of commonly used at-risk definitions [1,3], including hypertension, obesity, kidney and cardiovascular diseases. We do, however, identify a few additional risk factors, notably depression in patients 18-50 years old and males 65 years or more; and cognitive and neurological disorders in patients 65 years or older. These additions may be, in part, associated with different age distribution in the 65+ years group (median 76y, IQR [70-83.5y] versus 72y [68-78y] in the complicated and non-complicated COVID-19 cohorts, respectively) and rely on small sample size (only seven 18-50y patients with depression in the complicated COVID-19 cohort; Table 3); nonetheless, with some preliminary support [7], they may deserve more consideration in future studies. Our analysis also points out to reduced importance of respiratory diseases and smoking. Both conditions appear as factors in most at-risk definitions [3,5]: Chronic obstructive pulmonary disease has been associated with severe COVID-19 in multiple (though not all [6]) studies [20], while the role of smoking has been somewhat controversial [20,21]. The discrepancies between our analysis and previous reports likely stem from the different cohorts analyzed: SARS-CoV-2 positive individuals, ranging from asymptomatic to severe COVID-19 versus hospitalized COVID-19 patients, respectively. Other study-related attributes, for example country-specific characteristics, may also contribute to the varying significance of the studied risk factors.

In parallel to the COVID-19 epidemiological characterization efforts, researchers have also attempted to use retrospective observational data to derive risk models for severe COVID-19 patients [22]. Such models require ample data of COVID-19 patients for both model training and performance assessment. As, currently, such data are scarce, some models compromised on using data for other diseases with, supposedly, similar clinical trajectory and complications. For example, DeCapprio et al [19] trained models on US Medicare claims data to predict inpatient visits with a primary diagnosis of either pneumonia, influenza, acute bronchitis, or other specified upper respiratory infections as proxy for COVID-19 complications. However, as previously reported (e.g., in [24]), and in agreement with our analysis, severe COVID-19 patient characteristics differ considerably from other diseases’, thus limiting the generalizability of such models to COVID-19 and requiring adjustments of their parameters [4].

Our study has several limitations. First and foremost, the number of complicated COVID-19 patients in MHS data is below 200, limiting the statistical power of our analysis. Second, healthcare policies and, in particular, testing criteria, may systematically bias the composition of SARS-CoV-2 positive cohort. Third, asymptomatic and mild COVID-19 patients (currently in the non-complicated cohort) may deteriorate and eventually be part of the complicated cohort, potentially modifying the results of the analysis. Fourth, our analysis is univariate in nature, testing the association of individual conditions with COVID-19 complications; as such, it is unable to uncover more complex relations, e.g., interdependencies between conditions and COVID-19 complications. Finally, we focused on data from Israel; characteristics in other geographies may differ [24]. We attempted to mitigate some of these limitations by age and sex stratification and robust estimations of statistical significance. We also note that, at the current point in time, many of these shortcomings are shared by all published COVID-19 research work.

Notwithstanding these limitations, our work adopts a novel vantage point to the problem of identifying patients at increased risk for COVID-19 complications. Importantly, as SARS-CoV-2 containment efforts focus on patients at risk for severe complications (for example, shielding vulnerable population in the UK [3]), changes in the list of considered conditions may have huge effect on a large number of individuals, thus calling for continuous fine-tuning of the corresponding definitions.

Ethical approval

The study was approved by Maccabi Health Services’ institutional review board.

Competing Interest Statement

All authors have completed the ICMJE uniform disclosure form, with the following declarations made: PA reports personal fees and other from Medial Research, outside the submitted work.

Data Availability

Patient-level de-identified data from Maccabi Health Services

Author Contributions

Conceptualization: NK; Data Curation: BM, KM, YB; Investigation and methodology: CY, BM, NK; Project Administration: PA, VS; Writing – original draft preparation: CY; Writing – review and editing: CY, BM, KM, PA, YB, GC.

Acknowledgement

We thank Guy Amit and Irena Girshovitz, KI Research Institute, for insightful discussions and comments.

References

1.↵
Centers for Disease Control and Prevention. Coronavirus Disease 2019 (COVID-19). 11 Feb 2020 [cited 21 Apr 2020]. Available: https://www.cdc.gov/coronavirus/2019-ncov/need-extra-precautions/people-at-higher-risk.html
2.↵
European Centre for Disease Prevention and Control. Rapid risk assessment: Coronavirus disease 2019 (COVID-19) pandemic: increased transmission in the EU/EEA and the UK - eighth update. In: European Centre for Disease Prevention and Control [Internet]. 8 Apr 2020 [cited 21 Apr 2020]. Available: https://www.ecdc.europa.eu/en/publications-data/rapid-risk-assessment-coronavirus-disease-2019-covid-19-pandemic-eighth-update
3.↵
United Kingdom National Health Service. Guidance and updates for GPs: Highest clinical risk patients. [cited 27 Apr 2020]. Available: https://www.england.nhs.uk/coronavirus/publication/guidance-and-updates-for-gps-at-risk-patients/
4.↵
Barda N, Riesel D, Akriv A, Levi J, Finkel U, Yona G, et al. Performing risk stratification for COVID-19 when individual level data is not available, the experience of a large healthcare organization. 28 Apr 2020 [cited 10 May 2020]. doi:10.1101/2020.04.23.20076976
OpenUrl Abstract/FREE Full Text
5.↵
Xu L, Yaqian M, Chen G. Risk factors for severe corona virus disease 2019 (COVID-19) patients: a systematic review and meta analysis | medRxiv. [cited 21 Apr 2020]. Available: https://www.medrxiv.org/content/10.1101/2020.03.30.20047415v1
6.↵
Richardson S, Hirsch JS, Narasimhan M, Crawford JM, McGinn T, Davidson KW, et al. Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized With COVID-19 in the New York City Area. JAMA. 2020 [cited 28 Apr 2020]. doi:10.1001/jama.2020.6775
OpenUrl CrossRef PubMed
7.↵
The OpenSAFELY Collaborative, Williamson E, Walker AJ, Bhaskaran KJ, Bacon S, Bates C, et al. OpenSAFELY: factors associated with COVID-19-related hospital death in the linked electronic health records of 17 million adult NHS patients. 7 May 2020 [cited 11 May 2020]. doi:10.1101/2020.05.06.20092999
OpenUrl Abstract/FREE Full Text
8.↵
Ebinger JE, Achamallah N, Ji H, Claggett BL, Sun N, Botting P, et al. Pre-Existing Traits Associated with Covid-19 Illness Severity. 5 May 2020 [cited 11 May 2020]. doi:10.1101/2020.04.29.20084533
OpenUrl Abstract/FREE Full Text
9.↵
Shalev V, Chodick G, Goren I, Silber H, Kokia E, Heymann AD. The use of an automated patient registry to manage and monitor cardiovascular conditions and related outcomes in a large health organization. Int J Cardiol. 2011;152: 345–349. doi:10.1016/j.ijcard.2010.08.002
OpenUrl CrossRef PubMed
10.↵
Sella T, Shoshan A, Goren I, Shalev V, Blumenfeld O, Laron Z, et al. A retrospective study of the incidence of diagnosed Type 1 diabetes among children and adolescents in a large health organization in Israel, 2000–2008. Diabet Med J Br Diabet Assoc. 2011;28: 48-53. doi:10.1111/j.1464-5491.2010.03174.x
OpenUrl CrossRef PubMed
11.↵
Chodick G, Heymann AD, Shalev V, Kookia E. The epidemiology of diabetes in a large Israeli HMO. Eur J Epidemiol. 2002;18: 1143-1146. doi:10.1023/B:EJEP.0000006635.36802.c8
OpenUrl CrossRef
12.↵
Weitzman D, Chodick G, Shalev V, Grossman C, Grossman E. Prevalence and factors associated with resistant hypertension in a large health maintenance organization in Israel. Hypertens Dallas Tex 1979. 2014;64: 501–507. doi:10.1161/HYPERTENSIONAHA.114.03718
OpenUrl CrossRef
13.↵
Goldshtein I, Nguyen AM, dePapp AE, Ish-Shalom S, Chandler JM, Chodick G, et al. Epidemiology and correlates of osteoporotic fractures among type 2 diabetic patients. Arch Osteoporos. 2018;13: 15. doi:10.1007/s11657-018-0432-x
OpenUrl CrossRef
14.↵
Yu J, Goldshtein I, Shalev V, Chodick G, Ish-Shalom S, Sharon O, et al. Renal impairment among postmenopausal women with osteoporosis from a large health plan in Israel. Arch Osteoporos. 2015;10: 210. doi:10.1007/s11657-015-0210-y
OpenUrl CrossRef
15.↵
Kodesh A, Goldshtein I, Gelkopf M, Goren I, Chodick G, Shalev V. Epidemiology and comorbidity of severe mental illnesses in the community: findings from a computerized mental health registry in a large Israeli health organization. Soc Psychiatry Psychiatr Epidemiol. 2012;47: 1775–1782. doi:10.1007/s00127-012-0478-9
OpenUrl CrossRef PubMed
16.↵
Metcalfe D, Masters J, Delmestri A, Judge A, Perry D, Zogg C, et al. Coding algorithms for defining Charlson and Elixhauser co-morbidities in Read-coded databases. BMC Med Res Methodol. 2019;19: 115. doi:10.1186/s12874-019-0753-5
OpenUrl CrossRef
17.
Kuan V, Denaxas S, Gonzalez-Izquierdo A, Direk K, Bhatti O, Husain S, et al. A chronological map of 308 physical and mental health conditions from 4 million individuals in the English National Health Service. Lancet Digit Health. 2019;1: e63-e77. doi:10.1016/S2589-7500(19)30012-3
OpenUrl CrossRef PubMed
18.↵
Clinical Classifications Software (CCS) 2015. Agency for Healthcare Research and Quality Healthcare Cost and Utilization Project (HCUP); Available: https://www.hcup-us.ahrq.gov/toolssoftware/ccs/CCSUsersGuide.pdf
19.↵
Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J R Stat Soc Ser B Methodol. 1995;57: 289–300. doi:10.2307/2346101
OpenUrl CrossRef PubMed
20.↵
Zhao Q, Meng M, Kumar R, Wu Y, Huang J, Lian N, et al. The impact of COPD and smoking history on the severity of Covid-19: A systemic review and meta-analysis. J Med Virol. n/a. doi:10.1002/jmv.25889
OpenUrl CrossRef
21.↵
Lippi G, Henry BM. Active smoking is not associated with severity of coronavirus disease 2019 (COVID-19). Eur J Intern Med. 2020;75: 107–108. doi:10.1016/j.ejim.2020.03.014
OpenUrl CrossRef PubMed
22.↵
Wynants L, Van Calster B, Bonten MMJ, Collins GS, Debray TPA, De Vos M, et al. Prediction models for diagnosis and prognosis of covid-19 infection: systematic review and critical appraisal. BMJ. 2020; m1328. doi:10.1136/bmj.m1328
OpenUrl Abstract/FREE Full Text
23.
DeCaprio D, Gartner J, Burgess T, Kothari S, Sayed S, McCall CJ. Building a COVID-19 Vulnerability Index. ArXiv200307347 Cs Stat. 2020 [cited 21 Apr 2020]. Available: http://arxiv.org/abs/2003.07347
24.↵
Burn E, You SC, Sena A, Kostka K, Abedtash H, Abrahao MTF, et al. An international characterisation of patients hospitalised with COVID-19 and a comparison with those previously hospitalised with influenza. medRxiv. 2020; 2020.04.22.20074336. doi:10.1101/2020.04.22.20074336
OpenUrl Abstract/FREE Full Text