Summary
Background This study aimed to evaluate the prediction capabilities of clinical laboratory biomarkers to the prognosis of COVID-19 patients.
Methods Observational studies reporting at least 30 cases of COVID-19 describing disease severity or mortality were included. Meta-data of demographics, clinical symptoms, vital signs, comorbidities, and 14 clinical laboratory biomarkers on initial hospital presentation were extracted. Taking the outcome group as the analysis unit, meta-regression analysis with the generalized estimating equations (GEE) method for clustered data was performed sequentially. The unadjusted effect of each potential predictor of the three binary outcome variables (i.e., severe vs. non-severe, critically severe vs. non-critically severe, and dead vs. alive) was examined one by one by fitting three series of simple GEE logistic regression models due to missing data. The worst one was dropped one at a time. Then, a final multiple GEE logistic regression model for each of the three outcome variables was obtained.
Findings Meta-data was extracted from 76 articles, reporting a total of 26,627 cases of COVID-19. Patients were recruited across 16 countries. The number of studies (patients) included in the final models of the analysis for severity, critical severity, and mortality was 38 studies (9,764 patients), 21 studies (4,792 patients), and 24 studies (14,825 patients), respectively. After adjusting for the effect of age, lymphocyte count mean or median ≤ 1.03 (estimated hazard ratio [HR] = 46.2594, p < 0.0001), smaller lymphocyte count mean or median (HR < 0.0001, p = 0.0028), and lymphocyte count mean or median ≤ 0.8714 (HR = 17.3756, p = 0.0079) were the strongest predictor of severity, critical severity, and mortality, respectively.
Interpretation Lymphocyte count should be closely watched for COVID-19 patients in clinical practice.
Introduction
Although numerous treatment options and vaccines are authorized for COVID-19,1, 2 the situation of a global pandemic is still continuing. Each day over four hundred thousand new cases are identified even in time of July 2021.3 COVID-19, the illness caused by infection with SARS-nCoV2,4 is spreading since December 2019 from Wuhan, China, and has accumulated more than 192 million cases and more than 4 million deaths in over 219 countries, area or territories up to 23 July 2021.5 During pandemic period medical care system started to overwhelmed in bunch communities no matter from economically developed or underdeveloped regions.6–13 How to use simple tools to differentiate, triage patients is crucial.
Several laboratory data have been identified as predictors for disease severity or mortality of COVID-19 patients, e.g., lymphocyte count, neutrophil/lymphocyte ratio (NLR), lactate dehydrogenase (LDH), D-dimer, C-reactive protein (CRP), procalcitonin (PCT).14–26 However, since many studies were conducted at the same region during a short period of time,14–22, 26, 27 potential bias of subject duplication cannot be ruled out for the following meta-analysis.28–34 Additionally the relative strength of broader spectrum lab data for their prediction capability has not been explored on a head-to-head basis. This study aimed to investigate whether laboratory data at hospital presentation play a role in distinguishing severity or predicting mortality for COVID-19 patients and to explore the relative significance of these predictors across regions.
Research in context
Evidence before This Study
Plentiful tools, e.g., demographics, symptoms, vital signs, comorbidity, imaging, and lab data have been explored to their prediction capability for COVID-19 patients. Several lab data such as lymphocyte, NLR, LDH, CRP, PCT have been identified to distinguish the severity or to predict the survival of COVID-19 patients. However, most of the meta-analysis research conducted in a specific region at the early stage of the pandemics and subject duplication concerns cannot be ignored due to the large amount papers published within a short period. In addition, the relative strength of these lab items for predictions has not been tested under a broader spectrum which including severity, critical severity, and mortality and across different regions.
Added Value of This Study
Our study proves that several lab data (e.g., WBC count, lymphocyte count, neutrophil count, platelet count, LDH, D-dimer, CRP, and PCT) of COVID-19 patients at initial hospital presentation holds the values to differentiate disease severity (severe vs. non-severe, critically severe vs. non-critically severe) and to predict the final consequence, mortality (dead vs. alive). After comparisons lymphocyte count was the most powerful predictor among the fourteen explored lab data.
The implication of All Available Evidence
Early observing essential lab data for a COVID-19 patient can help to scan the health situation, triage the patient, aid for clinical judgment, predict the severity of disease, and from a pragmatic aspect to allocate medical resources appropriately. Under pandemic condition whereas medical resource is constrained, routine lab testing, which is relatively easy access, self-explanation, cost-effectiveness, could be a valuable tool to help for compacting disease. How to maintain or improve good immunity levels for the general population in daily life can be a crucial strategy to stakeholders in facing life-threatening infectious diseases such as COVID-19.
Methods
Literature Search Strategy
We used the search terms “COVID-19”, “2019-nCoV”, and “coronavirus” in the search field “Title/Abstract”, at the electronic databases: MEDLINE, and EMBASE. The searches were completed on 10 October 2020.
Inclusion and Exclusion Criteria
The eligibility criteria for the inclusion of literature in the meta-analysis were as follows: (1) the literature is the original research; (2) the literature was a study with COVID-19 confirmed patients; (3) the literature was published in English with the full text available; (4) source of subjects, recruitment situation were clearly stated. Literature was excluded from the meta-analysis when (1) disease severity or survival status was not well defined; (2) pediatric study or particular subject group, e.g., specific disease apart from COVID-19; (3) desired lab data at hospital administration for COVID-19 was not available which fourteen lab items were: white blood cell (WBC), neutrophil, lymphocyte, neutrophil to lymphocyte ratio (NLR), platelet, alanine aminotransferase (ALT), aspartate aminotransferase (AST), creatinine, D-dimer, C-reactive protein (CRP), procalcitonin (PCT), lactate dehydrogenase (LDH), and hypersensitive troponin I (hs-cTnI); (4) subjects number below thirty; (5) research subjects may duplicate from other studies after investigation of the sites and the recruitment period. Under condition (5), the study with utmost information by calculating (the number of study subjects) × (number of lab data items)] was selected.
At the initial stage, after duplicates were removed, 1,126 records were identified from MEDLINE or EMBASE databases. Of the leaving records, after the title and abstract review, 660 documents were excluded. The leaving 466 articles were carefully and detailed evaluated. At last, 390 articles were excluded, because the studies did not meet the criteria we have set. Finally, a total of 76 studies with 26,627 patients were included in qualitative synthesis (Figure 1). Among the 76 studies, based on the features of data, a total of 38 studies, 21 studies, and 24 studies were incorporated in the analysis for severity, critical severity, and mortality respectively. After all, a total of 35 studies, 15 studies, and 19 studies were presented in the meta-regression analyses, respectively.
Data Extraction
The following data were extracted from the qualified studies: first author, year/month of the publication, location (city-country), hospital name, definition of disease severity, subject number, number of COVID-19 patients in each health status, age, male to female ratio, vital sign, clinical feature (12 symptoms), comorbidity (any; 8 main diseases), and desired 14 lab data [Appendix I, Appendix II). Lab data on the initial hospital presentation were classified as blood routine, blood biochemistry, coagulation functions, inflammatory markers, myocardial injury markers (see Appendix II).
The primary outcome measures were to compare the level of laboratory data and their impact on different health outcomes (non-severe vs. severe, non-critically severe vs. critically severe, and alive vs. dead) after adjusting the effects of other covariates.
Statistical Analysis
Statistical analysis was performed using the R 4.1.0 software (R Foundation for Statistical Computing, Vienna, Austria). Two-sided p value ≤ 0.05 was considered statistically significant. We chose the outcome groups in the collected studies as the analysis unit ― instead of the collected studies themselves ― in this meta-analytical study. The distributional properties of continuous variables were expressed by mean ± standard deviation (SD), median, interquartile range (IQR), and categorical variables were presented by frequency and percentage (%). In univariate analysis, the unadjusted effect of each potential risk factor, prognostic factor, or predictor of the three binary outcome variables (i.e., severe vs. non-severe, critically severe vs. non-critically severe, and dead vs. alive) was examined respectively using the Wilcoxon rank-sum test, Chi-square test, and Fisher’s exact test as appropriate for the data type. Next, multivariate analysis was conducted by fitting the logistic regression models to estimate the adjusted effects of potential risk factors, prognostic factors, or predictors on the three binary outcome variables (i.e., severe vs. non-severe, critically severe vs. non-critically severe, and dead vs. alive) respectively with the generalized estimating equations (GEE) method. The GEE method was used to account for the correlation between the two outcome groups within a collected study.35 Computationally, we used the geeglm function (with the specified “exchangeable” correlation structure and the default robust estimator of standard error) of the geepack package36, 37 to fit GEE logistic regression models for the three sets of correlated binary responses (i.e., severe vs. non-severe, critically severe vs. non-critically severe, and dead vs. alive) respectively in R.
To ensure a good quality of analysis, the model-fitting techniques for (1) variable selection, (2) goodness-of-fit (GOF) assessment, and (3) regression diagnostics and remedies were used in our GEE logistic regression analyses. All the univariate significant and non-significant relevant covariates (listed in Appendix II) were put on the variable list to be selected. However, each of the collected studies selectively reported the potential risk factor, prognostic factor, or predictor of the three binary outcome variables. If we wanted to assess simultaneously the effects of all the relevant covariates (listed in Appendix II), then the number of studies without missing values would be very few. Thus, our meta-regression analysis was performed by fitting a series of simple GEE logistic regression models and then dropping the worst one at a time to maximally use all the available information. Then, a final multiple GEE logistic regression model for each of the three outcome variables was obtained. Any discrepancy between the results of univariate analysis and multivariate analysis was likely due to the variation in the number of studies without missing values or the confounding effects of uncontrolled covariates in univariate analysis.
The GOF measures, including the estimated area under the receiver operating characteristic (ROC) curve (also called the c statistic) and adjusted generalized R2, and the Hosmer-Lemeshow GOF test were examined to assess the GOF of the fitted GEE logistic regression model. The value of the c statistic (0 ≤ c ≤ 1) ≥ 0.7 suggests an acceptable level of discrimination power. Larger p values of the Hosmer-Lemeshow GOF test imply better fits of logistic regression model.
Simple and multiple generalized additive models (GAMs) were fitted to draw the GAM plots for detecting nonlinear effects of continuous covariates and then for identifying the appropriate cut-off point(s) to discretize continuous covariates, if necessary, during the above variable selection procedure. Computationally, we used the vgam function of the VGAM package with the default values of the smoothing parameters (e.g., s(age, df=4, spar=0) for the cubic smoothing splines) to fit the GAMs for our binary responses, and then used the plotvgam function of the same package to draw the GAM plots for visualizing the linear or nonlinear effects of continuous covariates in R.36, 38, 39 If a separation or high discrimination problem occurred in logistic regression analysis, we fitted the Firth’s bias-reduced logistic regression model using the logistf function of the logistf package in R.40 Finally, the statistical tools of regression diagnostics for residual analysis, detection of influential cases, and check of multicollinearity were applied to discover any model or data problems. The values of the variance inflating factor (VIF) ≥ 10 in continuous covariates or VIF ≥ 2.5 in categorical covariates indicate the occurrence of the multicollinearity problem among some of the covariates in the fitted logistic regression model.
Results
Literature Search and Study Characteristics
Based on the search strategy, 76 articles were included in the qualitative synthesis10, 14, 16–25, 41– 104 including 26,627 COVID-19 confirmed patients (Figure 1, Appendix I). The number of studies (patients) included for the analysis of severity, critical severity, and mortality was 38 studies (9,764 patients), 21 studies (4,792 patients), and 24 studies (14,825 patients) respectively (Figure 1, Appendix I). Patient demographics, clinical features, comorbidities, lab items for each health status were shown (Appendix, Appendix II). All of the selected articles were published in 2020 with patient sizes ranged from 38 to 4,035 subjects. All of the articles were retrospective, observational studies with COVID-19 patients recruited between 1 December 2019 and 27 Jun 2020, from sixteen countries. The majority of studies were conducted in China (46 studies, 60.5%), followed by Italy (5 studies, 6.6%) and the US (5 studies, 6.6%). Most of the subjects come from China (13,483 patients, 50.6%), followed by Spain (4,035 patients, 15.2%) and US (2,691 patients, 10.1%). Except for 21 sole mortality studies, 55 studies incorporated disease severity definitions (Appendix I). Most (49, 89.1%) of the studies were based on the WHO interim guidance105 or national guidance modified from the WHO principles91, 106, 107, followed by the American Thoracic Society Guideline (5, 9.1%) and the International Guideline for Community-Acquired Pneumonia (1, 1.8%). Since the contents of the above guidelines were similar, they were all included in the analysis. In general, disease severity is classified into four types: mild, moderate, severe, and critically severe. Severe: Meet any of the following (1) Shortness of breath, RR>30 times per minute; At room air, SpO2 lower than 93%; (3) The partial pressure of Arterial blood oxygen (PaO2)/the fraction of inspired oxygen (FiO2) ≤ 300mmHg; (4) CT chest imaging shows that lung damage develops significantly within 24 to 48 hours. Critically severe: Meet any of the following (1) Respiratory failure requiring mechanical ventilation; (2) Signs of septic shock; Multiple organ failure requiring ICU admission. For comparison purposes in this study, the subjects in the mild and the moderate conditions were assembled into the non-severe group; subjects in the mild, moderate, and severe were assembled into the non-critical group. There were therefore six health outcomes classified into three pairs in this study: severe vs. non-severe, critically severe vs. non-critically severe, dead vs. alive.
Patient Demographics, Clinical Characteristics, Comorbidities, and Lab Data
Summary statistics of the demographics, clinical characteristics, comorbidities, and laboratory data of the COVID-19 patients on initial hospital presentations for the assessment of severity, critical severity, and mortality were shown in Appendix II-1, Appendix II-2, and Appendix II-3, respectively.
Not every desired lab parameter was collected for each study ― for example, NLR and hs-cTnI were rarely reported. Most of the lab data had statistical significance (all p < 0.05) between the two groups except less collected parameters to the disease severity (Appendix II-1, Appendix II-2); ALT, total bilirubin to the mortality (Appendix II-3).
Predictors for Severity (Severe vs. Non-severe)
Results of univariate analyses of the predictors for severity (severe vs. non-severe) were shown in Table 1-A. As less effect lab items judged by the AUC of ROC kicked out from the model, more and more arms, from m = 14 to m = 70 at the different runs, were recruited in the meta-regression analysis. At the final run (seventh, m = 70) only lymphocyte count (AUC = 0.938) or lymphocyte count ≤ 1.03 or > 2.06 (0.929) and age (AUC = 0.855) or age > 55.02 (AUC = 0.800) existed in the final univariate analyses. Table 1-B listed the result of multivariate analysis for the prediction of severity (severe vs. non-severe). Age mean or median > 55.02 presented a higher risk to the severity (estimated hazard ratio [HR] = 5.7921, p = 0.0058) while lymphocyte count mean or median ≤ 1.03 or > 2.06 showed a strong risk to the severity (HR = 46.2594, p < 0.0001).
Predictors for Critical Severity (Critically Severe vs. Non-critically severe)
Results of univariate analyses of the predictors for critical severity (critically severe vs. non-critically severe) were shown in Table 2-A. At the last run (seventh, m = 30) only lymphocyte count (AUC = 0.933) and age (AUC = 0.829) or age > 59.82 (AUC = 0.767) were existed in the final univariate analyses. Table 2-A listed the result of multivariate analysis for the predictors to critical severity (critically severe vs. non-critically severe). Age and lymphocyte count were in the final meta-regression model. We found that higher lymphocyte count mean or median had an extremely lower risk of critical severity (HR < 0.0001, p = 0.0284) while age mean or median > 59.82 have a higher risk of critical severity (HR = 307.6130, p = 0.0009).
Predictors for Mortality (Dead vs. Alive)
Results of univariate analyses of the predictors for mortality (dead vs. alive) were shown in Table 3-A. At the last run (seventh, m = 38) only lymphocyte count (AUC = 0.935) or lymphocyte count ≤ 0.87 (AUC = 0.895) and age (AUC = 0.913) or age > 67.28 (AUC = 0.895) existed in the final univariate analyses. Table 3-B listed the result of multivariate analysis for the predictors of mortality. Older age mean or median > 67.28 have a higher risk of mortality (HR = 17.3756, p = 0.0079) while lower lymphocyte count mean or median ≤ 0.87 had a higher risk of mortality (HR = 17.3756, p = 0.0079).
Discussion
The results of this study provide numerous imperative insights. After comparisons lymphocyte count was the most powerful predictor among the fourteen explored lab items. Single lab data, lymphocyte count at initial hospital presentation together with age can be remarkable indicators to discriminate the health status (severe vs. non-severe, critically severe vs. non-critically severe) or the final consequence (dead vs. alive) for COVID-19 patients. Compared with vital signs, symptoms, comorbidities, several lab data (CRP, D-dimer, lymphocyte, neutrophil, platelet, LDH) holds the value to differentiate disease severity and to predict the mortality of COVID-19 patients. To the best of our knowledge, this was the first meta-analysis study that potential bias of subject duplication of COVID-19 patients in studies has been eliminated before analyses.
After SARS-CoV2 infection, multiple mechanisms of the human body triggered, e.g., immune (ex. WBC, lymphocyte, neutrophil) responses, inflammatory cataracts (ex. CRP, PCT), and the activation of coagulation cascades (ex. platelet count, D-dimer).108–111 After virus invasion to the tissues which starts early, the inflammation situation intensifies110, 112, the inflammatory indicators will increase dramatically.15, 28, 29, 113, 114 The wide distribution of the COVID-19 receptors, e.g., angiotensin-converting enzyme-2 (ACE2) receptors, abundantly expressed in a variety of cells residing in many human organs, could exaggerate systemic failure due to direct organ injury.115, 116 Organ (ex. lung, liver, kidney, heart, brain, etc.) damage indicators such as ALT, AST, total bilirubin, LDH, hypersensitive troponin I, etc. will be augmented to reflecting the impairment situation.30, 117, 118 Our study confirmed again that levels of several laboratory data although not all are profound predictors to disease severity or mortality for COVID-19 patients as compared with previous studies.15, 28, 29, 30, 114
It is no surprise that lymphocyte count played such an important role to COVID-19 patients in defending SARS-CoV-2.28, 109, 112 Adaptive immune cells such as lymphocytes are essential for virus clearance as well as for recovery from the disease.109, 112, 119 Interaction between SARS-CoV-2 and the immune system of an individual results in a diverse clinical manifestation.16, 75, 88, 94, 96, 112 Our study reveals that lymphocyte count offerings a defensive feature to COVID-19 patients within a certain range (Table 1-B, Table 3-B). Lower lever (e.g., lymphocyte count ≤ 1.03) or extreme lower (e.g., lymphocyte count ≤ 0.87) implies immune weakness and worsens outcomes to disease severity or mortality. However, a too high level of immune response becomes another issue, which may induce unintended results such as cytokine storm.75, 88 In our study, a higher level of lymphocyte count (e.g., lymphocyte count > 2.06) revealed a more severe status to severity (Table 1-B). A current hypothesis is that a cytokine storm can induce or further aggravate SARS-CoV-2 infection.120, 121 The degree to which SARS-CoV-2 targets immune cells remains poorly defined. It is crucial to understand more about the interaction of SARS-CoV-2 with the host immune system and the subsequent contribution to the organ functions and disease progression.
Plentiful tools, e.g., demographics, symptoms, vital signs, comorbidities, imaging, etc., have been explored to their prediction capability for COVID-19 patients.28, 78, 114 However, such data has its limitations. Routine lab data retains several advantages, which can indicate the whole body situation of a COVID-19 patient whose functions can be changed dramatically in few days.122 Additionally lab testing is easy to access, repeatable, self-explain, relatively cheap, and therefore can be a cost-effective tool under pandemic circumstances.
Current criteria to judge the severity, to triage or referral COVID-19 patients, are based on imaging, demographics, comorbidities, vital signs, or symptoms.6, 123, 124 Based on our study results single lab data, lymphocyte count at administration plus age can be useful for the purposes. Early and continue monitoring lab data for a COVID-19 patient can help to understand the health state, triage the patient, predict the severity of disease, predict the health consequences, and workout treatment judgment appropriately.
Our study has several limitations. Due to the lack of non-English articles, pediatric study and specific disease groups, interpretation of the results must be cautious. Ideally, all desired lab data should be collected and analyzed in all studies. However, it is not realistic in the real world because of wide-ranging medical resource deficiency that existed across countries. We suggest collecting essential data through a standardized list while clinical presentation, medical history, imaging information, comprehensive lab data, and other valuable factors, can be assembled and analyzed which will accelerate knowledge accumulation in particular under global pandemics. Retrospective observational study conducted at the level of hospital or community, characteristics of individual patients could not be retrieved. In addition, the dynamic relationship among various lab data, the status of disease progression, functions, and feelings of the patient, have not been explored due to inadequate data. More extensive and large-scale studies are required to double confirm the findings of this study.
Conclusion
Our study involved 26,627 confirmed COVID-19 patients across sixteen countries provides evidence for defending disease under pandemics. Results prove that lymphocyte count is a universal biomarker to disease severity and mortality across regions. Several routine lab data at the initial hospital presented good prediction capability. Routine lab testing could be a useful tool in particular under a pandemic condition whereas medical resource is constrained. How to maintain or improve good immunity levels for the general population in daily life can be a crucial strategy to stakeholders in facing life-threatening infectious diseases such as COVID-19.
Data Availability
The data of this study were available from the corresponding author on request.
Contributors
KLL and FCH designed the study and take responsibility for the integrity of the data and the accuracy of the data analysis. JJC and KLL were in charge of the systematic review and data collection. FCH and FYW conducted the statistical analysis. KLL and FGH contributed to the writing of the manuscript. All authors contributed to data interpretation, reviewed, and approved the final version.
Declaration of interests
All authors declared no competing interests.
Data sharing
The data of this study were available from the corresponding author on request.
Funding
None.
Ethical statement
The study was a systematic review and meta-regression analysis so that ethical approval was not needed.
Supplementary material
Supplementary data to this article can be found online at https://.
Acknowledgments
None.
Footnotes
(conrad{at}cjc.expert), (fuchang.hu{at}gmail.com), (bagausagi{at}gms.tku.edu.tw), (jjchen092000{at}gmail.com)
References
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.
- 8.
- 9.
- 10.↵
- 11.
- 12.
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.
- 18.
- 19.
- 20.
- 21.
- 22.↵
- 23.
- 24.
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.
- 32.
- 33.
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.
- 42.
- 43.
- 44.
- 45.
- 46.
- 47.
- 48.
- 49.
- 50.
- 51.
- 52.
- 53.
- 54.
- 55.
- 56.
- 57.
- 58.
- 59.
- 60.
- 61.
- 62.
- 63.
- 64.
- 65.
- 66.
- 67.
- 68.
- 69.
- 70.
- 71.
- 72.
- 73.
- 74.
- 75.↵
- 76.
- 77.
- 78.↵
- 79.
- 80.
- 81.
- 82.
- 83.
- 84.
- 85.
- 86.
- 87.
- 88.↵
- 89.
- 90.
- 91.↵
- 92.
- 93.
- 94.↵
- 95.
- 96.↵
- 97.
- 98.
- 99.
- 100.
- 101.
- 102.
- 103.
- 104.↵
- 105.↵
- 106.↵
- 107.↵
- 108.↵
- 109.↵
- 110.↵
- 111.↵
- 112.↵
- 113.↵
- 114.↵
- 115.↵
- 116.↵
- 117.↵
- 118.↵
- 119.↵
- 120.↵
- 121.↵
- 122.↵
- 123.↵
- 124.↵