ABSTRACT
Background To describe whether SARS-CoV-2 viral loads (VLs) and cycle thresholds (CTs) vary by sample type, disease severity and symptoms duration.
Methods Systematic searches were conducted in MEDLINE, EMBASE, BioRxiv and MedRxiv. Studies reporting individual SARS-CoV-2 VLs and/or CT values from biological samples. Paired reviewers independently screened potentially eligible articles. CT values and VLs distributions were described by sample type, disease severity and time from symptom onset. Differences between groups were examined using Kruskal-Wallis and Dunn ‘s tests (post-hoc test). The risk of bias was assessed using the Joanna Briggs Critical Appraisal Tools.
Results 14 studies reported CT values, 8 VLs and 2 CTs and VLs, resulting in 432 VL and 873 CT data points. VLs were higher in saliva and sputum (medians 4.7×108 and 6.5×104 genomes per ml, respectively) than in nasopharyngeal and oropharyngeal swabs (medians 1.7×102 and 4.8×103). Combined naso/oropharyngeal swabs had lower CT values (i.e. higher VLs) than single site samples (p=<0.0001). CT values were also lower in asymptomatic individuals and patients with severe COVID-19 (median CT 30 for both) than among patients with moderate and mild symptoms (31.4 and 31.3, respectively). Stool samples were reported positive for a longer period than other specimens.
Conclusion VLs are higher in saliva and sputum and in individuals who are asymptomatic of with severe COVID-19. Diagnostic testing strategies should consider that VLs vary by sample type, disease severity and time since symptoms onset.
Summary This systematic review found a higher viral load in saliva and sputum than in nasopharyngeal swabs, in asymptomatic individuals and patients with severe COVID-19. Diagnostic testing strategies should consider the type of sample, disease severity and the time since symptoms onset.
INTRODUCTION
In late 2019, the emergence of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) triggered the Coronavirus Disease 19 (COVID-19) pandemic [1,2], which has now affected almost all countries, with devastating consequences. Even though most infections remain asymptomatic or have mild presentation [1–4], a significant proportion progress to severe COVID-19 with high case fatality rates and over 600,000 deaths worldwide by July 2020.
The detection of SARS-CoV-2 infections relies on amplifying the viral ribonucleic acid (RNA) using reverse transcription quantitative polymerase chain reaction (RT-qPCR) assays; multiple “in-house” and commercial assays are available [5], with assay sensitivity varying from 1 to 100 viral genome copies per reaction. The amount of virus detected is thought to vary according to the specimen examined, the time of sampling after symptoms onset and disease severity, although surprisingly there are no systematic reviews describing these variations [6–9]. There is conflicting information on the association of VLs and disease severity. In some studies high VLs are associated with severe COVID-19, as an indication of higher viral replication, cellular damage and pathology[10], which trigger multiorgan failure [11]; however, not all studies have found this association and asymptomatic individuals can have high VLs [12]. Saliva and sputum tend to have higher VLs [13,14], although nasal and oropharyngeal samples are used more frequently [15–17]. Virions can be detected two days before symptom onset when VLs are high, and generally remain detectable for 7-21 days. However, some patients have detectable RNA for up to two months and virus excretion is more prolonged in stools [18,19].
Most RT-qPCR assays report cycle threshold (CT) values or estimate the viral loads (VL) utilising standard curves, which are assay specific [20]. Most reports therefore include either the CT or the VL, as extrapolation from CT values [21], but rarely both values.
Given the heterogeneity of findings, we conducted a systematic review of SARS-CoV-2 CT values and VLs reported in the literature. Herein we describe virus excretion kinetics and how these titers differ with disease severity, time from symptom onset, type of sample tested and the RT-qPCR assay used.
METHODS
Search strategy and selection criteria
The primary questions were whether VLs and CT values varied according to sample type, disease severity (asymptomatic, mild and severe), the time since the start of symptoms and RT-qPCR assay. The search was conducted within the University of Bern Living Evidence on COVID-19 database (https://ispmbern.github.io/covid-19/living-review/), which uses the following search terms: MEDLINE: (“Wuhan coronavirus” [Supplementary Concept] OR “COVID-19” OR “2019 ncov”[tiab] OR ((“novel coronavirus”[tiab] OR “new coronavirus”[tiab]) AND (wuhan[tiab] OR 2019[tiab])) OR 2019-nCoV[All Fields] OR (wuhan[tiab] AND coronavirus[tiab]))))) EMBASE: ncov OR (wuhan AND corona) OR COVID-19 and BioRxiv/MedRxiv: ncov or corona or wuhan or COVID. The database was downloaded with a cut-off data of 8th April 2020. We sub searched the database by retaining literature that contained “load” OR “dynamic*” OR “diagnostic*” OR “shed*” OR “serial” in the title or abstract. We also searched the bibliography of systematic reviews and modelling studies to identify additional studies. No language restrictions were applied, and we assessed the English abstracts of Chinese language manuscripts for inclusion and exclusion criteria. Two reviewers (ALW and CTW) independently conducted screening on the title, abstract and, if required, the full text of the manuscript. We included preprints and a second reviewer (VSS) verified all articles excluded during the full-text search. Studies were retained if they had used RT-qPCR and had reported VLs or CT values of SARS-CoV-2 in respiratory samples (oropharyngeal, nasopharyngeal, saliva, sputum, endotracheal aspirate or bronchoalveolar lavage) or stools, at one or more timepoints from individual patients in the community, outpatient or inpatient settings. Studies were included if they reported individual patient data or graphical displays that allowed data extraction and if it was possible to ascertain the day of sampling in relation to the initiation of symptoms. We included single case reports, case series and cohort studies. We excluded publications without original data, those which reported aggregated data, when data extraction of individual values was not possible and those with potentially overlapping data with other studies. The study protocol is registered at PROSPERO (Registration number: CRD42020181955).
Data extraction and bias assessment
Data were extracted in duplicate (by TE, VSS, ALW, ACA, KK and CTW) using a pre-piloted extraction form. Disagreements were resolved by discussion and involvement of a third party, where necessary. Graphs were digitised using Engauge software (http://markummitchell.github.io/engauge-digitizer/). Extracted data included author, country, participant characteristics, and clinical setting. Individual patient data was retrieved including disease severity (asymptomatic, mild, moderate, severe), as defined by the study authors, frequency and day of symptoms, sample type, diagnostic assay, CT values and/or VLs. When standard curves for the RT-qPCR assays were available, we converted CT values into viral enome copies per ml to allow their comparison with studies only reporting VLs. Standard curves were available for the Corman 2020 RT-qPCR, the China CDC N and ORF1b RT-qPCR, the United States CDC N1, N2, N3 RT-qPCRs [20] and the BGI Schenzen ORF1ab [22]. The values of converted VLs were included in both the CT values and VLs analyses. Although this method is not as accurate as when the standard curves are run experimentally by the study authors, it allows a good approximation for the conversion of CT values into VLs.
The Joanna Briggs Critical Appraisal Tools was used to assess the risk of bias and study quality for case report and case series, as appropriate [23]. After critical appraisal of each item, the studies were rated as good, fair, or poor and the findings were discussed qualitatively.
Statistical analysis
All laboratory data were standardised to display similar units and presented as medians and interquartile ranges (IQR). Categorical variables were described using descriptive statistics with 95% confidence intervals (95%CI). The normal distribution of continuous data was verified using the Kolmogorov-Smirnov test. As most CT values and VLs had skewed distributions, we used nonparametric tests to verify the statistical significance of the differences between groups. Kruskal-Wallis tests were used to assess differences between CT values and VLs by disease severity, sample type and time from symptom onset. If Kruskal-Wallis ‘s tests were significant, we performed multiple comparisons using Dunn ‘s tests (post-hoc test) to determinate differences between groups. Lines were fitted to longitudinal data sets using non-linear regression or polynomial models depending on the suitability of the data. P values <5% were considered statistically significant. Data were analysed using GraphPad Prism Version 5 (GraphPad Software, Inc., San Diego, CA).
RESULTS
The search strategy identified 746 reports. After screening titles and abstracts, 124 full-text articles were assessed for eligibility and 24 studies [1,7,12,14,16,17,24–41] met the inclusion criteria (Supplementary Figure 1). Most studies were rated good (12/24) or fair (9/24) for risk of bias, with three rated poor (Supplementary Tables 1 and 2). Eighteen studies were case series [1,12,14,16,17,26–29,31,32,34,36–41] and six case reports [7,24,25,30,33,35] (Table 1). Most studies included hospitalized patients and only two reported outpatients ‘data. CT values were reported in 18 studies [1,7,12,16,24,26,27,29,30,32–35,37–41,], VLs in six studies [14,17,25,28,31,36] and in six studies [7,12,29,33,35,37] with published standard curves CT values were converted to VLs. Individual patient data was identified for 173 participants yielding a database of 873 CT values and 494 VL data points and their characteristics are shown in Table 2.
Viral loads
The median VL across all sample types was 2.3×104 viral genomes per ml and ranged from 12 to 1×1011 genomes per ml (Supplementary Figure 2). VLs differed by samples type (P <0.0001) (Figure 1) and were highest in saliva (median 4.7×108 genomes per ml) and sputum (6.5×104 genomes per ml). Nasopharyngeal and oropharyngeal swabs had median VLs of 1.7×103 and 4.8×103 per ml, respectively, which were lower than in saliva (P <0.001) and sputum (P <0.001). VLs declined with the number of days since symptoms onset (Figure 2) and were reported for a longer time period from stool (28 days) than oropharyngeal (22 days) and nasopharyngeal swabs (18 days). The large VL in Saliva samples was maintained for the duration of positivity, however VLs in saliva were not reported beyond 12 days.
CT values The median CT values across all samples was 31.2, and ranged from 14.3 to 39.6, (Supplementary Figure 3). Values differed slightly with disease severity (P = 0.0119, Figure 3), as severe COVID-19 cases had median CT values of 30, compared with 31.4 and 31.3 for moderate and mild cases. However asymptomatic individuals also had a median of 30, which was not statistically different from severe COVID-19 cases (P >0.05). CT values also differed by sample type, with median CTs of 32.2 for nasopharyngeal, 31.9 for oropharyngeal, 31.7 for stools and 27.3 for sputum (P <0.0001, Figure 4). Combined naso/oropharyngeal swabs had lower median CTs (29.5) than other samples over the first 10 days of symptoms. Median CT values increased over time (indicating lower VLs) (Figure 5) and positive samples were reported for longer periods from nasopharyngeal swabs than oropharyngeal swabs.
CT values from nasopharyngeal swabs varied with disease severity. Severe cases continued to have positive results recorded for up to 31 days, while the last reported positive result for mild cases was 21 days (Supplementary Figure 4). Finally, CT values varied with the target gene of the RT-qPCR assays (Supplementary Figure 5), with the E gene target providing the lowest CT values, which were statistically significantly lower than the CT values reported for Orf1ab and RdRp (P <0.0001).
DISCUSSION
CTs and VL titers varied by sample type, with saliva and sputum having higher VLs (and lower CTs), than other respiratory samples. Saliva and sputum are easier to collect and less invasive than nasopharyngeal swabs, especially among patients with mild and moderate COVID-19, and can be sampled by the patient, reducing the risk of infection to healthcare workers. However, all saliva datapoints were obtained from a single study [36], and a follow on publication after the search cut-off date of 39 SARS-CoV-2 positive patients suggested a lower VL in comparison with nasopharyngeal samples [42]. Sputum, which is a diagnostic sample used for other respiratory conditions, had higher VLs and detectable virions for a longer time than other respiratory samples. This may be attributed to RNA from the lungs taking longer to be excreted from the lower respiratory tract, possibly by mechanical trapping in necrotic tissues or to lower airway cells containing greater density of human angiotensin converting enzyme 2 receptors, which are the target for virus binding [43]. VLs in the oropharynx were slightly higher than in the nasopharynx, although this was not statistically significant, as reported in studies not included here [44]. However, combining samples from both sites into a single sample resulted in higher VLs (lower CT values), which has been reported for other viruses [45]. This may reflect that the virus is not always detected simultaneously from all anatomical sites, and multiple-site swabs can decrease false negative results by extending the sampling area. Independently of the underlaying reasons, sampling both sites seem to increase the yield for upper respiratory tract specimens.
Our study highlights the presence of high VLs in stools, with shedding recorded up to 28 days post symptoms onset. Although RT-qPCR does not distinguish whole viruses from RNA fractions [15], stool examination is gaining public health importance for surveillance, as its recovery from sewage samples seems to loosely correlate with the levels of infection in the community [46] This also has the potential to forewarn of epidemics [47].
Although VLs have been considered markers of COVID-19 severity[48], with potential as prognostic biomarkers [47], there is no consensus across studies [49,50]. In our analysis, CT values were slightly lower (higher VLs) in patients with asymptomatic infections and severe COVID-19 than among patients with mild and moderate presentations. Moreover, CT values had wide ranges with significant overlap across the groups, limiting their value as prognostic markers.
RT-qPCR assays targeting the E gene generally reported lower CTs than other targets. This is potentially due to many studies using the Charité RT-qPCR assays [5], of which the E gene primer/probe set has been shown to be more sensitive than the RdRp set [51]. The lower CTs obtained with the E gene may not be indicative of inherent increased sensitivity with this PCR target, but demonstrate the high sensitivity of this RT-qPCR assay.
Our data has several limitations that need to be considered to interpret the data. Most studies were case reports or case series and therefore testing was initiated by clinical need or patient management. It is thus possible that data was collected from patients with unusual conditions and does not represent the whole spectrum of disease. Data also represents early reports during the pandemic, when diagnostic capacity was limited, and testing was prioritised for patients with the greatest need. Therefore, VLs may have over-represented patients with symptomatic infections and their contacts and further studies are needed to further document VLs in asymptomatic and mild cases through community surveillance. Other potential limitations include the use of the disease severity classifications of the authors, which were not standardised until later in the epidemic. Moreover, we included pre-prints that had not been peer-reviewed, with potentially varying quality. Despite these limitations, few studies had a significant risk of bias and the main issues noted were data aggregation and a lack of timelines to describe the patient clinical progression.
In conclusion, we have demonstrated that SARS-CoV-2 CTs and VLs vary between sample type, time point, and disease severity. This information will be useful for the selection of specimens for SARS-CoV-2 confirmation, the development of new diagnostic assays and a better understanding of the patterns of VLs over time to be expected among the different specimens. This information will also be useful to inform sample pooling strategies, in which samples from several patients are ‘pooled ‘and tested together, which are being explored as a resource saving approach to increase diagnostic capacity and for large scale screening of asymptomatic populations.
Data Availability
Data is available from the corresponding author by request
Footnotes
Sources of funding: LEC and ERA are funded by the UK National Institute for Health Research Health Protection Research Unit in Emerging and Zoonotic Infections, the Centre of Excellence in Infectious Diseases Research (Liverpool) and the Alder Hey Charity; and Wellcome/DFID through a grant for the development of COVID-19 diagnostics (grant number 220764/Z/20/Z). ALW receives support from the Partnership for Increasing the Impact of Vector Control (PIIVeC), funded by the UK Medical Research Council (grant number MR/P027873/1) through the UK Global Challenges Research Fund.
Conflicts of Interest: The authors have no conflict of interest to declare.
Authors contributions: The study was conceived by ALW, LEC and VSS and study design was developed by ALW, TE, VSS and LEC. Data extraction was conducted by ACA, ALW, KK, CTW, TE, VSS. Statistical analysis was conducted by VSS, LEC and TE. Initial manuscript was prepared by TE, LEC, ALW, and VSS. All authors approved the final manuscript.