Introduction

Hydroxychloroquine is a registered therapeutic against malaria and several autoimmune conditions, and has in vitro inhibitory effects on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in nontoxic concentrations1. In a recent report, hydroxychloroquine given more than 2 weeks after first symptoms had no effect on the clearance of SARS-CoV-2 in various respiratory tract specimens of afebrile patients hospitalized with coronavirus 2019 (COVID-19) partly pretreated with antiviral drugs2. Viral clearance is however an incomplete characterization of viral kinetics, and the impact of hydroxychloroquine therapy on SARS-CoV-2 viral kinetics in subjects hospitalized with COVID-19 remains to be elucidated. Hydroxychloroquine is postulated to affect viral replication, and it is reasonable to assume that an effective antiviral drug will affect respiratory tract viral titers and thereby improve the symptoms and host inflammatory responses, including the cytokine and chemokine expression that is likely responsible for many of the clinical symptoms of COVID-193. Here we report findings from a two-arm, open label, pragmatic randomized controlled trial, the Norwegian Coronavirus Disease 2019 (NO COVID-19) Study (NCT04316377), that assessed the efficacy and safety of hydroxychloroquine therapy on SARS-CoV-2 oropharyngeal viral kinetics in patients hospitalized with moderately severe COVID-19. Treatment with hydroxychloroquine in addition to standard care did not result in a significantly greater rate of decline in SARS-CoV-2 oropharyngeal viral load compared to standard care alone during the first 5 days of hospitalization. Our findings suggest no important antiviral effect of hydroxychloroquine in humans infected with SARS-CoV-2.

Results

Patient characteristics

From March 25, 2020, through May 25, 2020, 27 patients were randomized to hydroxychloroquine sulfate in addition to standard care and 26 patients to standard care alone (Fig. 1). Details regarding baseline demographics, clinical variables on hospital admission and safety during the trial can be found in Table 1. Time from onset of symptoms to randomization was 8 (interquartile range [IQR] 7 to 12) days in our study. Median age was 62 (IQR 50 to 73) years, 35 patients (66.0%) were male and 2 patients (3.8%) were current smokers. Median body temperature was 38.2 (IQR 37.5 to 38.7) °C and 20 (37.7%) patients required supplemental oxygen on admission. We found no substantial differences in numbers and proportion of adverse events of special interest, serious adverse events or suspected unexpected serious adverse reactions between the hydroxychloroquine plus standard care group versus standard care alone.

Fig. 1: CONSORT diagram.
figure 1

Flow diagram of the NO COVID-19 Study according to CONsolidated Standards Of Reporting Trials (CONSORT).

Table 1 Demographics, baseline characteristics, and safety during the trial.

Primary outcome

Fifty-one participants were included in the intention-to-treat analysis; 117 samples of the 133 RT-qPCR-results were above the limit of detection (2.11 log10 RNA copies/mL). The rate of reduction in SARS-CoV-2 viral load was 0.24 (95% CI 0.03 to 0.46) log10 RNA copies/mL/24 h in the hydroxychloroquine group and 0.14 (95% CI −0.10 to 0.37) log10 RNA copies/mL/24 h in the standard care group (reduction rate difference between the groups 0.11 [95% CI −0.21 to 0.43] log10 RNA copies/mL/24 h; Fig. 2). Individual RT-qPCR results can be found in the trial open source repository (https://doi.org/10.17605/OSF.IO/U34R9).

Fig. 2: Oropharyngeal viral load (log10) in hydroxychloroquine plus standard care versus standard care in the intention-to-treat population (n = 51; full analysis set).
figure 2

One patient in the hydroxychloroquine plus standard care had missing baseline data for viral concentrations and one patient in standard care withdrew consent before viral load assessment at 48 h. Data are accordingly shown for 26 patients assigned to hydroxychloroquine plus standard care and 25 patients assigned to standard care. Estimated mean difference between groups was 0.27 (95% CI −0.92 to 1.47) log10 RNA copies/mL at randomization, 0.06 (95% CI −1.15 to 1.26) log10 RNA copies/mL at 48 h, and −0.16 (95% CI −1.67 to 1.36) log10 RNA copies/mL at 96 h. Plot displays estimated marginal means (dots) and 95% confidence intervals (error bars).

Secondary outcomes

One subject (3.7%) died in-hospital in the hydroxychloroquine plus standard care group and one subject (3.9%) died in-hospital in the standard care alone group. There were no further mortalities up to day 30. There was little difference in clinical status based on the 7-point scale at day 14 after randomization (cumulative odds ratio 1.11 [95% CI 0.31 to 4.01], Supplementary Table 1). The time from randomization to hospital discharge was similar between the two groups (p by log-rank test = 0.71; Supplementary Fig. 1). There was little difference in the change in NEWS2 from randomization to 96 h postrandomization (marginal mean change 0.47 [95% CI −0.58 to 1.53] points in the hydroxychloroquine group; 0.29 [95% CI −0.88 to 1.46] points in the standard care group; difference between groups 0.18 [95% CI −1.40 to 1.76] points; Supplementary Table 2 and Supplementary Fig. 2).

Discussion

In patients with moderately severe COVID-19 in need of hospital admission, treatment with hydroxychloroquine sulfate initiated median 8 days after first symptoms did not result in a significantly greater rate of decline of SARS-CoV-2 oropharyngeal viral load compared to standard care alone during the next 5 days. The study was stopped prematurely due to difficulties in recruiting patients and we cannot exclude a clinically important difference in viral kinetics between the two arms due to lack of power. There is currently no consensus on what a meaningful decrease in SARS CoV-2 viral load after antiviral treatment would be. However, for influenza treatment, a 2 log10 fall within 48 h has been proposed as a meaningful endpoint to reach for future influenza therapies3. Our confidence interval of the treatment effect lies well within this limit, making a clinically important difference between treatment arms less likely. We did not observe any major effect of treatment with hydroxychloroquine on short-term mortality, degree of illness, duration of hospital admission or clinical status.

Measures of rate of decline in virus replication as primary end points to evaluate antiviral drug efficacy are crucial3. Rapid reductions in active viral replication may be essential to prevent tissue damage and to further clinical recovery, as well as reduce risk of viral complications and mortality. By exploring viral load as a continuous outcome, a more sensitive statistical index compared to dichotomy4, the neutral result of hydroxychloroquine versus standard care in our study strongly suggests no major effect of hydroxychloroquine on the principal pathology in COVID-19. This model is supported by no apparent effect of hydroxychloroquine on SARS-CoV-2 negative conversion rate in Chinese patients with mild to moderate COVID-192. The patients in the aforementioned report were however younger, afebrile and partly pretreated with antiviral drugs. The duration from onset of symptoms was additionally notably longer compared to the current investigation, which with median 8 days from start of symptoms to start of therapy closely mimics the typical clinical course of COVID-19 characterized by clinical deterioration and need for hospitalization admission 1 week after illness onset5. Accordingly, our trial extends the results of previous investigations to more acutely ill and febrile patients in need of hospital admission. In light of the COVID-19 pandemic, there is an unmet need of pharmacological interventions aimed at reducing morbidity and mortality. Several medical therapies have been suggested; e.g., glucocorticoids, convalescent plasma, specific antibodies, lopinavir-ritonavir, remdesivir, and hydroxychloroquine. Treatment with the glucocorticoid dexamethasone has shown effect on mortality in patients with COVID-19, but predominantly in patients requiring supplemental oxygen or mechanical ventilation6. The use of convalescent plasma received early recognition as a viable treatment option in critically ill COVID-19 patients7, but results from randomized clinical trials are lacking. One of the most promising treatment modalities in the face of the COVID-19 pandemic is specific neutralizing antibodies, which would provide precision therapy overcoming the inherent lack of specificity provided by convalescent plasma8. Treatment with lopinavir–ritonavir has so far failed to demonstrate any benefit in COVID-19 beyond standard care9. The results for remdesivir appear more promising, but evidence is still conflicting10,11. Retrospective data examining the clinical effect of hydroxychloroquine in COVID-19 are also diverging12,13, and properly conducted and adequately powered randomized trials with peer-reviewed reports are accordingly still needed to assess the therapeutic value of hydroxychloroquine on clinical outcomes in patients with COVID-19.

Our current investigation has several limitations. The study was nonblinded without placebo treatment and we recognize that the lack of blinding may have influenced the standard care treatment and decision making by the treating physician, ultimately affecting our results. However, the study outcome was SARS-CoV-2 viral load, and study personnel performing the RT-qPCR and statistical analyses were blinded concerning group allocation. We assume that change in viral load in the upper respiratory tract is a valid measure for ongoing viral replication, but we did not perform analyses differentiating viable from nonviable virus. Oropharyngeal samples were obtained for the viral analyses, contrary to the common practice of nasopharyngeal sampling for the diagnosis of upper airway respiratory viruses. Nasopharyngeal sampling is however associated with significant discomfort for the patient, possibly to a degree leading to study discontinuation. A recent report by Wölfel et al.14 found no significant differences in viral load when comparing naso- and oropharyngeal sampling for SARS-CoV-2. Wang et al.10 found comparable viral loads in upper and lower respiratory tract samples, suggesting that SARS-CoV-2 viral kinetics can be studied in the upper respiratory tract. Stringent scientific support for this assumption is however still lacking. Electrocardiograms were not routinely taken during trial conduction, barring us from assessing the effect of hydroxychloroquine therapy on corrected QT interval. Finally, due to early study cessation, sample size was less than planned with resulting lower study power. Based on the exact effect estimates and standard deviations observed in this report, a future clinical trial would require a sample size of at least 928 subjects (allocation ratio 1:1 with 464 in each arm) to detect a significant difference between groups with a two-sided α = 0.05 and β = 0.80. Sample size calculations under the assumption of increasing effect sizes of the intervention effect, and constant standard deviations from the current report are listed in Supplementary Table 3 and Supplementary Fig. 3.

In conclusion, therapy with hydroxychloroquine did not impact SARS-CoV-2 viral kinetics in patients admitted to hospital with moderately severe COVID-19. Our results suggest no important antiviral effect of hydroxychloroquine in humans infected with SARS-CoV-2.

Methods

Trial design

The NO COVID-19 Study is a single center, two-arm, open label, group-sequential, pragmatic randomized controlled trial of hydroxychloroquine sulfate in adults hospitalized with COVID-19. Patients were randomly assigned to receive hydroxychloroquine sulfate (at a dose of 400 mg twice daily for 7 days) in addition to standard care or standard care alone15. Standard care was similar for all patients included in the study, and encompassed appropriate level and intensity of medical treatment according to local and national guidelines. No stratification was used for the computer randomization procedure. Because of rapidly decreasing incidence of COVID-19 in Norway, the trial was prematurely stopped by the trial sponsor on May 25, 2020. The study protocol was approved by the Norwegian Regional Committees for Medical Research Ethics (REC no. 121446) and the Norwegian Medicines Agency. The study was performed according to standard rules for Good Clinical Practice, with statistical methods and stopping rules described in the protocol and statistical analysis plan, with detailed descriptions in the trial open source repository (https://doi.org/10.17605/OSF.IO/U34R9).

Patients

All reverse transcriptase polymerase chain reaction (RT-qPCR) SARS-CoV-2 positive patients 18 years of age or older were eligible for study inclusion, if they had moderately severe COVID-19 at admission (NEWS216 of 6 or less). For patients who tested positive for SARS-CoV-2 before admission, SARS-CoV-2 status was verified with the external laboratory. Exclusion criteria included (1) the need of admission to intensive care unit on hospital admission, (2) history of psoriasis, (3) reduced hearing/tinnitus, (4) visual impairment, (5) known adverse reaction to hydroxychloroquine sulfate, (6) pregnancy, or (7) prolonged corrected QT interval (>450 ms). All study participants provided written informed consent before study inclusion.

Data on coexisting conditions were acquired from the hospital electronic patient records. Coronary artery disease was defined as history of myocardial infarction, coronary artery bypass grafting, or percutaneous coronary intervention. Diabetes was defined as history of diabetes mellitus type 1 or type 2, and the use of antidiabetic medication. Hypertension was defined as history of hypertension and the use of antihypertensive medication. Obstructive pulmonary disease was defined as history of chronic obstructive pulmonary disease or asthma. Obesity was defined as body mass index of 30 kg/m2 or above. Current smoking was defined as daily consumption of cigarettes.

Primary outcome

The primary outcome was rate of decline in SARS-CoV-2 viral load in the oropharynx from baseline through the first 96 h after randomization, using a single batch of swabs and a standardized sampling procedure to saturate them. Oropharyngeal swab samples were taken from patients at inclusion, at 48 h and at 96 h, by a selected group of study physicians. For analysis, total nucleic acids were extracted from 300 µL of each specimen using the Maxwell® RSC Viral total Nucleic Acid Purification Kit (Promega, Madison, Wisconsin, USA) according to the manufacturer’s instructions and eluted in 50 µL nuclease-free water. SARS-CoV-2 detection was performed in duplicate by RT-qPCR on 5 µL nucleic acid eluate in a total reaction volume of 25 µL on a QuantStudio™ 7 Flex Real-Time PCR System (Thermofisher Scientific, Waltham, Massachusetts, USA), according to the protocol published in January 2020 by Corman et al.17 that targets the viral E-gene of sarbecoviruses. For each patient, all samples in the time series were analyzed in the same extraction and PCR set-up. Single batches of all reagents for extraction and PCR were used for all samples in the study. SARS-CoV-2 RNA quantitation was calculated using a serial dilution of a the synthetic Wuhan coronavirus 2019 E gene RNA control comprising the viral region to be amplified, provided by the European Virus Archive Global (EVAg). Viral loads are expressed in log10 RNA copies/mL transport medium. The limits of detection (LoD) and quantitation (LoQ) of the assay are of 2.11 and 2.55 log10 RNA copies/mL, respectively. For data analyses, results below LoD (SARS-CoV-2 RNA not detected) were set to 0 log10 RNA copies/mL, and results below LoQ were set to the mean between LoD and LoQ values (i.e., 2.36 log10 RNA copies/mL). A qPCR assay targeting human β-globin was performed on all samples, where no viral RNA was detected for assessment of sample adequacy18. For the intention-to-treat population (n = 51), all actual samples were positive for human β-globin analysis, indicating adequate sample quality. Further details regarding study sampling and analysis can be found in the Supplementary Methods.

Secondary outcomes

The secondary outcomes include in-hospital mortality, mortality at 30 days, clinical status on a 7-point ordinal scale (1. dead, 2. hospitalized, on invasive mechanical ventilation or extracorporeal membrane oxygenation, 3. hospitalized, on noninvasive ventilation or high flow oxygen devices, 4. hospitalized, requiring supplemental oxygen, 5. hospitalized, not requiring supplemental oxygen, 6. not hospitalized, but unable to resume normal activities, 7. not hospitalized, with resumption of normal activities) at 14 days after randomization, duration of hospital admission after randomization and change in degree of illness as quantified by NEWS216 from randomization to 96 h. Further details on the secondary outcomes can be found in the study protocol in the trial open source repository (https://doi.org/10.17605/OSF.IO/U34R9).

Statistical analyses

The analysis of the primary outcome was pre-specified and detailed in the statistical analysis plan (https://doi.org/10.17605/OSF.IO/U34R9). The primary outcome was analyzed using a generalized linear mixed model, with subject-specific random intercept and slope in the full analysis set (FAS, all randomized subjects who have had at least one baseline and one postrandomization evaluation of efficacy). The analysis of the secondary endpoints was based on the prespecified description of analysis of secondary continuous, categorical and time-to-event endpoints in the study protocol (version 1.3, dated 26.03.2020). The analysis of the ordinal endpoint was not specified in the protocol and was decided post hoc before secondary endpoint analysis. The secondary endpoints were all analyzed in the FAS and based on available data. Because of the low number of deaths (one in each arm) in-hospital and at 30 days after randomization, we used descriptive statistics only for these endpoints. We used descriptive statistics to report the number of participants in each level of the 7-point clinical status scale at day 14 after randomization. We performed an ordinal logistic regression with the 7-point scale as dependent and group allocation as independent variable to compare the two study arms. We used the Kaplan–Meier method to calculate the time from randomization to hospital discharge in each study arm. Deceased participants were censored at the time of death. We used a log-rank test for equality of survivor functions to compare time from randomization to hospital discharge in the two groups. NEWS2 was assessed at several time points each day during in-hospital stay. We calculated the daily mean NEWS2 for each participant from randomization to 96 h postrandomization. We analyzed change in NEWS2 from baseline to 96 h postrandomization using a linear mixed model with fixed treatment by time and random intercept. The complete statistical analysis plan can be found in the trial open source repository (https://doi.org/10.17605/OSF.IO/U34R9). The statistical analyses were performed with STATA 16 (StataCorp LP, College Station, TX).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.