Abstract
Objectives To investigate longitudinal trajectory of SARS-CoV-2 neutralising antibodies and the performance of serological assays in diagnosing prior infection and predicting serum neutralisation titres with time
Design Retrospective longitudinal analysis of a COVID19 case cohort.
Setting NHS outpatient clinics
Participants Individuals with RT-PCR diagnosed SARS-CoV-2 infection that did not require hospitalization
Main outcome measures The sensitivity with which prior infection was detected and quantitative antibody titres were assessed using four SARS-CoV-2 serologic assay platforms. Two platforms employed SARS-CoV-2 spike (S) based antigens and two employed nucleocapsid (N) based antigens. Serum neutralising antibody titres were measured using a validated pseudotyped virus SARS-CoV-2 neutralisation assay. The ability of the serological assays to predict neutralisation titres at various times after PCR diagnosis was assessed.
Results The three of the four serological assays had sensitivities of 95 to100% at 21-40 days post PCR-diagnosis, while a fourth assay had a lower sensitivity of 85%. The relative sensitivities of the assays changed with time and the sensitivity of one assay that had an initial sensitivity of >95% declined to 85% at 61-80 post PCR diagnosis, and to 71% at 81-100 days post diagnosis. Median antibody titres decreased in one serologic assay but were maintained over the observation period in other assays. The trajectories of median antibody titres measured in serologic assays over this time period were not dependent on whether the SARS-CoV-2 N or S proteins were used as antigen source. A broad range of SARS-CoV-2 neutralising titres were evident in individual sera, that decreased over time in the majority of participants; the median neutralisation titre in the cohort decreased by 45% over 4 weeks. Each of the serological assays gave quantitative measurements of antibody titres that correlated with SARS-CoV-2 neutralisation titres, but, the S-based serological assay measurements better predicted serum neutralisation potency. The strength of correlation between serologic assay results and neutralisation titres deteriorated with time and decreases in neutralisation titres in individual participants were not well predicted by changes in antibody titres measured using serologic assays.
Conclusions SARS-CoV-2 serologic assays differed in their comparative diagnostic performance over time. Different assays are more or less well suited for surveillance of populations for prior infection versus prediction of serum neutralisation potency. Continued monitoring of declining neutralisation titres during extended follow up should facilitate the establishment of appropriate serologic correlates of protection against SARS-CoV-2 reinfection.
Introduction
The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of COVID-19, has resulted in a global pandemic with hundreds of thousands of deaths and millions of illnesses. Diagnosis of SARS-CoV-2 infection is principally dependent on RT-PCR using nasal and throat swabs, which is not ideally suited to mass population testing and as such has largely been targeted at symptomatic individuals in many settings. RT-PCR-diagnosed case numbers have therefore underestimated the prevalence of SARS-CoV-2 infection, and serology assays must be deployed to determine the true number of infections using a surveillance approach. Serology assays also have a critical role in screening volunteers for vaccine trials and convalescent plasma donations, as well as predicting infection or vaccine-induced immunity. Although several commercially available SARS-CoV-2 immunoassays are in common use, evaluation of their sensitivity has often used samples from hospitalised patients soon after infection. Knowledge of the long-term kinetics of antibody titres and the corresponding effectiveness of commercial assays is imperative if these testing protocols are to be accurately interpreted1-3.
Serology assays for SARS-CoV-2 primarily employ viral nucleocapsid (N) or the spike surface protein (S) antigens. Because S binds to target cells through its receptor binding domain (RBD) it is the target of neutralising antibodies. Therefore, S-based assays may be preferable to N-based assays for the assessment of the risk of future re-infection4. Of course, this premise is based on the assumptions (1) that neutralising antibodies constitute a major mechanism of protective immunity, and (2) that S-based serology assays accurately predict neutralising antibody activity.
Thus, major outstanding questions remain about the utilisation of serology that have implications for ongoing public heath testing strategies for SARS-CoV-2. These questions include (1) how circulating antibody levels that are specific for each viral antigen change with time following natural infection and (2) which serological assays best predict protective immunity. As yet, the prognostic value of antibody measurements in situations where individuals may be re-exposed to reinfection has yet to be demonstrated. Nevertheless, it is important to understand post infection serology as measured using high throughput assays to enable correlates of protection to be established. Here, we present the results of a longitudinal antibody testing study on a cohort of mildly symptomatic, non-hospitalised COVID19 positive patients during the first few months of convalescence. We compare the ability of four high-throughput automated assays to diagnose prior SARS-CoV2 infection and to predict neutralising activity in convalescent serum.
Methods
Participants
Participants with prior RT-PCR-diagnosed COVID-19 were recruited. Recruits were surveyed to determine the date of the positive PCR test, the date of onset of symptoms, and if their symptoms required hospitalisation. Serum samples were taken at a baseline visit (~3.5 to ~8.5 weeks post PCR test), and 2 weeks (visit 2), 4 weeks (visit 3) and 8 weeks later (visit 4). In total, 97 participants, who were not hospitalised during the course of their illness completed at least 3 visits. The mean age of the participants was 44.2 years (21 – 65 y), with 70 female (72% of cohort) participants. At visit 1 (baseline), the average number of days between PCR test and visit 1 (baseline) was 40.8 days (24 – 61 days); at visit 2 (2 weeks post-baseline), the average number of days post-PCR test was 55.1 days (40 – 79 days); at visit 3 (4 weeks post-baseline), the average number of days post-PCR test was 69.8 days (55 – 95 days); at visit 4 (8 weeks post-baseline), the average number of days post-PCR test was 98.4 days (85 – 110 days). Ethical approval was obtained for this study to be carried out through the NHS Lothian BioResource. All recruits gave written and informed consent for serial blood sample collection. Patients and Public were not involved in the design of this research.
High throughput automated serology assays
Four commercial assays, that employ either S or N protein antigens and are designed for high throughput in healthcare settings were used. All the assays generate a qualitative positive/negative result based on assay-dependent signal thresholds. The Abbott SARS-CoV-2 IgG assay detects anti-N IgG using a two-step chemiluminescent microparticle immunoassay (CMIA) method with an acridinium-labelled anti-human IgG. The DiaSorin SARS-CoV-2 IgG assay is also a two-step CMIA method targeting undisclosed epitopes in the SARS-CoV-2 S protein and employs an isoluminol conjugated anti-human IgG. The Roche Anti-SARS-CoV total antibody assay is a two-step bridging electrochemiluminesent immunoassay (ECLIA) using ruthenium-labelled and biotin conjugated N protein. The Siemens SARS-CoV-2 total antibody assay is a one-step bridging CLIA method that detects antibodies against the RBD of the S protein, using acridinium and biotinylated S1 RBD. Assays were performed on the Abbott Architect and Diasorin Liason platforms (NHS Lothian), and the Roche Elecsys (NHS Lanarkshire) and Siemens Atellica (NHS Tayside) platforms. Serum, collected and stored according to the manufacturer’s recommendations, was used in all cases.
SARS-CoV-2 Neutralisation assays
To measure neutralising antibody activity, serial dilutions of serum, beginning with a 1:12.5 dilution were five-fold serially diluted in 96-well plates over four dilutions. Thereafter, approximately 5×103 infectious units of an HIV/CCNG/nLuc/SARS-CoV-2 pseudotype virus were mixed with the serum dilutions at a 1:1 ratio and incubated for 1 hour at 37 degrees in a 96-well plate. The mixture was then added to 293T/ACE2cl.22 target cells5 plated at 1×104 cells/well in 100 ^l medium in 96-well plates the previous day. Thus, the final starting serum dilution was 1:50. Cells were cultured for 48h and harvested for NanoLuc luciferase assays, as previously described5.
Results
The cohort studied herein consists of participants who were not hospitalised during the course of their illness and were therefore relatively mildly symptomatic. Participants were invited to report on the occurrence and frequency of symptoms. Approximately 70% of people reported at least one of the 3 main WHO -identified symptoms, namely fever, cough and anosmia. The most common of symptom was anosmia and the majority of participants reported the presence of 2 of these 3 symptoms (Table 1). Serum samples were collected from 97 participants at ~ 4 weeks (visit 1), 6 weeks (visit 2) and 8 weeks (visit 3) post diagnosis (by RT-PCR). Additionally, serum was collected from a subset (28 of the 97 participants) at ~ 12 weeks post diagnosis (visit 4).
We compared the diagnostic sensitivity of 4 high throughput SARS-CoV-2 serology assays that are in routine use in hospital settings. Each assay gives a qualitative positive or negative result, based on assay specific thresholds and sensitivities were calculated for each assay using these thresholds. Inter and intra-assay analytical precision for each assay is detailed in Supplementary Table 1. To account for the differences in time post PCR diagnosis that participants made their first visit, sensitivity across a 20 day rolling time window was calculated. The Abbott, Roche and Siemens assays all had sensitivities of 95 to100% at 21-40 days post PCR-positive test, while the Diasorin assay had a lower sensitivity of 85% (fig 1A). However, the relative sensitivities of the assays changed with time. Specifically, the sensitivity of the Abbott assay declined to 85% in the 61-80 day window, and 71% at >81 days post diagnosis (fig 1A).
Conversely, the sensitivities of the other assays were maintained or increased over time (fig 1A). In terms of intra-individual change, 14/91 participants that were positive on the Abbott assay at visit 1 were negative by visit 3 or 4, whereas none of the participants with a positive result at visit 1 on the other assays became negative at visit 3 or 4. For the Diasorin assay, 2 participants that were negative at visit 1 were positive at visit 3 (both participants had an equivocal result at visit 1, and showed a small increase above the assay threshold at visit 3). In the Siemens assay, 3 participants were consistently negative, and in the Roche assay only a single participant was negative at each visit.
The serological assays give a quantitative assessment of antibody titre as well as a threshold-based positive/negative result. We next analysed changes in the quantitative results over time for each platform (fig 1 B, C). Mean antibody titres decreased in the Abbott assay at visits 2 and 3 compared to visit 1 (fig 1B) but increased in the Diasorin and particularly the Roche assays and remained approximately constant in the Siemens assay (Fig 1B). Notably, 79 out of 97 (81%) of participants showed a decrease in antibody titre on the Abbott platform, while 82/97 (85%) showed an increase on the Roche assay, despite the fact that both assays detect N-specific antibodies (fig 1 B, C). Negative or positive change was approximately equally likely in the S-based assays; specifically, 57% and 47% of intra-individual changes were negative for the Diasorin and Siemens assays respectively (fig 1 B, C).
We measured neutralising activity in serum samples from the first 3 visits for 80 of the 97 participants using a SARS-CoV-2 pseudotyped virus neutralisation assay. This assay employs HIV-1-based virions carrying a nanoluc luciferase reporter, pseudotyped with the SARS-CoV-2 spike protein. We have previously shown that neutralisation titres obtained using these pseudotyped particles correlate well with titres obtained using neutralisation of authentic SARS-CoV-25, Moreover, this assay has been successfully applied for analysis of convalescent plasma samples and in a campaign to identify potent human monoclonal antibodies6,7. Consistent with our analyses of other cohorts6,7, a broad range of neutralising titres were evident in sera collected from 80 participants at three timepoints (fig 2A). In samples collected at visit 1, the neutralising activity, as determined by half-maximal neutralising titre (NT50), ranged from <30 to 4300, with a geometric mean of 234 (arithmetic mean was 411) (fig 2A, red symbols). Consistent with other cohorts 6,7 34/80 (42%) had NT50 of less than 250 while only in 11/80 participants (14%) had NT50 values higher than 1000.
NT50 values measured at each timepoint for individual participants correlated with each other, although there was divergence in NT50 values over time (fig 2 A inset). Notably, neutralising activity decreased at each time point for the majority of participants (fig 2 A, blue and green symbols). Overall, the decrease in median NT50 was ~25% per two-week sampling interval, resulting in a ~45% reduction in NT50 over the 4 weeks between visit 1 and visit 3 (fig 2B). As a result, distribution of NT50 values the cohort differed between visits (fig 2C). The relative decline in NT50 between visits 1 and 2 versus visits 2 and 3 did not differ significantly, and the majority of participants exhibited a similar relative decrease in neutralising activity over time, regardless of their initial NT50 values or the number of days post PCR at visit 1, suggesting exponential decay (fig 2D).
NT50 values at each sampling timepoint were poorly correlated with age (Supplementary fig 1A), and no correlation was observed between age and NT50 decay with time. As has been previously reported, there was a trend toward lower NT50 values in females than in males6,7, but there was no difference between sexes in NT50 decay with time (Supplementary fig 1B). Individual clinical parameters such as GI symptoms, fever or recovery time, did not predict NT50, serological values or decay parameters for any antibody measurement.
Next, we compared neutralising activity in serum with quantitative results obtained from the serological assays. Analysis of combined results from the three visits by 80 participants revealed a significant correlation between any combination of two serological assays (Supplementary fig 2).However, stronger correlations were observed between the two S-based assays, Siemens and Diasorin (r=0.92, p<0.0001) and between the two N-based assays Abbott and Roche (r=0.81, p<0.0001), The S-based assays correlated less well, but significantly (p<0.0001), with the N-based assays (Supplementary fig 2).
All the serological assays gave quantitative values that correlated with NT50 measurements, but as expected, the S-based assay measurements correlated more closely with NT50 measurements (fig 3A-D). The S1/S2-based Diasorin assay, was the best predictor of NT50 (r=0.84, p<0.0001, fig 3A), followed by the RBD-based Siemens assay (r=0.74, p<0.001, fig 3B), the N-based Abbott assay (r=0.69, p<0.0001, fig 3C) and, lastly, the Roche assay (r=0.56, p=0.0001, Fig 3D).
The correlation between NT50 and the individual serological assays was best at the first visit and deteriorated to some extent thereafter (fig 3A-D, see color-coded r-values in individual graphs, p<0.0001 for all correlations), The decrease in the strength of correlation might, in part, be attributable to the fact that later sampling timepoints have more samples with lower NT50 values, which may reduce measurement precision. The magnitude of the deterioration in the predictive value differed between serological assays, with the S-based assays exhibiting larger decreases in correlation coefficients (r=0.89 and 0.83 at visit 1, versus r=0.83 and 0.71 at visit 3 for DiaSorin and Siemens assays respectively fig 3A-D), Despite the increasing disparity over time, the DiaSorin assay was clearly superior at predicting NT50 at all visits (fig 3A-D).
Interestingly, comparison of the extent of change in neutralisation activity over the 4-week observation interval with the concomitant change in values obtained using serological assays, revealed only minimal correlation (fig 3E-H, supplementary fig 3). Notably, in most participants, the decline in serum neutralising activity was clearly greater than the decline in antibody titre measured using any serological assay (fig 3E-H supplementary fig 3). Even for the Diasorin assay, which gave the best prediction of neutralising activity at each time point (fig 3A), declines in neutralising activity were not well predicted by declines in Diasorin assay measurements (fig 3E, supplementary fig 3). While both the Abbott assay and the NT50 measurements exhibited declining antibody titres with time, the magnitudes of these declines did not correlate with each other (fig 3G, supplementary fig 3).
Discussion
Serological assays for infectious agents have two major and distinct uses, namely (1) to diagnose chronic infections (e.g. HIV-1) and (2) to determine past infection or immunisation status (e.g. measles, VZV) which may be able to predict immunity from future infection. For example, HIV-1 and HCV serological tests are crucial for diagnosis but have no role in prediction of immunity. In contrast, for viruses such as VZV and measles, the main role of serology is to predict immunity induced by vaccination or prior infection. The use of serological assays to determine widespread seroprevalence is a relatively new application following the COVID19 pandemic, which is different from how these assays have traditionally been used clinically and requires understanding of how these assays perform in populations over time.
During the current SARS-CoV-2 pandemic it has become clear that the magnitude of serologic immune responses is highly variable6,7. Nevertheless, the vast majority of individuals with a PCR-confirmed SARS-CoV-2 infection generate antibodies at a sufficient level for diagnosis of recent infection8. A number of commercial assays have been deployed for high throughput SARS-CoV-2 antibody testing in a clinical setting, and evaluated mostly using hospitalized participants9,10. Non-hospitalised patients and those with mild disease typically have lower levels of antibodies than hospitalized patients with severe illness11-15, Using our cohort of non-hospitalized participants with mild disease, all four assays evaluated herein had sensitivities at visit 1 (an average of 40.8 days after PCR testing) that were comparable to the evaluations performed for these platforms using hospitalised patients16. This would therefore make all four assays suitable for the detection of COVID-19 antibodies shortly after infection as a confirmatory test for diagnostic purposes, when used in conjunction with RT-PCR assays and clinical history
However, differences in assay diagnostic sensitivity become apparent at later time points. Specifically, the sensitivity of the widely used Abbott assay declined with time, to ~70% at >81 days post PCR. Consequently, this assay is not appropriate for seroprevalence studies, for identification of SARS-CoV-2 naive vaccine trial participants, or for investigation of individuals presenting with long term chronic symptoms. Altering the positive/negative threshold, may mitigate this issue17, but would not ultimately alter the downward trend in assay signal over time. Notably our study is one of the few that would capture this information, as most other studies have examined seroconversion at early time points14,18-20. Reasons for the differences in assay performance over time are unclear but cannot be attributed solely to the choice of antigen. Although other studies have attributed a decline in sensitivity of an N based assay to an inherent difference in the dynamics of S versus N antibodies21 our findings do not support this contention, at least during the first ~100 days of convalescence. Both Abbott and Roche assays employ the N-proteins as an antigen, but Abbott assay titres decline while those in the Roche assay increase during this time period. One possible explanation for this difference is the use of an antigen bridging approach in the Roche assay, where declines in the total amount of antibody might be compensated by increases in affinity or avidity as antibodies mature through somatic hypermutation. Alternatively, it is possible that the range of N epitopes recognized by sera might change with time. Whatever the explanation, it is clear that that the trajectories of antibody titres measured using assays based on recognition of the same or related antigens can differ22-25.
Overall, given their superior sensitivity at each of the time points investigated thus far, the Siemens and Roche assays appear most appropriate for diagnosis of prior SARS-CoV-2 infection, at least within 4 months of SARS-CoV-2 infection, and would report a higher population prevalence than Abbott or DiaSorin assays in the 1 to 4 month post infection period.
While the Roche assay exhibited the best diagnostic sensitivity and is therefore well suited for serosurveillance during this time period, it had the lowest ability to predict neutralising antibody titres. This finding might be expected, as neutralising antibodies are directed to the S protein while N-specific antibodies are not expected to be neutralising. The Diasorin assay best predicted neutralising titres, and marginally outperformed the Siemens assay in this regard, perhaps because the dominant neutralising and/or S-binding activity in at least some sera is provided by antibodies that recognize epitopes outside the RBD26,27. It is important to recognize however, that many S-binding antibodies are not neutralising - measurements of S-binding antibodies remain correlates of, and not direct measures of, neutralising antibodies7.
Very recent reports have also indicated that neutralising antibody titres decline with time23 24, while another study reported that neutralisation titres remained stable for at least 3 months post infection28. However, in the latter case neutralisation titres were inferred based on a serologic ELISA measurement that was calibrated using a neutralisation assay performed on a small subset of samples. As shown herein, neutralising antibody levels indeed decline in most patients, even those who apparently maintain S-binding or RBD-binding antibody titres measured in serology assays. Thus, the trajectory of neutralising antibody levels cannot necessarily be deduced from serologic measurements of S-binding antibodies, even though S-binding antibodies and neutralising titres are broadly correlated.
Key questions in SARS-CoV-2 serology are the trajectory of of the antibody response and to what degree the titres of neutralising antibodies, or antibodies that simply bind to S or N correlate with protection from reinfection or severe disease. Serological studies based on hCoV infection have shown that many adults possess detectable circulating antibodies against OC43 and 229E29, and children seroconvert to NL63 and 229E before ~3.5 years of age30. These baseline levels increased upon infection, returning to baseline within one year. High levels of circulating neutralising antibody correlated with protection from re-infection with the same strain of virus31,32. However, hCoV re-infections occur32,33, with more mild illness and shorter duration of virus shedding. Thus, in the case of seasonal hCoV, these data suggest that immunity may wane over time. More limited data is available for SARS-CoV and MERS-CoV, although it suggests antibody responses also decline in the majority of infected individuals34.
If, as seems plausible, neutralising antibodies constitute a major protective mechanism against SARS-CoV-2 infection, then the use of serological assays that use S-based antigens and correlate best with NT50 measurements would appear most appropriate for prognostication of immunity. Conversely, if other mechanisms of immunity, such as long-lived memory T-cell responses play a dominant role in protection from infection or severe disease35-38, then the optimal choice of antigen for serology assays might differ. Because detailed analyses of T-cell responses are not currently feasible in a high throughput clinical setting, future work should examine the frequency of reinfection and clinical outcomes in cohorts with detailed longitudinal analyses of serum antibodies to both N and S antigens to determine the prognostic value of such measurements.
Data Availability
Raw de-identified data for serology measurements and NT50 measurements is available from the authors on request.
Contributor and guarantor information
Contributors: HW, SJ, TH and PDB conceived and designed the study. HW, BB, MS, ES, CR, JM, SC, EF, NG, GH, KT and SJ acquired and analysed data using the serological assay platforms. FM and JCCL performed the neutralisation assays. FM and TH did additional data analysis. HM, BB, SJ and PDB wrote the first draft of the manuscript. HM, BB, FM, TH, SJ and PB critically reviewed and revised the draft. All authors approved the final version of the manuscript for submission. HW, BB and MS contributed equally. SJ and PDB are the guarantors. The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.
Copyright statement
The Corresponding Author has the right to grant on behalf of all authors and does grant on behalf of all authors, an exclusive licence on a worldwide basis to the BMJ Publishing Group Ltd to permit this article (if accepted) to be published in BMJ editions and any other BMJPGL products and sublicences such use and exploit all subsidiary rights, as set out in our licence
Competing interest statement
All authors have completed the Unified Competing Interest form (available on request from the corresponding author) and declare: no support from any organisation or financial relationships with any organisations that might have an interest in the submitted work in the previous three years, no other relationships or activities that could appear to have influenced the submitted work.
Transparency declaration
Paul Bieniasz and Sara Jenks (Guarantors) affirm that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; there were no discrepancies from the study as planned.
Details of ethical approval
Ethical approval was obtained for this study to be carried out through the NHS Lothian BioResource. All recruits gave written and informed consent for serial blood sample collection. The Rockefeller University IRB reviewed and approved the study.
Funding
This work was supported by the NHS and Grants from the National Institutes of Allergy and infectious Diseases R37AI640003 (to PDB) and R01AI078788 (to TH). There were no study sponsors. The funders played no role in the design, analysis or reporting of this research.
Data sharing statement
Raw de-identified data for serology measurements and NT50 measurements is available from the authors on request. we plan to disseminate the results to study participants and or patient organisations
Checklist
A filled in checklist for appropriate study type is attached
Supplementary table
Supplementary Figures
Acknowledgements
We acknowledge the support and hard work of the NHS Lothian BioResouce team including in particular David Harrison for his support with initiating this study; Frances Rae and Craig Marshall for their advice and Linda MacLeod for her advice and hard work. NHS Lothian laboratories and out-patients teams have also worked extremely hard to enable the provision of this research study. In particular we would like to thank Joan Donnelly for her support and Susan Taylor for her dedication and hardwork.