Abstract
Infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) induces a complex antibody response that varies by orders of magnitude between individuals and over time. Waning antibody levels lead to reduced sensitivity of serological diagnostic tests over time. This undermines the utility of serological surveillance as the SARS-CoV-2 pandemic progresses into its second year. Here we develop a multiplex serological test for measuring antibodies of three isotypes (IgG, IgM, IgA) to five SARS-CoV-2 antigens (Spike (S), receptor binding domain (RBD), Nucleocapsid (N), Spike subunit 2, Membrane-Envelope fusion) and the Spike proteins of four seasonal coronaviruses. We measure antibody responses in several cohorts of French and Irish hospitalized patients and healthcare workers followed for up to eleven months after symptom onset. The data are analysed with a mathematical model of antibody kinetics to quantify the duration of antibody responses accounting for inter-individual variation. One year after symptoms, we estimate that 36% (95% range: 11%, 94%) of anti-S IgG remains, 31% (9%, 89%) anti-RBD IgG remains, and 7% (1%, 31%) anti-N IgG remains. Antibodies of the IgM isotype waned more rapidly, with 9% (2%, 32%) anti-RBD IgM remaining after one year. Antibodies of the IgA isotype also waned rapidly, with 10% (3%, 38%) anti-RBD IgA remaining after one year. Quantitative measurements of antibody responses were used to train machine learning algorithms for classification of previous infection and estimation of time since infection. The resulting diagnostic test classified previous infections with 99% specificity and 98% (95% confidence interval: 94%, 99%) sensitivity, with no evidence for declining sensitivity over the time scale considered. The diagnostic test also provided accurate classification of time since infection into intervals of 0 – 3 months, 3 – 6 months, and 6 – 12 months. Finally, we present a computational method for serological reconstruction of past SARS-CoV-2 transmission using the data from this test when applied to samples from a single cross-sectional sero-prevalence survey.
Introduction
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), causing coronavirus disease 2019 (COVID-19), has led to widespread morbidity and mortality since its emergence. The response to the SARS-CoV-2 pandemic is critically dependent on surveillance data, most notably numbers of COVID-19 associated hospital admissions and deaths recorded through health systems surveillance, as well as numbers of cases confirmed SARS-CoV-2 positive by PCR-based testing1. Other tools are providing crucial complementary information, for example genomic surveillance has been key to tracking the emergence of novel SARS-CoV-2 variants2. Serology, based on the detection of antibodies induced by previous infection with SARS-CoV-2, represents another category of surveillance information3,4. Appropriately designed sero-prevalence studies can provide estimates of the proportion of a population who have been previously infected. Although no substitute for health systems surveillance, sero-prevalence studies have the advantage of accounting for asymptomatic cases, and symptomatic individuals who do not present to health systems. Sero-prevalence studies also provide information on the status of SARS-CoV-2 epidemics in situations where record keeping by health systems is not possible5.
Infection with SARS-CoV-2 induces diverse humoral and cellular immune responses6. Humoral immunity includes antibodies of several immunoglobulin isotypes targeting SARS-CoV-2 proteins, most notably Spike (S) and Nucleocapsid (N). The concentration of antibodies in blood varies substantially between individuals, and with time since infection6,7,8,9,10,11. Studies of the duration of immunity to a range of coronaviruses demonstrated that antibodies remain detectable six years after infection, but continue to wane12. Longitudinal follow-up of individuals infected with SARS-CoV-2 indicates a pattern of waning of antibody responses consistent with other coronaviruses13,14. Within the first three months, antibody levels boost sharply and wane rapidly. Over a longer interval of eight months, antibody levels wane more slowly6. These observations can be explained by the bi-phasic nature of antibody kinetics15. In the first three months, antibodies are predominantly generated by short-lived plasma cells in secondary lymphoid organs. The long-term response is dominated by antibodies from long-lived plasma cells in the bone marrow. This pattern of bi-phasic waning is observed for infection- and vaccine-induced antibody responses to a wide range of pathogens16,17,18,19.
Initial concerns that waning antibody responses would lead to reduced sensitivity of SARS-CoV-2 serological diagnostics over time have unfortunately been confirmed20. Using self-administered lateral flow diagnostic tests, a sero-prevalence study of 365,000 adults after the first epidemic wave in the UK observed significantly declining sero-prevalence during July to September 20204. Using a laboratory-based assay for measuring anti-N IgG antibodies, a sero-prevalence survey of blood donors from the Brazilian city of Manaus estimated that 26% of the population had previously been infected21. Application of a statistical adjustment to account for antibody waning led to an increased estimate of 76% previously infected. As the pandemic progresses, this problem of declining sensitivity of serological diagnostics is likely to get worse, potentially undermining the utility of sero-prevalence studies.
Serological diagnostics typically classify a sample as positive if a measured antibody level is greater than a defined cutoff. Analysis of quantitative rather than binary antibody levels provides additional information, for example antibody levels are associated with time since infection, symptom severity, and sex22. However, large variation in antibody levels between individuals prevents this from having predictive value: detected antibodies could be from a recent infection, or due to immunological memory of an infection that cleared a year ago. This limitation has recently been overcome for a range of pathogens through the combination of multiplex assays and classification algorithms. Using machine learning algorithms to analyse quantitative antibody responses to multiple antigens, the time since previous infection can be estimated for Plasmodium falciparum malaria23,24, P. vivax malaria25, and cholera26.
In this study, we use multiplex assays to measure antibodies to SARS-CoV-2 in health care workers and hospitalized patients followed for up to eleven months after infection, and apply mathematical models to characterize antibody kinetics in the first year following infection. Classification algorithms were developed that minimize the reduction in the sensitivity of serological tests over time, in addition to estimating time since previous infection from a single blood sample. Finally, we present a method for serological reconstruction of past SARS-CoV-2 transmission using samples from a single cross-sectional survey.
Methods
Samples
A panel of 407 negative control serum or plasma samples was assembled from pre-pandemic cohorts (before December, 2019) with ethical approval for broad antibody testing (Table 1). This included 258 serum samples from healthy adult French blood donors, 81 serum samples from Peruvian healthy adult donors, and 69 plasma samples from healthy adult donors from the Thai Red Cross. A panel of 407 positive control serum samples was assembled from individuals with recent SARS-CoV-2 infection. This included 72 samples from patients in Paris hospitals27,28, 161 samples from Strasbourg healthcare workers, all confirmed positive by PCR-based testing29. Also included were 174 samples from community members of Crépy-en-Valois, France, confirmed seropositive by flow cytometry based testing30,31.
The duration of antibody responses following SARS-CoV-2 infection was studied in longitudinal cohorts of hospitalized patients and healthcare workers. 213 serum samples from 194 patients in Dublin hospitals were collected, with date post symptom onset extending to four months. 724 serum samples from 347 healthcare workers in Strasbourg hospitals were collected, with date post symptom onset extending to nine months32.
In April 2020, our team implemented a study of the sero-prevalence of SARS-CoV-2 in healthcare workers from Institut Mutualiste Montsouris, a hospital in Paris. Serum samples were collected from 769 healthcare workers, and tested with our multiplex assay (Appendix Figure 1). Healthcare workers who tested sero-positive in April 2020 were invited to present a second sample in January 2021. In total we obtained follow-up samples from 29 healthcare workers.
Serological assays
We set up a 9-plex bead-based assay allowing simultaneous detection of antibody responses to five SARS-CoV-2 antigens and four seasonal coronaviruses (Spike proteins of NL63, 229E, HKU1, OC43) in 1µL serum or plasma samples that were heat-inactivated at 56°C for 30 minutes. SARS-CoV-2 antigens were from Spike (whole trimeric Spike (S), its Receptor Binding Domain (RBD) and Spike subunit 2 (S2)), nucleocapsid protein (N) and a Membrane-Envelope fusion protein (ME). ME and S2 antigens were purchased from Native Antigen, Oxford, UK and all other antigens were produced as recombinant proteins at Institut Pasteur. The mass of proteins coupled on beads was optimized to generate a log-linear standard curve with a pool of 27 positive sera prepared from RT-qPCR-confirmed SARS-CoV-2 patients. We measured the levels of IgG, IgA and IgM of each sample in three separate assays. Briefly, serum was incubated with mixed antigen-coupled beads for 30 minutes at a 1/200 final dilution for IgG or 1/400 for IgA and IgM. Secondary antibodies conjugated to R-phycoerythrin (Jackson Immunoresearch) were used at 1/120, 1/200 or 1/400 for detection of specific IgG, IgA and IgM respectively. All dilutions and cycles of washing steps were done in phosphate buffer saline supplemented with 1% bovine serum albumin and 0.05% (v/v) Tween-20. On each assay plate, two blanks (only beads, no serum) were included to control for background signal as well as a standard curve prepared from two-fold serial dilutions (1/50 to 1/102,400) of a pool of positive controls. Plates were read using a Luminex® MAGPIX® system and the median fluorescence intensity (MFI) was used for analysis. A 5-parameter logistic curve was used to convert MFI to relative antibody unit (RAU), relative to the standard curve performed on the same plate to account for inter-assay variations.
The data from our multiplex assay was compared against data from two different neutralization assays with live virus using a subset of serum samples. Firstly, we implemented an S-Fuse assay as described elsewhere32. Neutralization activity was measured as the reciprocal dilution required to obtain a 50% reduction in neutralization (IC50). Secondly, we implemented a foci reduction neutralization test (FRNT) based on the detection of neutralizing antibodies directed against SARS-CoV-2. This assay was performed under BSL-3 conditions as it facilitates infection of African green monkey kidney cells (VeroE6; ATCC CRL-1586) with live-virus of a Cambodian SARS-CoV-2 isolate (GISAID: EPI_ISL_411902). Infection is visualized 14-16h after inoculation by staining of infected cells with a SARS-CoV-2 specific antibody (# ABIN1030641), targeting the S2 subunit of the viral spike protein. Neutralizing antibody titers are expressed as the reciprocal serum/plasma dilution that induces 50% reduction of infection (FRNT50) and is calculated by log probit regression analysis.
Mathematical model of antibody kinetics
SARS-CoV-2 antibody kinetics are described using a previously published mathematical model of the immunological processes underlying the generation and waning of antibody responses following infection33. β denotes the boost in antibody secreting plasma cells. It is assumed that a proportion ρ of plasma cells are short-lived (Ps) waning at rate cs, and a proportion 1 – ρ are long-lived (Pl) waning at rate cl. Plasma cells generate antibodies (IgG, IgM or IgA) at rate g, and r is the rate of decay of antibody molecules.
Statistical inference was implemented within a mixed-effects framework allowing for characterisation of the kinetics within all individuals while also describing the population-level patterns (Statistical Appendix). On the population level, both the mean and variation in antibody kinetics are accounted for. Models were fitted in a Bayesian framework using Markov chain Monte Carlo methods with priors informed by estimates from long-term studies of antibody kinetics following infection with other coronaviruses20.
Classification algorithms
Measurements of antibodies of three isotypes (IgG, IgM, IgA) to multiple SARS-CoV-2 antigens were used to create a training dataset. Samples where the time post symptom onset was ≤ 14 days or unknown were excluded. In total we had 407 samples from pre-pandemic negative controls and 1402 positive samples. In a first step, a Random Forests binary classification algorithm was developed. A threshold corresponding to 99% specificity was selected. In a second step, a Random Forests multiway classification algorithm was developed for categorizing samples into four classes: (i) negative; (ii) infected ≤ 3 months ago; (iii) infected 3 – 6 months ago; and (iv) infected 6 – 12 months ago. The algorithm was calibrated to have 99% specificity for correctly classifying negative samples. Positive samples were then classified according to the maximum number of votes. Uncertainty in classification performance was assessed via 1000-fold cross-validation with a training set comprising two thirds of the data and a disjoint testing set comprising a third of the data. Classification algorithms were implemented in R (version 3.4.3).
Statistical methods for serological surveillance
Imperfect diagnostic sensitivity causes a downwards bias in sero-prevalence estimates, whereas imperfect specificity causes an upwards bias in seroprevalence estimates. This bias can be corrected for using a Rogan-Gladen estimator, with incorporation of cross-validated uncertainty34. Multiway classification algorithms provide estimates of the proportion of a population infected during different time intervals. These estimates are subject to biases which can be adjusted for using a multivariate extension of the Rogan-Gladen estimator described in the Statistical Appendix.
Ethics
Serum samples were biobanked at the Clinical Investigation and Access to BioResources platform at Institut Pasteur (Paris, France). Samples were obtained from consenting individuals through the CORSER study (NCT04325646), directed by Institut Pasteur and approved by the Comité de Protection des Personnes Ile de France III, and the French COVID cohort (NCT04262921), sponsored by Inserm and approved by the Comité de Protection des Personnes Ile de France VI. Samples from French blood donors were approved for use by Etablissement Français du Sang (Lille, France) and approved through the CORSER study by the Comité de Protection des Personnes Ile de France VI. Sample collection in Hôpital Cochin was approved by the Research Ethics Commission of Necker-Cochin Hospital. Samples from healthcare workers in Strasbourg University Hospitals followed longitudinally were collected as part of an ongoing clinical trial (ClinicalTrials.gov Identifier: NCT04441684) which received ethical approval from the Comité de Protection des Personnes Ile de France III. Samples collected from patients in Dublin received ethical approval for study from the Tallaght University Hospital (TUH)/St James’s Hospital (SJH) Joint Research Ethics Committee (reference REC 2020–03). Use of the Peruvian negative controls was approved by the Institutional Ethics Committee from the Universidad Peruana Cayetano Heredia (SIDISI 100873). The Human Research Ethics Committee at the Walter and Eliza Hall Institute of Medical Research and the Ethics Committee of the Faculty of Tropical Medicine, Mahidol University, Thailand, approved the use of the Thai negative control samples. Informed written consent was obtained from all participants or their next of kin in accordance with the Declaration of Helsinki.
Results
SARS-CoV-2 antibodies over time
Antibody responses of three isotypes (IgG, IgM, IgA) to nine coronavirus antigens were measured in 407 pre-pandemic serum samples, and 1402 serum samples from individuals with previous SARS-CoV-2 infection. 961/1402 of the positive samples were from individuals with SARS-CoV-2 infection confirmed by PCR-based testing with available data on time post symptoms. Figure 1 presents the IgG, IgM and IgA antibody responses to SARS-CoV-2 S, RBD, and N measured over time. Appendix Figures 2 and 3 presents the antibody responses to S2, ME, and the Spike proteins of the four human seasonal coronaviruses. Notably there is substantial inter-individual variation in antibody responses, with antibody levels varying by orders of magnitude between individuals. As a measure of functional immunity, we applied live virus neutralizing assays to a subset of samples. Substantial variation in neutralizing activity between individuals and over time (Appendix Figure 4).
Modelled SARS-CoV-2 antibody kinetics
Our data on measured SARS-CoV-2 antibody responses was supplemented with data from six other longitudinal studies of the SARS-CoV-2 antibody response, and one longitudinal study of the SARS-CoV-1 antibody response. Appendix Figure 5 provides a comparison of the measured antibody responses between the eight studies. The data have been rescaled so that the average antibody response for each study equals one at day 14 after symptom onset. In addition to the large inter-individual variation, there is notable variation in antibody levels between studies. For example, the large dynamic range in the study by Iyer et al8, reveals the boosting and subsequent decay of anti-S IgG antibodies in the first two months post infection, a phenomenon that is missed by assays without a comparably high upper limit of detection.
A mathematical model of antibody kinetics was simultaneously fitted to data from all eight studies. Appendix Figure 6 provides an overview of the fit of the model to the data. Figure 2a provides example fits to data from one individual from each of the eight studies. Figure 2b provides model predictions of the percentage antibody level remaining over the first two years post infection, where 100% is assumed to be the antibody response at day 14 following symptom onset. There are considerable differences in the pattern of waning between isotypes and between antigens. Table 2 summarises the duration of the antibody response. One year following symptom onset, we estimate that 36% (95% range: 11%, 94%) anti-S IgG remains, 31% (9%, 89%) anti-RBD IgG remains, and 7% (1%, 31%) anti-N IgG remains. The uncertainty represents the considerable degree of inter-individual variation in the duration of the antibody responses. Antibodies of the IgM isotype waned more rapidly, with 6% (0%, 27%) anti-S IgM remaining after one year, 9% (2%, 32%) anti-RBD IgM remaining after one year, and 15% (4%, 50%) anti-N IgM remaining after one year. Antibodies of the IgA isotype also waned rapidly, with 18% (4%, 67%) anti-S IgA remaining after one year, 10% (3%, 38%) anti-RBD IgA remaining after one year, and 3% (0%, 13%) anti-N IgA remaining after one year. We also observed comparable reductions in titres for viral neutralization over time (Appendix Figure 4), although the small number of samples prevented application of our model.
Estimating time since previous SARS-CoV-2 infection
Using a dataset on measured IgG, IgM and IgA antibody levels from pre-pandemic negative controls and samples from individuals with SARS-CoV-2 infection confirmed by PCR-based testing and followed for up to eleven months after symptom onset, Random Forests binary classification algorithms were trained to identify individuals with previous SARS-CoV-2 infection (Appendix Figures 7 and 8). Next, a Random Forests multi-way classification was trained to simultaneously identify previous infection and estimate the time since infection (Figure 3). The diagnostic test identified samples from individuals with previous SARS-CoV-2 infection with 99% specificity and 98% (95% confidence interval: 94%, 99%) sensitivity (Figure 3a). The diagnostic test classified samples from individuals infected within the previous 3 months (Figure 3b). Notably, it was easier to distinguish recent infections (<3 months) from older infections (6 – 12 months), compared to infections that occurred 3 – 6 months ago. There was limited statistical signal to distinguish between infections that occurred 3 – 6 months ago, and older infections occurring more than 6 months ago (Figure 3c). A breakdown of classification performance by time since infection is provided in Figure 3D. The diagnostic test accurately classifies samples of all categories, with the exception of samples from individuals infected 3 – 6 months ago. Many of these samples were incorrectly classified in the neighbouring infection categories of 0 – 3 months or 6 – 12 months.
Serological reconstruction of COVID-19 epidemics
When applied to samples from a cross-sectional sero-prevalence survey, a diagnostic test capable of estimating time since infection can provide a serological reconstruction of past SARS-CoV-2 epidemics. Figure 4 presents four simulated epidemics where 1,000 individuals were sampled, and 40% were previously infected. A range of scenarios were simulated, including a recent epidemic wave, an older epidemic wave, a double wave epidemic, and constant transmission. The algorithm was able to accurately reconstruct all epidemic profiles. For example, in the recent wave scenario 60% of the population were negative and the model estimated 59.8% (95% CI: 59.3%, 61.4%); 35.8% were infected 0 – 3 months ago and the model estimated 35.5% (95% CI: 25.7%, 40.3%); 4.2% were infected 3 – 6 months ago and the model estimated 4.2 % (95% CI: 0.0%, 14.4%); and 0% were infected 6 – 12 months ago and the model estimated 0% (95% CI: 0.0%, 3.3%).
Discussion
Infection with SARS-CoV-2 induces a complex, diverse immune response, that varies by orders of magnitude between individuals, and changes over time. This diversity is a challenge to immunologists and vaccinologists, but presents an opportunity to diagnostic developers armed with multiplex assays and machine learning algorithms. By quantifying the kinetics of different components of the humoral immune response, it is possible to provide classification of previous infection that minimizes the reduction of diagnostic sensitivity over time, and also allows estimation of the time since infection.
Based on data from a range of studies with up to eleven months follow up after symptom onset, we estimate that 31% (95% CrI: 9%, 89%) of anti-RBD IgG antibody levels remain one year after infection. Antibodies of other isotypes waned more rapidly, with 9% (95% CrI: 2%, 32%) of anti-RBD IgM antibody levels remaining after one year, and 10% (95% CrI: 3%, 38%) of anti-RBD IgA antibody levels remaining after one year. There was considerable variation in kinetic profiles between different SARS-CoV-2 antigens, with 7% (95% CrI: 1%, 31%) of anti-N IgG antibody levels remaining after one year. Although the determinants of the duration of antigen-specific antibody responses remain poorly understood22, the diversity in patterns of antibody kinetics can be quantified using epidemiological studies, yielding valuable information for serological diagnostics.
The majority of commercially available serological tests classify individuals as having previous SARS-CoV-2 infection if a measured antibody response is greater than a defined cutoff35. Instead of reducing a complex antibody response to a binary data point, a mode detailed serological signature based on quantitative measurements of multiple antibody responses provides two notable advantages20. Firstly, the reduction in diagnostic sensitivity associated with waning antibodies is minimized – no reduction in diagnostic sensitivity over the eleven months of follow up was observed. Secondly, the time since previous infection can be estimated providing valuable additional epidemiological information. For the current assay, previous infections were categorised into intervals of 0 – 3 months, 3 – 6 months, and 6 – 12 months. More precise classification is possible in theory, but this must be balanced against the statistical signal. For example, there was little statistical signal to discriminate between infections that occurred 6 – 9 months ago, versus infections that occurred 9 – 12 months ago. It is anticipated that further improvements to the assay such as the incorporation of new antigens, more training samples with a range of time since infection, and better algorithms will lead to improvements in accuracy.
Existing sero-prevalence studies estimate the proportion of a population previously infected with SARS-CoV-2. The addition of a diagnostic tool capable of estimating time since infection allows for the serological reconstruction of past transmission trends. Thus, using samples from a sero-prevalence study collected at a single time point, we can discriminate between a scenario of constant SARS-CoV-2 transmission and a scenario where transmission occurs in distinct epidemic waves. This diagnostic technology has a range of possible applications. For countries that experienced double wave SARS-CoV-2 epidemics, it has been challenging to quantify the relative magnitude of these waves due to the time required to scale up PCR-based diagnostic testing36. Furthermore, there are many epidemiological settings where it is unknown if SARS-CoV-2 transmission was constant over time, or occurred as distinct epidemic waves5.
The widespread introduction of SARS-CoV-2 vaccines will lead to vaccine-induced immunity in many individuals. The majority of vaccines against SARS-CoV-2 target the Spike protein’s RBD. As our diagnostic assay and algorithms include Spike and RBD antigens, serum samples from vaccinated individuals would likely be classified as positive. However, by measuring antibody responses to Nucleocapsid, Membrane, Envelope, and other viral proteins37 it will be possible to modify the diagnostic test to provide three-way classification: negative; previously infected; or vaccinated. Identification of individuals who have been both infected and vaccinated will be challenging. Having classified an individual as previously infected or vaccinated, the diagnostic algorithm can then be used to estimate time since infection or time since vaccination. Such an approach could contribute to efforts to measure population-level immunity to SARS-CoV-2, whether induced by infection or vaccination.
There are several limitations to this study. Estimates of the duration of antibody responses are based on data from multiple studies, each using a unique immunoassay6,7,8,9,10,11,12. Every immunoassay may differ in terms of background reactivity, cross-reactivity with other pathogens, protein formulation, dynamic range and reproducibility. We believe that the benefit of drawing on multiple data sources outweighs the benefit of having a smaller more homogeneous database, especially since the mathematical model of antibody kinetics is sufficiently flexible to incorporate data from multiple assays. Our selection of a mechanistic mathematical model of antibody kinetics is a potential limitation. The model is based on a mechanistic understanding of the immunological processes underlying the generation and persistence of antibodies, and imposes a flexible functional form on how antibody levels change over time. Although this approach has been validated in a range of applications16,17,18,19, there will be instances where the model fails to capture the true pattern of antibody kinetics, for example in immune-deficient individuals. An advantage of a mechanistic model versus a non-parametric statistical model is the ability to make projections forward in time. We have provided predictions up to two years following infection, for example by estimating that 16% (5%, 48%) of anti-RBD IgG antibodies remain after two years. There is a risk to providing predictions beyond the timescale of the data, but these predictions can be easily falsified via continued longitudinal studies.
The diagnostic assay is not exempt from the challenges of antibody waning. Although no reductions in diagnostic sensitivity over time were observed, reduced sensitivity will likely be observed as we analyse additional samples over longer durations of follow up. There are substantial challenges in providing estimates of time since infection. Although the approach can reliably reconstruct the distribution of infection times across a population, there will be substantial uncertainty in estimates of the time since previous infection for individual samples.
Sero-prevalence studies are playing a critical role in monitoring the progress of the SARS-CoV-2 pandemic. In the early stages of the pandemic, immunoassays had the advantage of measuring high antibody levels in the initial months after infection. As the pandemic progresses, sero-prevalence will become more challenging to accurately measure due to waning antibody responses and increased vaccine-induced immunity. Multiplex assays and algorithms accounting for how antibody levels change over time may be an important tool for ensuring the ongoing utility of sero-surveillance.
Data Availability
Data and code will be made available online following peer review of the article. In the meantime, requests for data or code should be made to the corresponding author.
Declaration of interests
MTW, IM, JR, SPel, MB, and SPet are inventors on provisional patent PCT/US 63/057.471 on a serological antibody-based diagnostics of SARS-CoV-2 infection. TB and OS are coinventors on provisional patent PCT/US 63/020,063 entitled “S-Flow: a FACS-based assay for serological analysis of SARS-CoV-2 infection” submitted by Institut Pasteur. SK reports personal fees from Accelerate Diagnostics, Astellas, Merck Sharp & Dohme, Pfizer, and Menarini, and grants and personal fees from bioMérieux, outside of the submitted work.
Data availability
Data and code will be made available online following peer review of the article. In the meantime, requests for data or code should be made to the corresponding author.
Funding
This work was supported by the European Research Council (MultiSeroSurv ERC Starting Grant 852373; MW), l’Agence Nationale de la Recherche and Fondation pour la Recherche Médicale (CorPopImm; MW), and the Institut Pasteur International Network (CoronaSeroSurv; MW). JR was supported by the Pasteur Paris University (PPU) International PhD Program. CC was supported by the European Research Council 771813. MB and AL were supported by the « URGENCE COVID-19 » fundraising campaign of Institut Pasteur. NC and CNC are part-funded by a Science Foundation Ireland (SFI) grant, Grant Code 20/SPP/3685. LT has been awarded the Irish Clinical Academic Training (ICAT) Programme, supported by the Wellcome Trust and the Health Research Board (Grant Number 203930/B/16/Z), the Health Service Executive, National Doctors Training and Planning and the Health and Social Care, Research and Development Division, Northern Ireland (https://icatprogramme.org/). MB and AM were supported by Institut Pasteur TaskForce funding (TooLab project). SFK lab is funded by Strasbourg University Hospitals (SeroCoV-HUS; PRI 7782), the Agence Nationale de la Recherche (ANR-18-CE17-0028), Laboratoire d’Excellence TRANSPLANTEX (ANR-11-LABX-0070_TRANSPLANTEX), and Institut National de la Santé et de la Recherche Médicale (UMR_S 1109).
Appendix Figures
Statistical Appendix
Data on longitudinal antibody responses to SARS-CoV-2 proteins
Several teams have conducted studies of the duration of the antibody response against SARS-CoV-2 with varying durations of follow up. Our data analyses the duration of the SARS-CoV-2 antibody response over approximately eleven months. To better characterize the antibody kinetics over different time scales, we supplemented our database with data from six published studies on SARS-CoV-2 antibody kinetics. These data are summarized in Appendix Table 1. In addition to the studies on SARS-CoV-2 antibody kinetics, we also included data from one study of SARS-CoV-1 antibody kinetics in individuals with six years follow-up12.
Mathematical model of antibody kinetics
SARS-CoV-2 antibody kinetics are described using a previously published mathematical model of the immunological processes underlying the generation and waning of antibody responses following infection or vaccination20,33. For antibodies of three classes (IgG, IgM and IgA) targeting the studied antigens, the antibody kinetics are described by the following equations: where β denotes the boost in antibody secreting plasma cells. It is assumed that a proportion ρ of plasma cells are short-lived (Ps) waning at rate cs, and a proportion 1 – ρ are long-lived (Pl) waning at rate cl. Plasma cells generate antibodies
(IgG, IgM or IgA) at rate g, and r is the rate of decay of antibody molecules. Assuming Ps(0) = Pl(0) = 0 and A(0) = Abg, these equations can be solved analytically to give: δ is the time after symptom onset when antibody levels start to increase. B0 is the number of B cells, and g is the rate at which they secrete antibodies. As g and B0 are not both identifiable without detailed and invasive experiments (e.g. bone marrow aspirates to measure antigen-specific plasma cells), we estimate β = gB0. If r is the decay rate of antibody molecules, then we define to be the half-life of antibody molecules. Similarly, we define and .
Note that this model of antibody kinetics follows the formulation outlined in White et al.33 and not that outlined in Rosado et al.20 which included an additional equation for modelling the proliferation of memory B cells. Following experiments on model identifiability with simulated data, it was found that there was limited ability to recover model parameters describing memory B cell proliferation.
Methodology for statistical inference
The model was fitted to longitudinal antibody level measurements from all participants. Mixed effects methods were used to capture the natural variation in antibody kinetics between individual participants, whilst estimating the average value and variance of the immune parameters across the entire population of individuals. The models were fitted in a Bayesian framework using Markov Chain Monte Carlo (MCMC) methods. Mixed effects methods allow individual-level parameters to be estimated for each participant separately, with these individual-level (or mixed effects) parameters being drawn from global distributions.
We consider a scenario where we have data from M studies, each of which has data from Nm individuals. For example, for each participant n in study m the half-life of the short-lived ASCs may be estimated as (an individual-level parameter). The estimates of the local parameters will be drawn from a probability distribution. A log-Normal distribution is suitable as it has positive support on [0, ∞). Thus we have . The mean ds and the variance of the estimates of are given by and .
Model likelihood
For individual n from study m we have data on observed antibody levels Am,n= {a1, …, aJ} at times Tm,n = {t1, …, tJ}. We denote Dm,n = (Am,n, Tm,n) to be the vector of data for individual n from study m. For individual n from study m, the parameters are estimated. The model predicted antibody levels will be {A(t1), A(t2), …, A(tJ)}. We assume log-Normally distributed measurement error such that the difference between log(aj) and log (A(tj)) is Normally distributed with variance . The parameters for observational variance are specific for each study due to differences in the assays utilised. For model predicted antibody levels A(tj) the data likelihood for individual n from study m is given by
Mixed effects likelihood
As described above, for each individual there are seven parameters to be estimated. These individual-level parameters are drawn from population-level distributions. Some parameters are assumed not to depend on the assay in each study (the antibody kinetic parameters: ). Each of these parameters are assumed to be drawn from the same distribution across all studies. Some parameters are assay dependent, notably . These parameters are drawn from distributions specific for each study. The mixed effects likelihood can be written as follows: As the proportion of the ASCs that are long-lived must be bounded by 0 and 1, the individual-level parameters ρm,n are assumed to be drawn from logit-Normal distributions.
Total model likelihood
Denote D = {D1, …, DN} to be the vector of data for all N participants from all studies. We denote θm,n = (μA,m, σA,m, μβ,m, σβ,m, μδ, σδ, μs, σs, μ𝓁, σ𝓁, μa, σa, μρ, σρ, θ1, …, θN) to be the combined vector of population-level parameters and individual-level parameters to be estimated. The total likelihood is obtained by multiplying the likelihood for each participant
Markov Chain Monte Carlo parameter update
The model was fitted to the data using Markov Chain Monte Carlo (MCMC) methods. A three stage parameter update regimen was utilised with a Metropolis-within-Gibbs sampler with sequential updating of individual-level parameters, population-level parameters, and observational variance parameters. A′ indicates an attempted update.
1. Individual-level parameter Metropolis-Hastings update
For each participant n in study m:
Update local parameters:
Calculate updated mixed effects likelihood
Accept the parameter update with probability
2. Population-level parameter Gibbs update
For each of the we obtain new estimates of the population level parameters μi′ and as follows: where μi,0 and τi,0 parameterise the Normal prior distribution on the mean, and k and θ parameterise the Gamma prior distribution on the precision (inverse standard deviation). is the individual-level parameter i in individual n.
For each of the i ∊ {δ, s, 𝓁, a} we obtain new estimates of the population level parameters μi′ and as follows: where μi,0 and τi,0 parameterise the Normal prior distribution on the mean, and k and θ parameterise the Gamma prior distribution on the precision (inverse standard deviation). is the individual-level parameter i in individual n.
For ρ we obtain new estimates of the population level parameters μ ′ and as follows: where μρ,0 and τρ,0 parameterise the Normal prior distribution on the mean, and k and θ parameterise the Gamma prior distribution on the precision (inverse standard deviation).
3. Observational variance parameter Metropolis-Hastings Update
For each study m, update the observational variance parameter σobs,m′
Calculate updated total likelihood Ltota𝓁(θ′|D)and the updated prior probability density P(θ′)
Accept the parameter update with probability
The MCMC algorithm was implemented in C++ complied in Microsoft Visual Studio. The covariance of the multivariate-Normal proposal distributions for Metropolis-Hastings updates were adaptively tuned using the estimated posterior distributions during a burn-in phase of 1 million MCMC iterations. The magnitude of the proposed step size was calibrated using a Robbins-Munro algorithm to achieve an acceptance rate of approximately 23%. The total number of MCMC iterations was 10,000,000. The effective number of iterations was calculated using the effectiveSize routine in the R library coda and the effective size was checked to be > 1,000 for all parameters. The statistical inference procedure was repeated twice to allow for assessment of chain convergence using Gelman-Rubin convergence diagnostics Rc. For all population-level chains, we ensure Rc< 1.05. For all individual-level chains, we ensure Rc < 1.1.
Serological surveillance algorithms
A ROC curve obtained from a training data set consisting of positive and negative samples is described by a sequence of estimated sensitivities and specificities{E(sei, spi)}, where E denotes an estimator. N-fold cross-validation generates samples of sensitivity {sei1, …, seiN} for each E(spi) and samples of specificity {spi1, …, spiN} for each E(sei). Following a previously outlined approach34, for each pair i of sensitivity and specificity, we obtain N estimates of the measured seroprevalence Min in a scenario with true seroprevalence T as follows: Following the Rogan-Gladen estimator approach, this equation can be rearranged to give an adjusted estimate of true seroprevalence: with E(Tin) = 0 if Min < 1 − E(sei). Both Mi and E(Ti) are summarized as medians with 95% ranges. Define αK as a K-way classification algorithm that can classify a sample into one of K categories. Let C0 be the category for a sample from an individual not previously infected with SARS-CoV-2. Let Ck be the category for individuals previously infected with SARS-CoV-2 in one of K – 1 time intervals defined by (0, t1, …, tK-1). αK can be any of a range of algorithms from Random Forests, Support Vector Machines, or ordinal logistic regression. αK can be trained on a data set, with classification performance assessed using N-fold cross-validation. For a serum sample, let Pk denote the classification score (analogous to a probability) that this sample belongs to category k.
Given a vector of classification scores Pk, we can assign a serum sample to a category. A frequently used approach is to select k according to the maximum of Pk. In order to retain better control over diagnostic specificity, we select an alternative two step approach:
if P0 ≥ γ99 then the sample is classified as negative
if P0 < γ99 then the sample is classified into category k according to the maximum of P1, …, PK-1
Here γ99 is selected based on the training data to ensure a target specificity of 99%. The classification performance of this approach can be assessed using a confusion matrix Cij defined by the proportion of samples of category i classified as category j. For the example with K = 4 explored in more detail here, we define three intervals: 0 – 3 months; 3 – 6 months; 6 – 12 months.
Following the approach of the Rogan-Gladen estimator, define M to be the vector of measured prevalences in each of the K categories. An estimator of the true sero-prevalence is provided by E(T) = C-1 * M where C-1 is the inverse of the confusion matrix. Depending on the values of the confusion matrix, this approach may have Tk < 0. In this instance we assign Tk = 0 and transfer the predictions proportionally to the other categories.
Acknowledgements
The French COVID cohort is supported by the REACTing consortium and by the French Directorate General for Health. Jessica Vanhomwegen (Institut Pasteur) is thanked for provision of a MAGPIX machine. Jérôme Hadjadj and Laura Barnabei (Institut Imagine, Paris) are thanked for their work on the Hôpital Cochin study. Dionicia Gamboa (Cayetano Heredia University, Lima) is thanked for sharing negative control samples from Peru. Jetsumon Sattabongkot (Mahidol University, Bangkok) is thanked for sharing negative control samples from Thailand. Marie-Noelle Ungeheuer and Blanca Liliana Perlaza are thanked for processing samples at the Clinical Investigation and Access to BioResources platform in Institut Pasteur. We thank all patients and health-care workers who kindly agreed for samples to be used for medical research purposes.