Abstract
High frequency screening of populations has been proposed as a strategy in facilitating control of the COVID-19 pandemic. Here we use computational modeling, coupled with clinical data from a rapid antigen test, to predict the impact of frequent rapid testing on COVID-19 spread and outcomes. Using patient nasopharyngeal swab specimens, we demonstrate that the sensitivity and specificity of the rapid antigen test compared to quantitative real-time polymerase chain reaction (qRT-PCR) are 84.7% and 85.7%, respectively; moreover, sensitivity correlates directly with viral load. Based on COVID-19 data from three regions in the United States and São José do Rio Preto, Brazil, we show that high frequency, strategic population-wide rapid testing, even at varied accuracy levels, diminishes COVID-19 infections, hospitalizations, and deaths at a fraction of the cost of nucleic acid detection via qRT-PCR. We propose large-scale antigen-based surveillance as a viable strategy to control SARS-CoV-2 spread and to enable societal re-opening.
INTRODUCTION
The COVID-19 pandemic has taken an unprecedented toll on lives, wellbeing, healthcare systems, and global economies. As of 1 September 2020, there have been more than 25.5 million confirmed cases globally with more than 850,000 confirmed deaths1. However, these numbers and the current mapping of disease spread present an incomplete picture of the outbreak largely due to the lack of adequate testing, particularly as undetected infected cases are the main source of disease spread2-7. It is estimated that the reported detection rate of actual COVID-19 cases is only 1-2%5. As of September 2020, the United States and Brazil remain the top two countries with the highest number of COVID-19 cases and deaths worldwide. As countries begin to re-open their economies, a method for accessible and frequent surveillance of COVID-19, with the necessary rapid quarantine measures, is crucial to prevent the multiple resurgences of the disease.
The current standard of care rightfully places a strong focus on the diagnostic limit of detection, yet frequently at the expense of both cost and turnaround time. This situation has contributed to limited population testing largely due to a dearth of diagnostic resources. Quantitative real-time polymerase chain reaction (qRT-PCR) is the gold-standard method for clinical diagnosis, with high sensitivity and specificity, but these tests are accompanied by the need for trained personnel, expensive reagents and instrumentation, and a significant amount of time to execute. Facilities offering qRT-PCR sometimes require a week or longer to complete and return the results to the patient. During this waiting period the undiagnosed individual may spread the infection and/or receive delayed medical treatment. Moreover, due to the cost and relative inaccessibility of qRT-PCR in both resource-limited and abundant settings, large-scale screening using qRT-PCR at frequent intervals remains impractical as a way to identify infected but asymptomatic or mildly symptomatic infections. Numerous studies have reported asymptomatic COVID-19 cases as well as a variation in viral load within and between individuals at different time points, suggesting the need for more frequent testing for informative surveillance.
Technologies alternate to qRT-PCR, such as rapid viral antigen detection, clustered regularly interspaced short palindromic repeats (CRISPR), and loop-mediated isothermal amplification (LAMP) of SARS-CoV-2 provide potential large-scale screening applications, yet their implementation is stymied by requirements for qRT-PCR-like accuracy before they can reach the market8. In countries such as India, where the qRT-PCR resources would not be sufficient to cover monitoring of the population, the use of rapid antigen tests is well underway9,10. In early May 2020, the United States Food and Drug Administration (FDA) authorized the first antigen test for the laboratory detection of COVID-19, citing a need for testing beyond molecular and serological methods. Antigen testing detects the viral proteins rather than nucleic acids or human antibodies, allowing for detection of an active infection with relative ease of sample collection and assay. These rapid assays – like other commercially-available rapid antigen tests – can be mass-produced at low prices and be administered by the average person without a laboratory or instrumentation. These tests also take as little as 15 minutes to determine the result, enabling real-time surveillance and/or diagnosis. Although antigen tests usually perform with high specificities (true negative rate), their sensitivity (true positive rate) is often lower when compared to molecular assays. While qRT-PCR can reach a limit of detection as low as 102 genome copies per mL11, rapid antigen testing detects viral protein that is assumed to correlate with approximately 105 genome copies per mL.
We hypothesize that frequent antigen-based rapid testing even with lower sensitivities compared to qRT-PCR – along with appropriate quarantine measures – can be more effective at decreasing COVID-19 spread than less frequent molecular testing of symptomatic individuals. Keeping in mind the realities of daily testing in resource-limited regions, we also hypothesize that testing frequency can be adjusted according to the prevalence of the disease; that is, an uptick in reported cases should be accompanied by more frequent testing. During the viral incubation period, peak infectivity correlates with a high viral load that can be detected by either qRT-PCR or rapid antigen testing12-15. Rapid tests thus optimize diagnosis for the most infectious individuals. Studies also point to the relatively small window of time during an individual’s incubation period in which the qRT-PCR assay is more sensitive than rapid tests12.
In this study we report the development and clinical validation of a direct antigen rapid test for detection of SARS-CoV-2 spike glycoprotein using 121 retrospectively collected nasopharyngeal swab specimens. Using the clinical performance data, we develop a modeling system to evaluate the impact of frequent rapid testing on COVID-19 spread and outcomes using a variation of a SIR model, which has been previously used to model COVID-19 transmission16-22. We build on this model to incorporate quarantine states and testing protocols to examine the effects of different testing regimes. This model distinguishes between undetected and detected infections and separates severe cases, specifically, those requiring hospitalization, from those less so, which is important for disease response systems such as intensive care unit triaging. We simulate COVID-19 spread with rapid testing and model disease outcomes in three regions in the United States and São José do Rio Preto, Brazil – the site of the clinical validation study – using publicly available data. To date, COVID-19 modeling describes the course of disease spread in response to social distancing and quarantine measures, and a previous simulation study has shown that frequent testing with accuracies less than qRT-PCR, coupled with quarantine process and social distancing, are predicted to significantly decrease infections12,16,22-26. This is the first modeling system using publicly-available data to simulate how potential public health strategies based on testing performance, frequency, and geography impact the course of COVID-19 spread and outcomes. Our findings suggest that a rapid test, even with sensitivities lower than molecular tests, when strategically administered 2-3 times per week, will diminish COVID-19 spread, hospitalizations, and deaths at a fraction of the cost of nucleic acid testing via qRT-PCR.
RESULTS
Direct Antigen Rapid Test Accuracy Correlates with Viral Load Levels
Rapid antigen tests have recently been considered a viable source for first-line screening, although concerns about the accuracy of these tests persist. We developed a direct antigen rapid test in a lateral flow dipstick format for the detection of the spike glycoprotein from SARS-CoV-2 in nasopharyngeal swab specimens. Of the total number of nasopharyngeal swab specimens evaluated by qRT-PCR for amplification of SARS-CoV-2 RNA-dependent RNA polymerase (RdRp), nucleocapsid (N), and envelope (E) genes, 72 tested positive and 49 tested negative (Table 1).
The overall sensitivity and specificity of the rapid antigen test was 84.7% and 85.7%, respectively (Table 1). Our data demonstrate that the sensitivity of our test is positively correlated to the viral load level (Fig. 1, Table 2). The rapid test result was compared to the qRT-PCR cycle threshold (Ct) value measured across RdRp, N, and E genes, and calculated as sensitivity and specificity.
Ct value cutoffs represent the number of qRT-PCR cycles at which generated fluorescence crosses a threshold during the linear amplification phase. The x-axis of Figure 1 describes the percentile positives in the total positive population. Percentile Positives ranks the samples in order of high qRT-PCR Ct value to low. In other words, the higher the percentile, the more “positive” because fewer qRT-PCR cycles are required for gene detection. Because the Ct value is a variable unit based upon qRT-PCR protocol and instrumentation, we evaluated sensitivity against the percentile of cases across Ct values.
As the percentile positive increases, the sensitivity between the rapid test and the gold-standard qRT-PCR increases, reaching 100% sensitivity at 68.1% percentile positive for the gene average detection. Significantly, even at percentile positives between 0% and 68.1%, the sensitivity remains above 80%. Taken together, the clinical data shows that the rapid antigen test performs with increasing accuracy for individuals with a higher viral load, and are thus the most infectious13-15.
An Enhanced Epidemiological SIDHRE-Q Model
We propose an enhanced epidemiological modeling system, SIDHRE-Q, a variant of the classical SIR model in order to expand our clinical validation study and to understand the effects of using frequent rapid tests such as the rapid antigen test on COVID-19 outbreak dynamics. The changes we make to the basic model to encompass the unique characteristics of the COVID-19 pandemic are similar to those presented by Giordano et al.16 (Fig. 2). The differential equations governing the evolution of the SIDHRE-Q model and descriptions of the parameter values are provided in the material and methods section (Equation 2, Table 5).
An individual that begins in S may either transition to a Quarantine Uninfected (QU) state via a false positive result or to an Infected Undetected (I) state via interaction with an infected individual. Should an individual in S move into Q-U, they are quarantined for 14 days before returning to S, a time period chosen based on current knowledge of the infectious period of the disease. One could also conceive of an effective strategy in which individuals exit quarantine after producing a certain number of negative rapid tests in the days following their initial positive result or confirm their negative result using qRT-PCR.Given that those diagnosed are predominantly quarantined, individuals in I interact more with the S population than do those in Infected Detected (D). Therefore, the infectious rate for I is assumed to be significantly larger than for D. Furthermore, a region’s ability to control an outbreak is directly related to how quickly and effectively people in I test into D, reducing their infectiousness through quarantine. This study, in particular, highlights the critical role frequency of testing, along with strict quarantine, has in mitigating the spread of the disease and provides specific testing strategies based on rapid tests we predict to be highly effective.
In this model, we assume that individuals receive a positive diagnosis before developing severe symptoms and that those with symptoms severe enough to be potentially fatal will go to the hospital. If an individual develops symptoms, we assume they are tested daily until receiving a positive result; hence, before severe symptoms develop, they will be diagnosed with high probability. Those who do not develop symptoms are tested according to the frequency of tests administered to the general population. Therefore, there is no modeled connection between I and H or between I and E. Removing these assumptions would have negligible impact on the results as these flows are very small.
Should an individual test positive and transition to D, they may either develop serious symptoms requiring care or recover. Those who develop serious symptoms and transition to state H will then transition to either R or E. The recovered population is inevitably tested, as infected individuals may recover without being detected. Therefore, the Quarantined Recovered (Q-R) state is introduced with the same connections to R as the connections between S and Q-U. Though the reinfection rate of SARS-CoV-2 has been a point of recent debate, it is assumed that the number of re-infected individuals is small27-31. Therefore, individuals cannot transition from R to S, hence the separately categorized quarantined populations.
We considered several variations and extensions of the SIDHRE-Q model. In simulations, we tested additional states, such as those in the SIDARTHE model, which include distinctions between symptomatic and asymptomatic cases for both detected and undetected populations16. Incorporating information about the correlations between viral load and infectivity and sensitivity were also considered. Altogether, our modeling system has been well tuned to predict the impact of high frequency rapid testing on COVID-19 spread and outcomes.
Frequent Rapid Testing with Actionable Quarantining Dramatically Reduces Disease Spread
In order to demonstrate how strategies could affect the disease spread in different geographies and demographics, we used surveillance data obtained from regions of varying characteristics: the state of Massachusetts (MA), New York City (NYC), Los Angeles (LA), and São José do Rio Preto (SJRP), Brazil, the site of the rapid antigen test clinical validation study. These regions are also selected in our study due to the readily available surveillance data provided by the local governments. We fit the model to the data from each region starting 1 April 2020. At this time point the disease reportedly is most advanced in NYC and least advanced in SJRP, Brazil with estimated cumulative infection rates of 7.11% and 0.12%, respectively.
After calibrating the SIDHRE-Q model, the disease spread is observed with varying validated rapid antigen test performances and frequencies (Fig. 3). Sensitivity (the ratio of true positives to the total number of positives) and specificity (the ratio of true negatives to the total number of negatives) compared to gold-standard qRT-PCR were used as measures of test accuracy.
The rapid test frequency is varied while maintaining an accuracy of 80% sensitivity and 90% specificity, comparable to our clinical data collected in SJRP, Brazil. These testing scenarios are then compared to symptomatic testing, in which individuals receive a rapid test only when presenting symptoms, via either a rapid test or qRT-PCR. Since the primary testing regiment deployed in MA, LA, NYC and SJRP, Brazil is qRT-PCR-based and focused on symptomatic individuals, the symptomatic testing protocol via qRT-PCR is directly estimated from the data to be the rate v (Table 5).
The difference between the qRT-PCR and rapid test simulations (red and orange lines, respectively) is therefore only sensitivity of testing (Fig. 3). We assumed that test outcome probability is a function only of whether an individual is infected and independent of other factors; one can consider this a lower bound on effectiveness of a strategy, as sensitivity and infectivity are often positively correlated with antigen testing.
To better understand the effect of rapid testing frequency and performance on healthcare capacity and mortality rates, we simulate the testing strategy with 30%-90% sensitivity each with 80% or 90% specificity against the symptomatic testing strategy. Table 3 describes the different test performances implemented in the model matched with corresponding Figure 3 – Figure Supplement 1 plots.
As per our hypothesis, frequency and symptom-based testing dramatically reduced infections, simultaneous hospitalizations, and total deaths when compared to the purely symptom-based testing regiments, and infections, hospitalization, and death were reduced as frequency increased. Although testing every day was clearly most effective, even testing every fourteen days with an imperfect test gave an improvement over symptomatic testing with qRT-PCR. While the strategy works best when implemented at the very beginning of an outbreak, as demonstrated by the results in SJRP, Brazil, it also works to curb an outbreak that is already large, as demonstrated by the results in NYC. The difference between frequencies is more noticeable when the testing strategy is applied to the outbreak in NYC, leading us to hypothesize that smaller outbreaks require a lower testing frequency than larger ones; note the difference between the dependence on frequency to curb a small initial outbreak in SJRP, Brazil versus a large one in NYC (Fig. 4, Figure 4 – Figure Supplement 1).
For test performance of 80% sensitivity and 90% specificity, the percent of the population that has been infected in total from the beginning of the outbreak to mid-July drops from 18% (MA), 11% (LA), 26% (NYC), and 11% (SJRP, Brazil) to 3%, 2%, 12%, and 0.26%, respectively, using a weekly rapid testing and quarantine strategy (with regards to predictions of overall infection rates, other studies based on seroprevalence and epidemiological predictions have reached similar conclusions32,33). If testing is increased to once every three days, these numbers drop further to 1.6% (MA), 1.4% (LA), 9.4% (NYC), and 0.19% (SJRP, Brazil) (Table 4).
To further examine the relationship between frequency and sensitivity, we modeled the maximum number of individuals in a given state over the 105-day time period for four geographic regions (Fig. 4). In all four geographic regions, as frequency of testing increases, the total infections, maximum simultaneous hospitalizations, and total deaths converge to small percentages regardless of the sensitivity at high frequencies. It is clear that the difference in frequency required to achieve the same result using tests of differing sensitivities is very small (Fig. 4) For example, we predict that for the outbreak in LA, a testing strategy started on 1 April of every 10 days using a test of sensitivity 90% would have resulted in 2.5% of the population having been infected, while using a test of sensitivity 30% would require a strategy of every 5 days to achieve the same number. Thus, we conclude that frequency is more important than sensitivity in curbing the spread, and a large range of sensitivities prove effective when testing sufficiently often. How frequently, exactly, depends on the specific outbreak and what stage it is in, which leads us to the location-based deployment strategy discussed in a later section. Frequency of testing can be significantly reduced to effectively contain the disease once the initial outbreak has been controlled; it is clear that this takes only a matter of weeks (Fig. 3).
On the other hand, according to the specificity of the rapid test and the quarantine duration, larger testing frequency result in a larger percent of the population quarantined (Fig. 3). Assuming a 90% rapid test specificity and 14-day quarantine duration, for the 1, 3- and 7-day frequencies almost 60%, 38% and 20% of the population, respectively, would be quarantined. This figure may be reduced with additional rules for exiting quarantine early, such as after complementary testing. An example of such a strategy is that individuals who test positive are required to either quarantine for two weeks or produce two consecutive negative rapid tests in the two days following their positive result. Assuming 80% sensitivity and 90% specificity, those individuals will reenter the public while still infected with probability 0.04. If uninfected, that individual will exit quarantine after two days with probability 0.81. However, a compromise between the reduction of infections and the proportion of the population in quarantine would be part of the planning for the appropriate testing protocol in each community or region.
Additionally, while high frequency may be necessary to contain a large outbreak initially, relatively infrequent testing, such as every one or two weeks, is sufficient to keep controlled outbreaks small, while reducing the number of quarantined individuals to less than 10% of the population using a two-week mandatory quarantine.
A County-Based Testing Strategy Offers a Cost-effective Approach to Large-scale COVID-19 Surveillance
To examine the effects of resource-strategic testing schemes, we modeled the COVID-19 prevalence by varying testing frequency across counties of California. For this analysis, only California was analyzed because of the accessibility of the county level data. In this scheme, the percent of active infected detected individuals in a county determines the frequency of testing. We define thresholds for the number of active detected infections that, when hit, initiate testing protocols of different frequencies depending on the threshold hit. We first tested evenly spaced thresholds for the number of detected active infections up to 1% of the population, but later adopted thresholds that were determined according to Equation 1. In Equation 1, D = population of state D at the time of testing. T = number of active infections which, if reached, initiates everyday testing. The days between tests are rounded to the closest integer value.
The days between tests are chosen such that the detected active infections should remain near to or below T. If the initial detected active infections are greater than T, then the testing frequency of 1 will cause infections to rapidly drop. Both the threshold at which everyday testing begins and the coefficient of log2T/D can be modified to produce a strategy that is more or less frequent in testing or resource effective; a range of days between tests from 14 days to 1 day are used (Fig. 5). A scan over different choices of T is shown in the supplements to Figure 5; the threshold we choose in Figure 5 is 0.05% because it is successful in curbing the outbreak within the time period we consider. Whilethese choices work for the epidemic in California at the point we start our simulations, 10 April, they do not necessarily reflect the most resource effective choices everywhere. Our analysis could be redone to select the best strategy in other states or in the country as a whole.
Using a rapid test with a sensitivity of 80% and specificity of 90%, the county-based testing with threshold 0.05% reduces the active infections from 0.94% to 0.0005%, while the uniform strategy with tests administered every 7 days results in double the number of active infections (Fig. 5). As the threshold is reduced, the total cost increases while thecumulative infections, maximum percentage hospitalized, and cumulative deaths all decrease (Figure 5 – Figure Supplement 1).
Strategy B in Figure 5 consists of qRT-PCR testing uniformly applied to the highlighted population with a frequency of once weekly. The average cost per person per day is just under $15. Despite this frequency and the accuracy of qRT-PCR, the strategy does not succeed in curbing the spread as fast as strategy A, which uses a testing sensitivity and specificity of 80% and 90%, respectively, and testing frequency that vary between counties depending on the proportion of their population that is currently infected. The total cost for strategy A is estimated at a fraction of the other at $1.53 per person per day.
DISCUSSION
In this study we examine the potential effects of a novel testing strategy to limit the spread of SARS-CoV-2 utilizing rapid antigen test screening approaches. Our clinical data and SIDHRE-Q modeling system demonstrate that 1) frequent rapid testing even at a range of accuracies is effective at reducing COVID-19 spread, 2) rapid antigen tests are a viable source for this strategy and diagnose the most infectious individuals, and 3) strategic geographic-based testing can optimize disease control with the amount of available resources. The information from a diagnostic test itself is of tremendous value, as it can prompt the necessary quarantine measures to prevent spread, guide proper care and triage, and provide crucial disease-tracking information. Diagnostic testing in the United States and abroad, however, has been a significant public health hurdle. The public has witnessed and experienced symptomatic individuals being denied testing due to shortages, and few testing structures for asymptomatic or mildly symptomatic individuals – the main source of disease spread. Though several factors contributed to the stymied early response measures, such as lockdown and quarantine protocols and adherence, severe testing bottlenecks were a significant culprit34-36. Early control measures have been shown to decrease lives lost by several orders of magnitude37. These challenges, though exacerbated during the early months of the pandemic, remain at the forefront of the public health crises.
Diagnosis of SARS-CoV-2 infection by qRT-PCR is the current standard of care, yet remains expensive and requires a laboratory and experienced personnel for sample preparations and experimentation. Significantly, the turnaround time for results can be up to 10 days38. On an individual scale, this leaves the public in limbo, preventing people from either leaving quarantine if they are negative, or delaying critical care and infecting others if they are positive. On a societal level, this current testing scheme yields incomplete surveillance data on which response efforts such as societal reopening and hospital management depend. Though qRT-PCR is considered the gold-standard diagnostic method because of its high sensitivity and specificity, the logistical hurdles render it unrealistic for large-scale screening.
As qRT-PCR remains impractical for this strategy, and rapid tests are facing regulatory challenges because they do not perform with qRT-PCR-like accuracy, rapid test screening is either nonexistent in several countries or symptom-based. Even under best-case assumptions, findings have shown that symptom and risk-based screening strategies miss more than half of the infected individuals39. Some have argued that the need for widespread testing is overstated due to the variability in test sensitivity and specificity40. Our findings show, however, that test performance, though valuable, is secondary to widespread test frequency, which is enabled by accessibility and turnaround time.
Giordano et al. has modeled the evolution of SARS-CoV-2 spread, introducing a diagnosed state to elucidate the importance of population-wide testing16. Mina et al. has examined how various test sensitivities and frequencies affect the reproductive number12. We build upon these findings to show how in affected United States and Brazil regions, population-wide frequent and rapid testing schemes, with sensitivities ranging from 30%- 90%, can be more effective in curbing the pandemic than a PCR-based scheme. Integrating real-world surveillance and clinical data into our modeling system has allowed us to incorporate regional differences – such as variances in healthcare access, state health policy and adherence, state GDP, and environmental factors – under the same model. Significantly, our findings hold true across Massachusetts, New York City, Los Angeles, and São José do Rio Preto, Brazil. We also present the economic considerations of these testing regimes, showing that widespread rapid testing is more cost efficient than less frequent qRT-PCR testing. In line with these economic considerations, our model demonstrates the effectiveness of a geographic-based frequent testing regime, in which high disease prevalence areas receive more frequent testing than low disease prevalence areas.
Since COVID-19 is known to affect certain demographics differently, modeling would benefit from incorporating demographic information correlated with disease progression and spread to define sub-models and sets of parameters accordingly. Age, pre-existing conditions, job types, and density of population are examples of possible categories, each of which influence the risk of contracting and/or dying from COVID-19. Further studies may benefit from incorporating these ideas should more information become available.
Our findings also point to low-cost tools for implementation of this testing strategy, such as a rapid antigen-based test for the detection of SARS-CoV-2 proteins. We show that the rapid antigen test performs with a range of accuracies under which disease spread can be dramatically mitigated under our model. Notably, the sensitivity is correlated to the individual’s viral load, effectively diagnosing those who are most infectious with the highest accuracy. Our findings are significant because these rapid antigen tests are cheaper than qRT-PCR, can be mass produced to millions per day, present results within 15 minutes, and can be administered by a nonexpert without a lab or special equipment.
There are several policy implications for these findings. First, our model supports that systems of high frequency rapid testing should be implemented as a first-line screening method. This can be first enabled by a more holistic regulatory evaluation of rapid diagnostics, such that policy emphasizes accessibility and turnaround time even under a range of accuracies. One can imagine a less accurate, though rapid method of first-line screening in schools, public transportation, and airports, or even at home, and a qRT-PCR-based method for second-line screening (testing those who present severe symptoms or have been in contact with infected individuals, testing in a clinical setting, etc). Second, our cost analysis and rapid antigen test data present a viable and potentially more cost-effective method for screening. Third, our county-based testing scheme presents a possible method for wide-scale screening while optimizing resources. Future studies should investigate how this selective testing strategy can be applied to different location scales to further inform health policy. Moreover, though our models analyze regions in the United States and Brazil, similar testing strategies can be considered globally in both resource limited and abundant settings due to the higher accessibility of rapid tests compared to qRT-PCR.
We emphasize that integral to the effectiveness of diagnostic schemes is 1) the proper adherence to quarantine measures and 2) the combined use of a variety of diagnostic methods including nucleic acid, antigen, and antibody tests. According to these models, rapid antigen tests are an ideal tool for first-line screening. Clinical molecular tests such as qRT-PCR are vital to the diagnostic landscape, particularly to re-test suspected cases that were negative on the rapid test. Because rapid tests present a higher rate of false negatives, methods such as qRT-PCR remain integral to second-line screening. Antibody tests provide important information for immunity and vaccination purposes as well as epidemiological surveillance. This model also assumes that individuals will quarantine themselves before being tested and for 14 days following a positive diagnostic result.
Our simulations combined with real-world data demonstrate a robust modeling system and elucidates the significance of this novel testing strategy. However, there are important limitations to be considered. Differences in disease reporting between the geographical regions and the incomplete nature of COVID-19 surveillance data, often due to the lack of testing, are not considered in the model. It is imperative that the testing results, hospitalization and death statistics, and changes in protocol are reported in realtime to scientists and policy makers so that models can be accurately tuned as the pandemic develops. The model also does not take into account infrastructural limitations such as hospital capacity. Though the rapid antigen test offers several advantages such as affordability, fast turnaround time, and ease of mass production, we are also assuming that there are systems in place to implement frequent and safe low-cost screening across different communities and settings.
Our model underscores the need for a point-of-care or at-home test for frequent screening, particularly as lockdown restrictions ease. Regulatory agencies such as the FDA could work towards regulating rapid tests to alternative standards other than comparison to high sensitivity molecular diagnostics, as our model shows that frequency and scale of testing may overcome lower sensitivities. Rather, we could refocus policy to implement first-line screening that optimizes accuracy with efficiency and equitability.
MATERIAL AND METHODS
Development of Direct Antigen Rapid Test for the Detection of SARS-CoV-2
We developed a direct antigen rapid test for the detection of the spike glycoprotein from SARS-CoV-2 in nasopharyngeal swab specimens as previously described41. Briefly, the rapid antigen test is an immunochromatographic format with a visual readout using anti-spike mouse monoclonal antibodies (E25Bio, Inc., Cambridge, MA, USA) that are either coupled to 40 nm gold nanoparticles (Abcam, Cambridge, UK) or adsorbed to nitrocellulose membranes (Sartorius, Goettingen, Germany). Each rapid antigen test has a control area adjacent to the paper absorbent pad; the control is an anti-mouse Fc domain antibody (Leinco Technologies, Fenton, MO, USA) that will capture any of the antibody-conjugated gold nanoparticles to generate a control visual signal. A visual signal at the test area reflects SARS-CoV-2 spike glycoprotein that is “sandwiched” between an anti-spike glycoprotein antibody adsorbed to the nitrocellulose membrane and a second anti-spike glycoprotein antibody covalently coupled to visible gold nanoparticles.
Validation of Direct Antigen Rapid Test for the Detection of SARS-CoV-2
In a retrospective study of nasopharyngeal swab specimens from human patients, we compared the accuracy of the rapid antigen test to the viral load of individuals. Nasopharyngeal swab specimens (n = 121) were tested in Brazil following approved human subjects use protocols. The age of study participants ranged from 1 to 95 years with an overall median of 37 years (interquartile range, 27-51 years), and 62% were female. The demographic summary of the patients are included in Supplementary Table 1. The nasopharyngeal swab specimens were banked refrigerated or frozen samples from suspected patients submitted to the lab for routine COVID diagnosis. Prior to using the rapid test, the nasopharyngeal swab samples were validated by qRT-PCR using GeneFinder™ COVID-19 Plus RealAmp Kit (OSANGHealtcare, Anyang-si, Gyeonggi-do, Republic of Korea I). The primary study under which the samples and data were collected received ethical clearance from the Faculdade de Medicina de São José do Rio Preto (FAMERP), protocol number 31588920.0.0000.5415. All excess samples and corresponding data were banked and de-identified prior to the analyses.
A nasopharyngeal swab specimen (1 mL) was concentrated using a Vivaspin 500 centrifugal concentrator (Sartorius, Goettingen, Germany) at 12,000 x g for 10 minutes. The concentrated nasopharyngeal swab specimen retentate was transferred to a collection tube and the rapid antigen test was inserted into the tube with the retentate and allowed to react for 15 minutes. After processing of the rapid antigen test, the visual positive or negative signal was documented.
Data for Modeling
As of August 2020, the United States and Brazil have the highest number of confirmed COVID-19 cases and deaths worldwide, with both countries reporting their first case on 26 February 20201. Although several affected US regions could have been modeled, we look at data from Massachusetts, New York, and Los Angeles: these regions each contained “hotspots”, or areas of surging COVID-19 cases, at different points in time during the pandemic and have publicly available government-provided surveillance data. Our model is fit using data over 105 days beginning on April 1 for Figures 3 and 4 and 105 days beginning on April 10 for Figure 5 (see “Modeling Parameters” in Methods). In order to understand the various testing proposals on a global scale, we performed our clinical study in and expanded the modeling study to Brazil. The specific data we use to fit our model are cumulative confirmed cases, total deaths, and number of daily hospitalizations due to COVID-19. This surveillance data was retrieved from government-provided online databases42-48.
Modeling Parameters
Equation 2 below provides the exact differential equations governing the model.
In order to determine the values of the parameters defining the flows between states, we use a least squares regression performed at seven day intervals in the datasets to which we fit. This allows the model to take into account the time dependent nature of the parameters, which rely on factors such as social distancing regulations and changes in testing capacity. We also fit window sizes between 1 and 21 days and find that while the fit degrades with larger window size, the overall shape of the fits do not change. We choose seven days assuming policy changes take a week to become effective and that reasonable parameters can be expected to change within this time period. Also, the seven day window size accounts for the fact that often data is not reported as diligently over the weekend. Time series of the values of the parameters for the geographic locations discussed in this paper can be found in the supplemental materials for Figure 2.
Given the restrictions on data available for the populations of various states, varying all of the parameters results in an over parameterized system. Therefore, a subset of the model parameters are fit while the others are either extracted from other sources; see Table 5.
The fitting procedure minimizes the sum of the squared residuals of the total cases, current daily hospitalizations, cumulative deaths, and percentage of total infected individuals currently hospitalized. The first three are present in the data sets while the latter is derived from the estimates of the ratio between infected undetected to infected detected individuals from the CDC Laboratory Seroprevalence Survey Data49. While this ratio changes over time, the percentage of infected individuals developing severe symptoms should remain roughly constant throughout the course of the epidemic in the different locations studied.
We consider the data sets for outbreaks in MA, NYC, LA, and SJRP, Brazil42-47. While each location has testing and fatality information dating back to January, hospitalization data was not included until late March (for NYC and SJRP) and April (for MA and LA). Hence we begin our fitting procedure and testing strategy on 1 April for each of the data sets; by this point, the outbreak is advanced in NYC, substantial in MA, non-negligible, but far from its peak, in LA, and in early stages in SJRP, Brazil. Starting simulations at various stages of the outbreak allows one to see the difference in results between when a testing strategy is administered.
In order to determine the effectiveness of the county-based strategy when applied to the state of California, we also fit all of the counties in California with a population greater than 1.5% of that of the entire state and with greater than zero deaths. The results do not depend on these selections, but instead suggest a practical criteria to administer limited resources. The fitting is done starting 10 April for these counties, as at this point the outbreak is sufficiently well-documented in each to successfully model. For the county-level data we compute a seven day running average of each of the data sets to which we then fit in order to smooth out fluctuations in the data, likely due to reporting, which are more significant here than in the other data sets considered, as the county populations are smaller and hence discrepancies impact the smoothness of the data more. The fits for each of the counties can be found in the supplementary materials to Figure 5.
As one can see from Figure 2, these data sets are particularly not smooth, which indicates inefficiencies in reporting. Additionally, it is difficult to gauge their consistency within the dates provided or to compare between locations, as reporting mechanisms changed over time within the same locations. Despite this lack of consistency, our model and fitting mechanism was successful in reproducing the progress of the outbreak in each data set studied.
The authors confirm that the data supporting the findings of this study are available within the article and/or its supplementary materials; any other data will be made available upon request. Our code can be found on github: https://github.com/badeaa3/COVID19 Rapid Testing. The code is written using python with the packages scipy, numpy, lmfit, matplotlib and plotly50-54.
Data Availability
The authors confirm that the data supporting the findings of this study are available within the article and/or its supplementary materials; any other data will be made available upon request. Our code can be found on github: https://github.com/badeaa3/COVID19_Rapid_Testing.
Competing Interest
BN, AB, AR, MB, NS, AG, IB, and BBH are employed by or affiliated with E25Bio Inc. (www.e25bio.com), a company that develops diagnostics for epidemic viruses.
Acknowledgments
We thank Professor Lee Gehrke for critical reading of the manuscript. The study is funded, in part, by a Bill and Melinda Gates Foundation Award (INV-017872) to E25Bio, Inc. EN is funded by Tufts University DISC Seed Grant. MLN is supported by a FAPESP grant (#2020/04836-0) and is a CNPq Research Fellow. AFV is supported by a FAPESP Fellow grant (#18/17647-0). GRFC is supported by a FAPESP Fellow grant (#20/074190). BHGAM is supported by a FAPESP Scholarship (#19/06572-2). The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.