ABSTRACT
The COVID-19 pandemic has created an urgent need for rapid, effective, and low-cost SARS-CoV-2 diagnostic testing. Here, we describe COV-ID, an approach that combines RT-LAMP with deep sequencing to detect SARS-CoV-2 in unprocessed human saliva with high sensitivity (5–10 virions). Based on a multi-dimensional barcoding strategy, COV-ID can be used to test thousands of samples overnight in a single sequencing run with limited labor and laboratory equipment. The sequencing-based readout allows COV-ID to detect multiple amplicons simultaneously, including key controls such as host transcripts and artificial spike-ins, as well as multiple pathogens. Here we demonstrate this flexibility by simultaneous detection of 4 amplicons in contrived saliva samples: SARS-CoV-2, influenza A, human STATHERIN, and an artificial SARS spike-in. The approach was validated on clinical saliva samples, where it showed 100% agreement with RT-qPCR. COV-ID can also be performed directly on saliva adsorbed on filter paper, simplifying collection logistics and sample handling.
INTRODUCTION
Within the first year of the COVID-19 pandemic SARS-CoV-2 has swept across the world, leading to more than 130 million infections and over 2.8 million deaths worldwide (as of April 2021). In many countries, non-pharmaceutical interventions, such as school closures and national lockdowns, have proven to be effective, but could not be sustained due to economic and social impact1,2. Regularly performed population-level diagnostic testing is an attractive solution3, particularly as asymptomatic individuals are implicated in rapid disease transmission, with a strong overdispersion in secondary transmission4. Maintenance of population-level testing can be successful in isolating asymptomatic individuals and preventing sustained transmission5, 6; however, considerable barriers exist to the adoption of such massive testing strategies. Two such barriers are cost and supply constraints for commercial testing reagents, both of which make it impractical to test large numbers of asymptomatic individuals on a recurrent basis. A third major barrier is the lack of “user-friendly” protocols that can be rapidly adopted by public and private organizations to establish high-throughput surveillance screening. In addition, while COVID-19 testing of symptomatic individuals might be effective during the summer season, when other respiratory infections are rare, new strategies are needed to facilitate rapid differential diagnosis between SARS-CoV-2 and other respiratory viruses in winter.
Recent adaptations of reverse transcription and polymerase chain reaction (RT-PCR) to amplify viral sequence and perform next-generation DNA sequencing have opened promising new avenues for massively parallel SARS-CoV-2 detection. In general, sequencing-based protocols use libraries of amplification primers to tag reads originating from each individual patient sample with a unique index that can be identified and deconvoluted after sequencing, thus allowing pooling of tens of thousands of samples in a single assay. SARSeq, SPAR-Seq, Swab-seq, and INSIGHT, directly amplify the viral RNA by RT-PCR and simultaneously introduce barcodes7-10. While effective, these methods rely on individual PCR amplification of each patient sample, thus requiring a large number of thermal cyclers for massive scale-up. An alternative approach, ApharSeq, addresses this bottleneck by annealing barcoded RT primers to viral RNA and pooling samples prior to amplification but the need for specialized oligo-dT magnetic beads might constitute a separate adoption barrier for this method11. Finally, several recent methods have been designed to take advantage of the extreme sensitivity and isothermal conditions of loop-mediated isothermal amplification (LAMP)12-14, but these methods either require additional manipulation to introduce barcodes12, 13 or do not allow for convenient multiplexing14.
In this study, we present COV-ID, a method for SARS-CoV-2 identification based on LAMP, which enables large-scale diagnostic testing at low cost and with minimal on-site equipment. COV-ID is a robust method that can be used to test tens of thousands of samples for multiple pathogens with modest reagent costs and 2–4 laboratory personnel, generating results within 24 hours. COV-ID uses unpurified saliva or saliva adsorbed on filter paper as input material, thus enabling the massively parallel, inexpensive testing required for population-level surveillance of the COVID-19 pandemic (Fig. 1A).
RESULTS
Two-step amplification and indexing of viral and human sequences via RT-LAMP and PCR
The molecular basis for COV-ID is reverse transcription loop-mediated isothermal amplification (RT-LAMP), an alternative to PCR that has been used extensively for viral DNA or RNA detection in clinical samples15-18, including SARS-CoV-219, 20. RT-LAMP requires 4–6 primers that recognize different regions of the target sequence21, 22 and proceeds through a set of primed and self-primed steps to yield many inverted copies of the target sequence spanning a range of molecular sizes (Fig. S1). The forward inner primer (FIP) and backward inner primer (BIP), which recognize internal sequences, are incorporated in opposite orientation across the target sequence in the final amplified product (Fig. S1).
Previous studies have shown that the FIP and BIP tolerate insertion of exogenous sequence between their different target homology regions23. We exploited this flexibility and introduced 1) patient-specific barcodes as shown previously12, 14, 23 and 2) artificial sequences that allowed for PCR amplification of a small product compatible with Illumina sequencing library construction (Fig. 1, Fig. S1). These innovations allow us to pool individually barcoded RT-LAMP reactions and amplify them in batch via PCR, while introducing unique P5 and P7 dual indexes in different pools, thus enabling two-dimensional barcoding and dramatically increasing method throughput (see Table S1 for PCR primer sequences). To minimize pool variability, PCR primers can be titrated to 100 nM and pooled PCRs carried out to completion, resulting in each pool being amplified to the same approximate concentration. Uniquely amplified and barcoded pools are mixed into a single “super-pool” that can be sequenced on an Illumina NextSeq or similar instrument (Fig. 1A). Combining individual barcodes embedded in the product at the RT-LAMP step with dual indexes introduced at the pool level during the PCR step allows for deconvolution of thousands or tens of thousands of samples in a single sequencing run.
To determine whether introduction of these exogenous sequences into the primers inhibited the isothermal amplification step, we performed RT-LAMP on inactivated SARS-CoV-2 virus using an extensively validated primer set against the N2 region of the nucleocapsid protein24 including either the conventional BIP and FIP primers or their modified version re-engineered for the COV-ID workflow (Fig. 1B). Although the appearance of the amplified viral product was slightly delayed when using COV-ID primers, all reactions reached saturation rapidly and without detectable amplification of negative controls (Fig. 1C). Next, we tested whether COV-ID was compatible with RT-LAMP using newly designed primers against a host (human) transcript and whether the second step of COV-ID, direct library construction and indexing via PCR amplification (Fig. 1D), yields the desired product. For this, we designed RT-LAMP primers against the human beta-actin (ACTB) transcript that included sequences necessary for COV-ID. After RT-LAMP, reactions were diluted 100-fold before PCR with barcoded Illumina adapters. A PCR product of the expected size was visible in reactions containing total HeLa RNA, whereas no PCR product was observed in the absence of template (Fig. 1E). Sanger sequencing of the PCR product confirmed that RT-LAMP followed by PCR generated the product expected by the COV-ID method design, including the sample barcode introduced during the RT-LAMP step. Thus, our data show that RT-LAMP is tolerant of sequence insertions in the BIP and FIP primers that allow introduction of LAMP-level barcodes as well as sequences homologous to Illumina adapters for direct amplification, indexing, and library construction via PCR.
Sequencing-based detection of SARS-CoV-2 RNA from saliva using COV-ID
We next evaluated the utility of COV-ID to detect viral RNA in saliva. Saliva is an attractive sample material for COVID-19 diagnostics with potential for early detection25, and has been shown to be a viable template for nucleic acid amplification via RT-PCR26, recombinase polymerase amplification (RPA)27 as well as RT-LAMP28, 29. We prepared human saliva for RT-LAMP using a previously described treatment that inactivates SARS-CoV-2 virions, saliva-borne RNases and LAMP inhibitors (Fig. 2A)29. We performed RT-LAMP followed by PCR on inactivated saliva spiked with water or 1,000 total copies of inactivated SARS-CoV-2 virus. We observed a single band of the expected size in reactions performed on saliva spiked with virus but not in control reactions (Fig. 2B). The sequence of the amplified and barcoded viral product was confirmed by Sanger sequencing (Fig. S2A). Next, we subjected the libraries to deep sequencing. Reads aligned uniformly to the N gene, the region targeted by the N2 primer set, in COV-ID libraries constructed from viral samples but not in control libraries (Fig. 2C).
In several SARS-CoV-2 FDA approved tests, parallel amplification of a host (human) amplicon is implemented as a metric for sample integrity and quality. That is, if no human RNA is amplified from a clinical sample, no conclusion can be drawn from a negative test result30. However, in most tests, viral and human amplicons must be detected separately, resulting in a multiplication of the number of reactions to be performed. We reasoned that the deep sequencing nature of COV-ID would allow for simultaneous detection of viral, human, and other control amplicons, without increasing the number of necessary reactions. In fact, given that the PCR handles inserted in the BIP and FIP are the same for all RT-LAMP amplicons (Fig. 1B), the same P5 and P7 Illumina primers allow the simultaneous amplification of all RT-LAMP products obtained with COV-ID-modified primer sets (Fig. 1D). To identify a suitable human control, we compared conventional RT-LAMP primers for the mRNA of ACTB24 or STATHERIN (STATH), a gene expressed specifically in saliva31. To determine which of the two RT-LAMP primer sets was a better proxy to measure RNA integrity in saliva samples, we assayed for amplification of the respective products in presence or absence of RNase. Whereas addition of RNase A abolished the STATH signal, it was ineffectual for ACTB (Fig. S2B), suggesting that amplification of genomic DNA made considerable contributions to the RT-LAMP signal observed for the latter. Therefore, we utilized STATH mRNA as a human control in subsequent experiments.
We used COV-ID-adapted primer sets for N2 and STATH (Table S1) in multiplex on inactivated saliva spiked with a range of SARS-CoV-2 from 5 to 10,000 virions/μL. Subsequently, each RT-LAMP reaction was separately amplified via PCR using a unique P5 and P7 index combination, pooled, quantified, and deep-sequenced to an average depth of 6,000 reads per sample. After read trimming, alignment, and filtering (see Methods), 76% of reads from saliva COV-ID reactions were informative (Fig. S2C). In order to differentiate SARS-CoV-2 positive and negative samples, we calculated the ratio between N2 reads and reads mapping to the human STATH control. Using the highest N2/STATH read ratio in control (SARS-CoV-2 negative saliva) as a threshold, 95% (19/20) of samples with spiked-in virus were correctly classified as positives (Fig. 2D). Using COV-ID, we consistently detected SARS-CoV-2 in saliva samples containing as low as 5 virions per µL, a sensitivity comparable and in some cases superior to those of established testing protocols32.
Scaling COV-ID to handle higher sample numbers requires pooling samples immediately following RT-LAMP, prior to the PCR step (Fig. 1A). We designed 32 unique 5-nucleotide barcodes for several target LAMP amplicons (Fig. S2D and Table S2). We first individually validated each barcode and primer combination by real-time fluorescence and PCR efficiency. Certain barcodes inhibited the RT-LAMP reaction, possibly due to internal micro-homology and primer self-hybridization33. Nonetheless, out of 32 barcodes tested in 3 separate RT-LAMP reactions (N2, ACTB, and STATH), 25 successfully amplified all three target RNAs (Fig. S2D). Saliva samples spiked with various concentrations of inactivated SARS-CoV-2 were amplified via barcoded RT-LAMP, then optionally pooled prior to PCR and sequencing (Fig. S2E). CoV-2/STATH ratios demonstrated no loss of sensitivity or specificity in the pooled samples compared to the individual PCRs.
To test the potential of COV-ID on patient samples, we tested saliva specimens, collected and previously analyzed at the Hospital of the University of Pennsylvania (see Methods). We carried out multiplex barcoded RT-LAMPs on each sample (COV-ID step I, Fig. 1B), pooled the reactions and then constructed libraries via PCR (COV-ID step II, Fig. 1D). After deep sequencing, analysis of N2/STATH ratios showed 100% (8/8) concordance with viral copy numbers generated by a standard clinical test (RNA purification followed by RT-qPCR) (Fig. 2E), demonstrating the effectiveness of the COV-ID approach.
Taken together, our data show that COV-ID can be utilized to detect viral and human amplicons in multiplex directly from saliva. The samples that can be batch amplified and deconvoluted after deep sequencing.
Calibration of COV-ID using an artificial spike-in
Existing deep sequencing approaches for massively parallel COVID-19 testing based on RT-PCR incorporate artificial spike-ins, which serve as an internal calibration controls and allow for better estimates of viral loads by end-point PCR7, 8. At the same time, adding to the reactions an artificial substrate for amplification helps minimizing spurious signals as it can “scavenge” viral amplification primers in negative samples. Finally, by providing a baseline amplification even in empty samples, a properly designed spike-in strategy can reduce variance in total amounts of final amplified products across samples, which compresses the dynamic-range of sequence coverage for each patient in a complex pool and, therefore, reduces the risk of inconclusive samples due to low sequencing coverage8.
We reasoned that a spike-in approach for LAMP-based quantification would provide similar benefits in the context of COV-ID. To generate a SARS-CoV-2 spike-in, we synthesized a fragment of the N2 RNA that retained all primer-binding regions for RT-LAMP and contained a divergent 7-nt stretch of sequence to distinguish reads originating from the spike-in from those originating from the natural virus (Fig. S3A). After confirming that the spike-in template was efficiently amplified via RT-LAMP with the N2 primer set (Fig. S3B), we performed pooled COV-ID on virus-containing saliva in the presence of 20 fg of N2 spike-in RNA. As expected8, addition of a constant amount of viral spike-in across reactions reduced the variability in total read numbers for individual samples in the final pool (Fig. S3C). As discussed above, a narrower range in sequencing output across samples in a pool optimizes the utilization of sequencing reads, and ultimately lowers the cost per sample. Because the spike-in provides an internal calibration that is independent of the RNA quality found in saliva, in several cases normalization against the spike-in resulted in lower levels of false positive signal from negative samples (Fig. S3D). This is likely because in cases where very few STATH reads were obtained, possibly due to degradation of host RNA in the saliva sample, the resulting small denominator inflated the N2/STATH ratio even for SARS-CoV-2 signal that was low in absolute terms and likely spurious.
Thus, these data show that spike-in strategies are compatible with the COV-ID workflow and provide a means to stabilize total amplification and read allocation per sample while also offering an additional calibration control to better estimate the viral load in samples where the endogenous STATH mRNA might be below detection due to improper collection or handling.
Simultaneous detection of SARS-CoV-2 and influenza A by COV-ID
Given the challenge of distinguishing early symptoms of COVID-19 from other respiratory infections, we evaluated COV-ID for the simultaneous detection of more than one viral pathogen. Multiple distinct products can be simultaneously amplified by RT-LAMP in the same tube by providing the appropriate primer sets in multiplex, as we demonstrated above by co-amplifying N2 and STATH in the same COV-ID reaction (see Fig. 2). In fact, simultaneous detection of SARS-CoV-2 and influenza virus by RT-LAMP was previously achieved, albeit in a fluorescent-based, low-throughput type of assay34. We reasoned that the sequencing-based readout of COV-ID would allow extending this approach to the simultaneous detection of multiple pathogens as well as endogenous (host mRNA) and artificial (spike-in) calibration standards, all in a single reaction.
To test the ability of COV-ID to simultaneously detect multiple viral templates, we selected and validated a generic “flu” RT-LAMP primer set that recognizes several strains, including influenza A virus (IAV) and influenza B34, 35, and modified the BIP and FIP sequence to introduce the COV-ID barcodes and handles for PCR (Fig. S2D and Table S1). We added inactivated SARS-CoV-2 virus (BEI resources) and IAV strain H1N1 RNA (Twist Biosciences) to saliva according to a 3 × 4 matrix of (104, 103, or 0 copies per µL) SARS-CoV-2 RNA against H1N1 RNA (105, 104, 103, or 0 copies per µL) (Fig. 3A), as well as the N2 spike-in control. We performed multiplex COV-ID on these samples using primers sets for STATH, N2 (to detect SARS-CoV-2), and IAV (to detect H1N1) and sequenced to an average depth of 21,000 reads per sample. Both H1N1 and SARS-CoV-2 were detected above background and the signal correlated with the amount of the respective template added to saliva (Fig. 3B–C). Overall, multiplex COV-ID correctly identified samples that contained only SARS-CoV-2 (7/8) or H1N1 (6/8). For samples that contained both pathogens we observed reduced sensitivity (11/16 identification of both pathogens), which was also observed in a previous multiplexing attempt34. However, in practice individuals who are simultaneously infected with both viruses presumably would be rare, and for these cases the ability to detect at least one virus successfully would allow to follow up with further diagnostic testing. We found that of the samples containing both viruses, 16/16 showed positive detection of at least one pathogen (SARS-CoV-2 or H1N1), suggesting the reduced sensitivity of the multiplex assay is due to interference between amplification of both viral templates. This also demonstrates that COV-ID can be used as an effective screening approach for multiple viral templates.
Paper-based saliva sampling for COV-ID
As an additional step toward increasing the throughput of the COV-ID approach, we explored avenues to simplify collection, lower costs, and expedite processing time. Absorbent paper is an attractive alternative to sample vials for collection, given its low cost, wide availability, and smaller environmental footprint. In fact, paper has been used as a means to isolate nucleic acid from biological samples for direct RT-PCR testing36 as well as RT-LAMP37, 38.
We sought to determine whether the COV-ID workflow would be compatible with saliva collection on absorbent paper. First, we immersed a small square of Whatman filter paper into water containing various dilutions of inactivated SARS-CoV-2. After 2 min, the paper was removed and transferred to PCR strip tubes followed by heating at 95ºC for 5 minutes to air-dry the sample (Fig. 4A). Next, we added the RT-LAMP mix containing the N2 COV-ID primer set directly to the tubes containing the paper squares and let the reaction proceed in the usual conditions. COV-ID PCR products of the correct size were evident in all samples containing viral RNA, with sensitivity of at least 100 virions / μL (Fig. 4B) and in none of the controls, demonstrating that the presence of paper does not interfere with the RT-LAMP reaction and subsequent PCR amplification with Illumina adapters.
To assay direct COV-ID detection from saliva on paper, we saturated Whatman filter paper squares with saliva containing different amounts of inactivated SARS-CoV-2 virus, which, we reasoned, would be equivalent to a patient collecting their own saliva by chewing on a small piece of absorbent paper. Next, we placed the paper squares into reaction tubes containing TCEP/EDTA inactivation buffer (see Methods) similar to that used for the in-solution samples used in our previous experiments (see Fig. 1A). We dried the paper at 95ºC and performed RT-LAMP followed by PCR (Fig. 4C), which resulted in the appearance of COV-ID products of the correct size starting from saliva spiked with as few as 50 virions / μL (Fig. 4D). We then performed COV-ID sequencing on saliva collected on paper using primers N2 and STATH in the presence of the N2 spike-in RNA. The sequence data showed more variability and limited coverage of the control amplicons compared to in-solution COV-ID likely due to the more challenging reaction conditions; therefore, we normalized viral reads using both STATH and spike-in. This paper-based COV-ID proof-of-principle experiment detected the presence of viral RNA in samples with as little as 320 copies / µL (Fig. 4E), a lower sensitivity compared to that of in-solution COV-ID but still well within the useful range39 to detect infections.
Taken together, these data show that the RT-LAMP step of COV-ID is compatible with the presence of paper in the reaction tube and suggest that self-collection of saliva by patients directly on absorbent paper could provide a simple and cost-effective strategy to collect and test thousands of saliva samples for multiple pathogens (Fig. 4F).
DISCUSSION
Testing strategies are vital to an effective public health response to the COVID-19 pandemic, particularly with the spread of the disease by asymptomatic individuals. An ongoing challenge to COVID-19 testing is the need for massive testing strategies for population-level surveillance that are needed for efficient contact tracing and isolation. Most FDA-approved clinical SARS-CoV-2 diagnostic tests are based on time-consuming and expensive protocols that include RNA purifications and RT-PCR32 and must be performed by trained personnel in well-equipped laboratories. Point-of-care antigen tests provide a much faster turnaround time and require little manipulation, but there remains limited data on their specificity in real-world applications40. Because of reagent limitations and diagnostic testing bottlenecks, prioritization of COVID diagnostic testing continues to be for symptomatic individuals and individuals who are particularly vulnerable for infection after exposure41. Private organizations, including colleges and universities, have circumvented some of these challenges by contracting with private laboratories to establish asymptomatic surveillance testing protocols; this is a costly option for population-level surveilling of asymptomatic SARS-CoV-2 infections.
Several effective COVID-19 vaccines have been developed and there is a concerted ongoing global vaccination effort, providing a concrete means to end the pandemic. Despite this progress there are several potential risks that require vigilance: possible COVID-19 transmission in vaccinated individuals, emergence of vaccine-resistant viral variants, and public skepticism of vaccines or faltering compliance with social distancing guidelines42. For these reasons ongoing testing and surveillance efforts will remain important for the foreseeable future, both to monitor the progress of vaccination in reducing symptomatic cases and to detect emerging variants.
In order to scale testing to an effective volume and frequency, surveillance tests must demonstrate the following qualities: 1) sensitivity, to identify both asymptomatic and symptomatic carriers; 2) simplicity in methodology, to be performed in a number of traditional diagnostic laboratories, without specialized equipment; 3) low cost and easily accessible reagents; 4) ease of collection method; 5) rapid turnaround time to allow for isolation and contract tracing; and 6) ability to co-detect multiple respiratory viruses, given the overlap in patient symptoms. To this end, we have developed COV-ID, an RT-LAMP-based parallel sequencing SARS-CoV-2 detection method that can provide results from tens of thousands of samples per day at relatively low cost to simultaneously detect multiple respiratory viruses.
COV-ID features several key innovations that make it well-suited to high-throughput testing. First, COV-ID uses a two-dimensional barcoding strategy8, where the same 96 barcodes are used in each RT-LAMP plate, making it possible to pre-aliquot barcodes in 96-well plates ahead of time and store them at −20ºC, simplifying execution of the assay and shortening turnaround times. Second, since RT-LAMP does not require thermal cycling, tens of thousands of samples can be run simultaneously in a standard benchtop-sized incubator or hybridization oven held at 65ºC. Third, individual samples are pooled immediately following RT-LAMP; therefore, a single thermocycler has the potential to process up to 96 or 384 RT-LAMP plates, generating 9,216 or 36,864 individually barcoded samples, respectively (Fig. 1A, 4F, 4G). Only 96 unique FIP barcodes are required for this scaling; here, we show that 28 out of 32 LAMP barcodes tested were functional for both N2 and STATH. This proof-of-principle experiment demonstrates the feasibility of generating the library of barcodes required to apply COV-ID to a large population. An additional advantage of sequencing-based approaches, such as COV-ID is that with carefully designed primers it would be possible to recover information about viral variants directly from the sequencing reads43. Finally, COV-ID can generate ready-to-sequence libraries directly from saliva absorbed onto filter paper, which would allow for major streamlining of the often-challenging logistical process of sample collection (Fig. 4). Thus, COV-ID libraries for thousands and tens of thousands of samples can be generated with relatively minimum effort in biological laboratories with basic equipment and easily accessible reagents.
With the average throughput of an Illumina NextSeq 500/550, a relatively affordable next-generation sequencer up to 9,216 (96 RT-LAMPs x 96 pools) can be sequenced at a depth of ∼48,000 reads per sample, and up to 36,864 (96 RT-LAMPs x 384 pools) can be sequenced at a depth of ∼12,000 reads, which, we showed, is more than sufficient to obtain information about multiple viral and control amplicons. Considering that reagents for one NextSeq run cost ∼1,500 U.S. dollars, the theoretical sequencing cost per sample could be as low as $0.04 (Fig. 4G). While sequencing instruments are relatively specialized and not ubiquitous, amplified COV-ID DNA libraries could be shipped to remote facilities for sequencing in a cost-effective manner as previously proposed by the inventors of LAMP-seq14. Finally, because of the limited sequence space against which reads must be aligned, computational analysis of the resulting data can be performed in a matter of minutes with optimized pipelines, providing results shortly after the sequencing run has completed.
COV-ID has sensitivity of 5–10 virions of SARS-CoV-2 per μL in contrived saliva samples (Fig. 2D) and at least 300 virions / μL in saliva collected from patients in a clinical setting (Fig. 2E). However, this was sufficient to properly classify 100% of the clinical samples analyzed, given that all positive samples had an estimated viral load > 300 virions / µL. Importantly, this was also the apparent limit of sensitivity of paper-based COV-ID (Fig. 4E), suggesting that even in these settings COV-ID would be capable of accurately classifying the majority of patient-derived samples.
In conclusion, COV-ID is a flexible platform that can be executed at varying levels of scale with additional flexibility in sample input, making it an attractive platform for surveillance testing. Population-level monitoring of SARS-CoV-2 infections will be critical while vaccines are being distributed to the global population, and continued surveillance will likely remain an effective strategy to protect immune-compromised and unvaccinated members in society and within entities and organizations where regular monitoring is critical to social isolation strategies. To that end, effective, low-cost, multiplexed, and readily-implementable strategies for surveillance testing, such as COV-ID, are important to mitigate the effects of the current and future pandemics.
METHODS
RT-LAMP primer design
Primers against ACTB were designed using PrimerExplorerV5 (https://primerexplorer.jp/e/) using default parameters and including loop primers (Table S1).
For COV-ID, priming sequences for PCR were inserted in FIP and BIP primers between the target homology regions (F1c and F2, and B1c and B2, respectively, see Fig. S1). After testing, we determined that 12 nts and 11 nts were most effective for the P5 and P7 binding regions, respectively, being the shortest insertion that allowed reliable PCR amplification from LAMP products without impacting LAMP efficiency. In addition a 5 nt barcode sequence was inserted at the immediate 3’ end of the P5-binding region of the FIP primer.
LAMP barcode design
Starting from the total possible 1,024 unique 5-nt barcodes, we removed those that matched any sequence within the RT-LAMP primers used in this study (Table S1) in either sense or anti-sense orientation. From the remaining pool, we selected 32 barcodes with hamming distance of at least 2 between all candidates. We tested FIPs incorporating candidate barcodes for ACTB, STATH, N2, and IAV primer sets on saliva RT-LAMP with 1,000 copies target amplicon. Primers that failed to show LAMP signal by real time fluorescence monitoring or generate expected PCR product were discarded. Final usable barcodes are provided in (Table S2).
Saliva preparation
We prepared 100x TCEP/EDTA buffer (250 mM TCEP, 100 mM EDTA, 1.15 N NaOH) 29. TCEP/EDTA buffer was added to human saliva at 1:100 volume, then samples were capped, vortexed to mix and heated in a thermocycler (95ºC 5 min, 4ºC hold) until ready to use for RT-LAMP. When indicated, heat-inactivated SARS-CoV-2 (BEI Resources Cat. NR-52286) or H1N1 genomic RNA (Twist Biosciences Cat. 103001) was added to inactivated saliva prior to RT-LAMP.
N2 spike-in synthesis
To prepare the in vitro transcription template for SARS-CoV-2 N2 spike-in RNA, we performed RT-PCR using Power SYBR RNA-to-Ct kit (Thermo Cat. 4389986) of heat inactivated SARS-CoV-2 (BEI Resources Cat. NR-52286) using the following primers: N2-B3 and N2-spike-T7 S. PCR product was purified and used as a template for in vitro transcription using HiScribe T7 transcription kit (NEB Cat. E2040S). RNA was purified with Trizol (Thermo Cat. 15596026), quantified via A260, then aliquoted in BTE buffer (10 mM bis-tris pH 6.7, 1 mM EDTA) and stored at −80ºC. Primers used and final spike-in sequence are provided in Table S1.
RT-LAMP
All RT-LAMP reactions were set up in clean laminar flow hoods and all steps before and after LAMP were carried out in separate lab spaces to avoid contamination. RT-LAMP reactions were set up on ice as follow: for each amplicon 5 or 6 LAMP primers were combined into 10x working stock at established concentrations: 16 μM FIP, 16 μM BIP, 4 μM LF, 4 μM LB, 2 μM F3, 2 μM B3. For multiplexed COV-ID reactions 10x working primer mixes for each amplicon were either added proportionally so that the total primer content remained constant, or mixed so that BIP and FIP primers were scaled down depending on amplicon number while remaining primers (LF and/or LB, F3, B3) were kept at same concentration as in single reactions.
Each 10 μL RT-LAMP reaction mix consisted of 1x Warmstart LAMP 2x Master Mix (NEB Cat. E1700S), 0.7 μM dUTP (Promega Cat. U1191), 1 μM SYTO-9 (Thermo Cat. S34854), 0.1 μL Thermolabile UDG (Enzymatics Cat. G5020L), 1 μL of saliva template and optionally 20 fg of N2 Spike RNA. Reactions were prepared in qPCR plates or 8-well strip tubes, sealed, vortexed and centrifuged briefly, then incubated in either a QuantStudio Flex 7 or StepOnePlus instrument (Thermo) for 65ºC 1 hr. Real-time fluorescence measurements were recorded every 30 sec to monitor reaction progress but were not used for data analysis. Following LAMP the reactions were heated at 95ºC 5 min to inactivate LAMP enzymes.
Library construction by PCR amplification
All post-LAMP steps were carried out on a clean bench separate from LAMP reagents and workspace. For individual LAMP samples, LAMP amplicons were diluted either 1:100 or 1:1,000 in water. For pooling of individually barcoded LAMP reactions, equal amounts of all LAMP reactions were combined and then either diluted 1:1000 or purified via SPRIselect beads (Beckman Coulter Cat. B23317) using a bead-to-reaction ratio of 0.1x. Purified material was diluted to final 100-fold dilution relative to LAMP.
1 μL of diluted LAMP material was used as a template for PCR using OneTaq DNA polymerase (NEB Cat. M0480L) with 100 nM each of custom dual-indexed Illumina P5 and P7 primers in either 10 or 25 μL reaction (Table S1). PCR reactions were incubated as follows: (25 cycles of stage 1 [94ºC x 15 sec, 45ºC x 15 sec, 68ºC x 10 sec], 10 cycles of Stage 2 [94ºC x 15 sec, 68ºC x 10 sec], 68ºC x 1 min, 4ºC x ∞). Note, for initial pilot COV-ID and clinical sample experiments (Fig. 2D–E, Fig. S2C) PCR incubation was performed as above with modification: [Stage 1 × 10 cycles, Stage 2 × 25 cycles].
PCR products were resolved on 2% agarose gel to confirm library size, then all were pooled and purified via MinElute PCR purification kit (Qiagen Cat. 28004) and quantified using either Qubit dsDNA High Sensitivity kit (Thermo Cat. Q32851) or Kapa Library Quantification Kit for Illumina (Kapa Cat. 07960140001).
Human samples
Clinical saliva samples used for Fig. 2E were obtained and characterized as part of a separate study at the University of Pennsylvania44 and collected under Institutional Review Board (IRB)-approved protocols (IRB protocol #842613 and #813913). Briefly, salivary samples were collected from possible SARS-CoV-2 positive patients at one of three locations: (1) Penn Presbyterian Medical Center Emergency Department, (2) Hospital of the University of Pennsylvania Emergency Department, and (3) Penn Medicine COVID-19 ambulatory testing center. Inclusion criteria including any adult (age > 17 years) who underwent SARS-CoV-2 testing via standard nasopharyngeal swab at the same visit. Patients with known COVID-19 disease who previously tested positive previously were excluded. After verbal consent was obtained by a trained research coordinator, patients were instructed to self-collect saliva into a sterile specimen container which was then placed on ice until further processing for analysis.
The saliva used in the remaining experiments was donated by one of the authors. Because it was only used for protocol optimization the Penn IRB has determined that it did not constitute human subjects research and therefore approval was not required.
Paper COV-ID
Squares of Whatman no. 1 filter paper (2 mm x 2 mm) were cut using a scalpel on a clean surface under a laminar flow hood and stored at room temperature until used. Using ethanol-sterilized fine-nosed tweezers a single square was dipped twice into unprocessed, freshly collected saliva with or without added SARS-CoV-2 (BEI Resources Cat. NR-52286) until saliva was saturated on paper by eye. Paper was then transferred to well of 96-well plate containing 10 ul of 1x TCEP/EDTA buffer (2.5 mM TCEP, 1 mM EDTA, 1.15 NaOH). Plate was placed on heat block inside laminar flow hood or inside open thermocycler and incubated at 95ºC x 10 min.
10 ul RT-LAMP mixture was prepared as described above in the absence of the N2 Spike RNA. 10 ul of RT-LAMP reaction mixture was added to each paper strip, then plate was sealed and incubated 65ºC x 1 hr, 95ºC x 5 min in QuantStudio Flex 7 (Thermo). 1 ul of each reaction was either diluted 1:100 or purified via SPRIselect beads and PCR amplified as described above.
Sequencing
Libraries were sequenced on one of the following Illumina instruments: MiSeq, NextSeq 500, NextSeq 550, NovaSeq 6000 and sequenced using single end programs with a minimum of 40 cycles on Read 1 and 8 cycles for index 1 (on P7) and index 2 (on P5).
Sequence Analysis
Reads were filtered for optical quality using FASTX-toolkit utility fastq_quality_filter (http://hannonlab.cshl.edu/fastx_toolkit/), then cutadapt45 was used to remove adapters and demultiplex LAMP barcodes. Reads were aligned to a custom index containing SARS-CoV-2 genome (NC_045512.2), Influenza H1N1 coding sequences (NC_026431.1, NC_026432.1, NC_026433.1, NC_026434.1, NC_026435.1, NC_026436.1, NC_026437.1, NC_026438.1), STATH coding sequence (NM_003154.3), and custom N2 spike sequence (Table S3), target sequences using bowtie246 with options --no-unal and --end-to-end. Alignments with greater than 1 mismatch were removed and the number of reads mapping to each target for all barcodes were extracted and output in a matrix. Barcodes with fewer than 25 total mapped reads were discarded.
Data Availability
Next generation sequencing data generated for this study are available at the NCBI GEO with accession GSE172118.
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE172118
Data Availability
Next generation sequencing data generated for this study are available at the NCBI GEO with accession GSE172118.
DISCLOSURE
R.W-T., C.A.T. and R.B. are inventors on a patent application filed by the University of Pennsylvania related to this work.
SUPPLEMENTARY FIGURE LEGENDS
ACKNOWLEDGMENTS
The authors thank E. Shields for careful proofreading of analysis scripts; B. Morris and R. Collman for the collection and distribution of clinical saliva samples; F. Bushman, S. Sherril-Mix, and Abigail Glascock for sharing RT-qPCR data on the clinical samples; the UPenn rapid assay task force for project feedback; and the gLAMP weekly forum for advice and guidance.