A Rapid SARS-CoV-2 Variant Detection by Molecular-Clamping Based RT-qPCR
========================================================================

* Shuo Shen
* Andrew Y. Fu
* Maidar Jamba
* Jonathan Li
* Mike J. Powell
* Aiguo Zhang
* Chuanyi M. Lu
* Michael Y. Sha

## Abstract

We applied XNA-based Molecular Clamping Technology to develop a multiplex qPCR assay for rapid and accurate detection of SARS-CoV-2 mutations. A total of 278 previously tested SARS-CoV-2 positive samples, originating primarily from the San Francisco Bay Area, were tested: 139 samples collected in mid-January 2021 and 139 samples collected at the end of February 2021. The SARS-CoV-2 spike-gene D614G mutation was detected in 58 samples (41.7%) collected in January 2021 and in 78 samples (56.1%) collected in February. Notably, while no N501Y mutation was detected in the January samples, seven of the February samples tested positive for both the N501Y and D614G mutations. The results suggest a relatively recent and rapid spread of the UK variant (B.1.1.7) in Northern California. This new Molecular Clamping technology-based multiplex RT-qPCR assay is highly sensitive and specific and can help speed up large-scale testing for SARS-CoV-2 variants.

## Introduction

While worldwide vaccination efforts are ongoing, the COVID-19 pandemic continues to spread, with 120 million cases and 2.7 million deaths to date (March 2021). In the United States alone, there are currently over 30 million cases, and the death toll has passed 550 thousand. With vaccinations picking up steam and more and more people acquiring immunity to SARS-CoV-2, the focus is now shifting to more transmissible and potentially vaccine-resistant novel variants of the virus.
At present, at least four SARS-CoV-2 variants, all present in the USA, are of particular concern: the UK B.1.1.7 (501Y.V1) [1], South Africa B.1.351 (501Y.V2) [2], Brazil P.1 (501Y.V3) [3] and CAL.20C (20C/S:452R; /B.1.429) [4] [5] variants. Lineage B.1.1.7 is also known as 20I/501Y.V1, variant of concern 20DEC-01 (VOC-20DEC-01, previously written as VOC-202012/01) or, commonly, the UK variant. It was found in the southeast of England in early October 2020 and has been observed to be increasing in both Europe and the United States [6–9]. It is estimated to be 40%–80% more transmissible than the wild-type SARS-CoV-2 [10–11].

Of these four main variants of concern, all carry the D614G mutation, and three (all except the CAL.20C variant) share the N501Y mutation. Both mutations are located in the SARS-CoV-2 spike protein. In the D614G mutation, the amino acid change from aspartic acid to glycine is caused by an A-to-G nucleotide mutation at position 23403 of the virus genome. This change stabilizes the spike protein and enhances its fitness and infectivity [12]. Like D614G, the N501Y mutation is associated with higher virus infectivity. More worryingly, N501Y lies within the receptor-binding domain (RBD) of the spike protein and is characterized by stronger binding to the ACE2 receptor and a significant drop in the efficacy of the original vaccines [13]. These two mutations, D614G and N501Y, are the two earliest and most prominent mutations observed thus far as the COVID-19 pandemic continues to evolve.

Next generation sequencing (NGS) has been the standard method for detecting SARS-CoV-2 variants. Although NGS-based assays can confirm the variants, they are expensive, time consuming and not widely available, which limits their utility for large-scale detection and monitoring of SARS-CoV-2 variants. There has been an urgent need for testing platforms that can detect these variants of concern rapidly and cost-effectively.
In this study we applied Molecular Clamping Technology using xenonucleic acids (XNA) and developed a multiplex reverse-transcription qPCR assay that can accurately and quickly detect known and emerging SARS-CoV-2 mutations (Fig 1).

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/04/05/2021.04.01.21254484/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2021/04/05/2021.04.01.21254484/F1)

Figure 1. A high-throughput XNA-based Molecular Clamping Technology for SARS-CoV-2 variant detection.

Xenonucleic acids (XNAs) are artificial genetic polymers that retain Watson-Crick base-pairing capability. They were originally developed to store genetic information and evolve in response to external stimuli. For practical applications in disease diagnosis and treatment, they can also function as a source of nuclease-resistant affinity reagents (aptamers) and catalysts (xenozymes). More notably, they can be employed as molecular clamps in quantitative real-time polymerase chain reactions (RT-qPCR) or as highly specific molecular probes for detection of nucleic acid target sequences, because XNA/DNA duplexes hybridize more strongly than DNA/DNA duplexes [14]. Furthermore, even a single base-pair mismatch in an XNA/DNA duplex can lower the melting temperature (Tm) by 10-18°C [15]. This allows highly specific clamping of the XNA molecule onto the targeted sequence (usually wild type, WT) to block WT amplification, thus minimizing the WT background in RT-qPCR and selectively enhancing the signal of the mutant. These robust XNAs have been used extensively in *in vitro* diagnostic assays for detecting cancer-associated gene mutations [16]. Theoretically, XNAs should also work in RT-qPCR-based detection of SARS-CoV-2 mutations. Our report here represents the first attempt to apply XNA-based QClamp technology in a SARS-CoV-2 mutation assay.
We expect that this fast, reliable and inexpensive mutation detection assay can be made widely available for detecting and monitoring known and emerging SARS-CoV-2 variants during the ongoing COVID-19 pandemic. Availability of and easy access to SARS-CoV-2 mutation testing can also help enable real-time tracking of virus transmission chains and routes in various regions of the world [17].

## Methods

### Study design and ethics

Deidentified leftover patient nasopharyngeal swab (NPS) and saliva samples were used in the study. All patient specimens were collected in January and February 2021 and previously tested at the UCSF-affiliated San Francisco VA Medical Center clinical laboratory and the DiaCarta CLIA-certified clinical laboratory for clinical diagnostic or screening purposes. Other than qualitative RT-PCR results (positive or negative), only PCR cycle threshold (Ct) values were included in the study analysis, and no patient clinical chart reviews were performed. This study was approved by the institutional review board (IRB) at UCSF (UCSF IRB #11-05207) as a no-subject contact study with waiver of consent and as exempt under category 4.

### Sample collection

Saliva and NPS samples of patients were used in the study. All patient specimens were collected, tested and reported in January and February 2021 and subsequently stored in a −80°C freezer. A total of 278 positive samples were selected for this study, including 139 collected in mid-January and 139 collected in late February. Other than the qualitative RT-PCR results (positive or negative), only PCR cycle threshold (Ct) values were obtained and included in this study analysis.

### RNA extraction

The automatic RNA/DNA extraction instrument MGISP-960 (MGI Tech Co., Ltd.) and the MGI Easy Nucleic Acid Extraction Kit (Cat# 1000020261) were used for SARS-CoV-2 viral RNA extraction according to the manufacturer’s instructions.
Briefly, 180 µL of each nasopharyngeal swab sample or saliva sample was used for extraction. For each batch of clinical samples to be tested, an RNA extraction control (EC) was included, prepared by spiking 20 µL of EC from the QuantiVirus™ SARS-CoV-2 multiplex kit (DiaCarta, Inc.) into 180 µL of sterile RNase-free water. The clinical samples and spiked EC were processed and extracted on the MGI platform. The extraction output is RNA in 30-40 µL of RNase-free water, 1 µL of which is used for the RT-qPCR reaction. Precautions were taken while handling extracted RNA samples to avoid RNA degradation. Extracted RNA samples were stored at −80°C if not immediately used for RT-qPCR. The turnaround time from sample extraction to the final PCR report is about 4 hrs (Fig 1) [18].

### Multiplex primer and probe design

We targeted the conserved regions of the E gene and ORF1ab gene [18], and the areas adjacent to the N501Y and D614G mutations in the SARS-CoV-2 genome, to design primers and probes for the detection of all SARS-CoV-2 variants of concern. Similarly, we designed primers and probes for the human RNase P gene, used as the RNA extraction control. Gene sequences were retrieved from the GenBank and GISAID databases for primer and probe design to ensure coverage of all SARS-CoV-2 variant strains. Multiple alignments of the collected sequences were performed using Qiagen CLC Main Workbench 20.0.4, and conserved regions in each target gene were identified using BioEditor 7.2.5 prior to primer and probe design. Primers and probes were designed to target the most conserved regions of each target gene of the viral genome, using Primer3Plus software and following the general rules of real-time PCR design. All primers were designed with a melting temperature (Tm) of approximately 60 °C, and the probes were designed with a Tm of about 65 °C. Amplicon sizes were kept to a minimum, within the range of 70 bp to 150 bp for each primer pair, to achieve better amplification efficiency and detection sensitivity.
All designed primers and probes were ordered from Integrated DNA Technologies, Inc. (IDT, Coralville, IA, USA) and LGC Biosearch Technologies (Novato, CA, USA), respectively.

### XNA design, synthesis, purification and analysis

The xenonucleic acids (XNAs) for the SARS-CoV-2 N501Y and D614G mutations were designed to match the wild-type (WT) sequences so that they selectively block qPCR amplification of the WT targets. Each XNA was also designed to partially overlap with, and be of the same strand/sense as, the corresponding fluorescent probe.

All chemicals and solvents were of ACS grade or higher, purchased from Sigma, Fisher Scientific, Beantown Chemicals, Midland Scientific and other commercial sources. XNAs were synthesized via the classic solid-phase peptide synthesis (SPPS) method on an INTAVIS MultiPep automatic synthesizer (INTAVIS Bioanalytical Instruments AG, Cologne, Germany; now a subsidiary of CEM Corporation) [19]. Commercially available primary Bhoc-protected monomers with an aminoethylglycine (AEG) backbone (Fmoc-“A”, Fmoc-“T”, Fmoc-“C”, Fmoc-“G”), plus the terminus-modifying monomers Fmoc-D-Lysine(tBoc) and Fmoc-“O” spacer/linker, were used as the starting materials. TentaGel Resin (from INTAVIS) was used as the solid support for multiple-sequence parallel synthesis at a typical 3-µmol scale using INTAVIS mini columns. The stepwise SPPS process follows standard Fmoc chemistry, proceeding from the 3’ toward the 5’ direction in DMF medium, mainly using HATU for coupling and piperidine for deprotection within each cycle of solid-phase synthesis. The cycle was repeated until the entire XNA sequence was complete. After solid-phase synthesis, the crude product was obtained by a cleavage/deprotection step with a TFA-based cocktail containing 3-5% triisopropylsilane to minimize side reactions.
The resulting off-white crude product then underwent a fast purification procedure by size-exclusion chromatography (SEC) with G-25 Sephadex gel (GE Healthcare). For each of the 7 XNAs synthesized, immediately after the SEC step, the highest-concentration fraction was identified as the main product by UV quantification at 260 nm (NanoDrop ND-1000). It was subsequently analyzed by RP-HPLC (Agilent 1100 HPLC system, Aeris XB-C18 HPLC column of 100 x 4.5 mm, UV detection at 260 nm, column temperature 50 °C) and by mass spectrometry (Shimadzu Axima MALDI-TOF mass spectrometer at the UCSF DeGrado Lab). The characteristic XNA identity and molecular weight were confirmed for all 7 XNA products. The selected SEC fraction of each XNA was then used for the RT-qPCR assay.

### Optimization of RT-qPCR with XNA

We created three XNAs for N501Y and four XNAs for D614G. A serial-dilution XNA qPCR test for N501Y / D614G was conducted to select the best XNA and the optimal XNA concentration for the RT-qPCR test.

### Real-time reverse-transcription PCR (rRT-PCR or RT-qPCR)

The XNA-based RT-PCR reactions were set up as follows: a total volume of 10 µL, including 1.0 µL of RNA, 2.0 µL of primer and probe mixture (final concentrations of 0.2 µM and 0.1 µM, respectively), 4.5 µL of XNA mixture (8 µM of D614G XNA001 and 0.75 µM of N501 XNA003), and 2.5 µL of 4x QuantiVirus SARS-CoV-2 One-step qRT-PCR Master Mix (Catalog# A28526, Thermo Fisher, Waltham, MA). The qPCR was performed at 25 °C for 2 min for uracil-N-glycosylase (UNG) incubation to remove potential carryover and at 53 °C for 10 min for reverse transcription, followed by 95 °C for 2 min and then 45 cycles of 95 °C for 3 sec and 59.5 °C for 30 sec. The QuantStudio™ 5 Real-Time PCR System (Thermo Fisher, USA), BioRad CFX384 (Bio-Rad, USA) and Roche LightCycler 480 II (Roche, USA) were used for rRT-PCR amplification and detection [18–20].
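The reaction setup and cycling program above can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' workflow: it reads the master mix volume as 2.5 µL (the value that brings the components to the stated 10 µL total), the step labels for the 95 °C and 59.5 °C stages are descriptive assumptions, and the 10% pipetting overage in `premix_volumes_ul` is a hypothetical allowance that does not come from the paper.

```python
# Per-reaction volumes in microliters, taken from the reaction setup above.
# The master mix volume is read as 2.5 uL so that the parts sum to 10 uL.
PER_REACTION_UL = {
    "RNA template": 1.0,
    "primer/probe mix": 2.0,
    "XNA mix": 4.5,
    "4x master mix": 2.5,
}

# (step label, temperature in C, hold in seconds, number of cycles)
# Labels for the last three steps are assumptions; the source gives only
# temperatures and times for them.
THERMAL_PROFILE = [
    ("UNG incubation", 25.0, 120, 1),
    ("reverse transcription", 53.0, 600, 1),
    ("initial denaturation", 95.0, 120, 1),
    ("denaturation", 95.0, 3, 45),
    ("annealing/extension", 59.5, 30, 45),
]


def total_reaction_volume_ul(volumes=PER_REACTION_UL):
    """Sum of all per-reaction component volumes."""
    return sum(volumes.values())


def premix_volumes_ul(n_reactions, overage=0.10):
    """Volumes of the shared reagents for n reactions.

    RNA is excluded because it is added to each well separately.
    The default 10% overage is a hypothetical pipetting allowance.
    """
    factor = n_reactions * (1.0 + overage)
    return {
        name: round(volume * factor, 2)
        for name, volume in PER_REACTION_UL.items()
        if name != "RNA template"
    }


def hold_time_seconds(profile=THERMAL_PROFILE):
    """Programmed hold time only; ramping between temperatures is ignored."""
    return sum(seconds * cycles for _, _, seconds, cycles in profile)


if __name__ == "__main__":
    print("total volume (uL):", total_reaction_volume_ul())
    print("premix for 96 wells (uL):", premix_volumes_ul(96))
    print("hold time (min):", hold_time_seconds() / 60)
```

Under these assumptions the components sum to 10 µL, the 2.5 µL of 4x master mix corresponds to 1x in the final reaction, and the programmed hold time is 2325 s (about 39 min) before instrument ramping is counted.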
### Sanger sequencing verification

All samples that previously screened positive by qPCR were sent out for Sanger sequencing to confirm their mutational status (SequeTech, CA, USA), and all sequences were analyzed via the UCSC SARS-CoV-2 Genome Browser [21].

### Analysis of the assay sensitivity

The analytical sensitivity of our multiplexed RT-qPCR test on the BioRad CFX384 was assessed to obtain limit of detection (LoD) data. We had previously confirmed that the Ct is ~34.28 for ORF1ab and ~34.24 for the E gene in the 4plex assay at a viral concentration of about 100 copies/mL [18]. Because no SARS-CoV-2 variant reference material is commercially available, we used these values to estimate the assay sensitivity. We used a two-fold dilution series of the templates from 800 copies/mL to 25 copies/mL, in triplicate, and confirmed the lowest concentration that was detectable with 95% confidence.

## Results

### XNA design and synthesis

The XNA sequences were designed in the context of rational primer/probe selection for the chosen target gene sequence and were intended to match the wild-type target DNA sequence perfectly. The best sequences (3-4 sequences per target) were iteratively adjusted and selected based on multiple major physicochemical factors: sequence length, GC content, purine content and arrangement, self-complementarity, and melting temperature. High overall synthetic yields were obtained for the D614 XNA group (about 5-10%) and good yields for the N501 XNA group (~5%), which translates to an average single-cycle yield of 85-92% across all XNA syntheses here. MALDI-MS spectra of two representative XNA biomolecules, D614 XNA001 and N501 XNA003, are shown in Figure 2 and Supplementary Table 1.

[Table 1.](http://medrxiv.org/content/early/2021/04/05/2021.04.01.21254484/T1)

Table 1.
Estimated Limit of Detection (LoD) of the Multiplex RT-qPCR

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/04/05/2021.04.01.21254484/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2021/04/05/2021.04.01.21254484/F2)

Figure 2. MALDI-MS spectra of two representative XNA biomolecules: (a) D614 XNA001; (b) N501 XNA003. Molecular weight was determined from the measured cationic (M+H)+ peak of the singly charged parent ion.

### XNA enhances discrimination of mutant from wild-type SARS-CoV-2

To test whether XNA clamps the wild type and enhances mutant detection, we compared RT-qPCR with and without XNA. The amplification curves of the D614G mutant versus the wild type without XNA showed that it is difficult to distinguish mutant from wild type (Supplementary Figure 1a). With XNA, however, the mutant is easily distinguished from wild-type SARS-CoV-2 (Supplementary Fig 1b). To optimize the assay, we tested different concentrations of XNA. Examples of the XNA optimization results are displayed in Supplementary Tables 2 & 3. The XNA that generated the highest delta Ct value between wild type and mutants was selected for the next step in RT-qPCR. Based on the results, we selected D614G XNA 001 & N501Y XNA 003 (Figure 2).

[Table 2.](http://medrxiv.org/content/early/2021/04/05/2021.04.01.21254484/T2)

Table 2. Summary of multiplex qPCR testing results of the 278 clinical samples collected in mid-January and late February 2021.

### Analytical sensitivity

We performed the QClamp multiplex RT-qPCR with XNA on various instruments (BioRad CFX384, Roche Light Cycler and QS5), and all the results were consistent. We diluted the patient sample from 800 copies/mL down to 25 copies/mL and repeated the RT-qPCR. Because we had previously confirmed that the Ct is ~34.28 for ORF1ab in the 4plex SARS-CoV-2 detection assay at a viral concentration of about 100 copies/mL [18], we used this value to estimate the assay sensitivity.
The data showed that the Ct values were around 33.4 for ORF1ab (wild type) when the estimated viral RNA concentration was 100 copies/mL. At this concentration the N501Y and D614G mutants were detected with Ct values of 34.07 and 33.78, respectively (Table 1), indicating that the analytical sensitivity (limit of detection, LoD) of the assay is approximately 100 copies/mL.

### UK variant spread in the Bay Area of California

To test whether the UK variant is present in the San Francisco Bay Area, we screened 139 known SARS-CoV-2 positive samples from January 2021 and 139 positive samples from February 2021. Among the 139 positive specimens collected in mid-January 2021, 58 (41.7%) were positive for D614G only and not for N501Y (Table 2). However, among the 139 positive specimens collected at the end of February 2021, 7 (5.04%) were positive for both the N501Y and D614G mutations, consistent with the UK variant (B.1.1.7) (Table 2). Furthermore, the multiplex RT-qPCR test showed high specificity: amplification of all non-N501Y samples and negative controls was inhibited in the qPCR, while only the N501Y variants and positive controls were amplified (Figure 3).

![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/04/05/2021.04.01.21254484/F3.medium.gif)

[Figure 3.](http://medrxiv.org/content/early/2021/04/05/2021.04.01.21254484/F3)

Figure 3. A representative qPCR amplification curve of SARS-CoV-2 variant N501Y. PC, positive control; WT, wild type; NC, negative control.

To verify whether these fifty-eight D614G mutant and seven D614G/N501Y mutant samples were truly mutants, we tested the amplicons of these samples by Sanger sequencing. All of the 58 D614G-positive and 7 D614G/N501Y-positive mutant samples were confirmed by Sanger sequencing. The Sanger sequencing peaks showed the target mutant on viral cDNA T