Abstract
The current monkeypox outbreak as a global health emergency now spreads into more than 90 counties. Rapid and accurate diagnosis of the infections is fundamental for the viral disease containment. Here, we analyzed the primer and probe sequences in the CDC recommended monkeypox virus (MPV) generic real-time PCR assay by aligning against 683 reported MPV genomes worldwide. Sequence mismatches were found in more than 92% of genomes for the MPV generic forward and reverse primers. We also evaluated the specificity of seven other real-time PCR assays for MPV and orthopoxvirus (OPV) detection, and identified two assays (MPV_F3L and OPV_F8L) with the highest matching score (>99.4%) to the global MPV genome database. Genetic variations on the MPV genomes corresponding with the real-time PCR primer and probe regions were further specified and used to indicate the temporal and spatial emergence pattern of monkeypox disease. Our results show that the current MPV real-time generic assay may be unsuitable to accurately detect MPV, highlighting the need to develop new detection assays.
Introduction
Our ability to detect emerging and reemerging infectious disease agents has never been as important as it is today due to growing human populations and global travel. While the SARS-CoV-2 is gradually becoming an endemic virus, the world is now seeing the re-emergence of monkeypox virus (MPV). As of August 9, 2022, there are over 8,900 reported cases in the United States and over 30,000 cases in 88 countries (1). The World Health Organization (WHO) has declared a public health emergency of international concern on July 23, 2022 to respond the rapid spread of MPV worldwide.
Monkeypox virus in the Orthopoxvirus (OPV) genus is a zoonotic double-stranded DNA virus with a ∼197 kbp genome (2,3). In 1970, monkeypox was first identified in humans in the Democratic Republic of The Congo and Africa regions (4). Sporadic monkeypox cases were reported outside of African (U.S. and Europe), however, the infections were either related with international travel or exposure to animals imported from Africa (5,6). The first MPV case associated with the present outbreak was reported on May 6 in the United Kingdom, followed by cases in Spain, Portugal, and Canada (5,7). On May 18 cases were reported in Belgium, Italy; on May 20 cases were reported in Australia, France, Germany, and the Netherlands; and on May 21 cases were reported in Switzerland and Israel (7). Such a wide outbreak within two weeks in multiple counties across continents suggest that early non-reported transmission occurred before May 2022 (8).
The polymerase-chain reaction (PCR) has been used in the detection and diagnosis of viral infections such as SARS-CoV-2. On June 6, 2022 the Centers for Disease Control released a real-time PCR test using generic primers and probe for the detection of MPV (9) based on previous work conducted in 2010 describing the detection of a West African MPV strain and a Congo Basin MPV strain (10). This MPV generic assay has been used to detect human MPV infections and MPV present in wastewater samples (11,12). Several other real-time PCR assays have also been reported for MPV or OPV detection in clinical samples (10,13–17). However, the specificity and suitability of these generic primers and probe need to be carefully addressed because most of the primers and probes (generic or lineage-specific) were based on sequenced genomes obtained from MPV strains circulating more than a decade ago (2002 to 2009). Given the fast evolution of viral species including DNA viruses (18,19) it is likely that mutations have emerged in the oligos’ targeting regions for current MPV strain. The present report analyzes the oligos sequences used in conjunction with the current real-time MPV assays by aligning primer sequences against 683 reported MPV complete genomes from GISAID as of August 5, 2022. This analysis identified wide variations in the primer and probe sequences compared to the genome of currently circulating MPV stains. Our results indicate that the current MPV real-time generic assay may be unsuitable to accurately detect MPV and highlights the need to develop new detection assays.
Materials and Methods
Data source
The MPV genomes were downloaded from GISAID (https://gisaid.org/). As of August 5, 2022, 683 genomes with complete metadata were available, including 634 genomes (samples) collected from May 19, 2022, to July 29, 2022. The rest 49 samples include 7 from 1962 to 2005, 11 samples in 2006, 12 samples in 2007, 7 samples in 2017, 8 samples in 2018, 2 samples in 2019, and 2 samples in 2021. Metadata includes accession ID, sample collection date, sequence data submission date, and sampling location (Continent/country) associated with the sequences. The monkeypox virus reference genome Zaire-96-I-16 was downloaded from NCBI (NC_003310.1). MPV real-time assays (primers and probes) were from published studies including MPV Generic assay (9), MPV_O2L (13), MPV_F3L (14), MPV_C3L (10), MPV_G2R_WA (10), OPV_B2R (15), OPV_I7L (16), and OPV_F8L (17).
Sequence alignment
Genome sequences were downloaded as .fasta and imported into R (version 4.1.3) as the MPV genome database for analysis. We aligned each of the oligo (including forward primer, reverse primer, and probe) sequence to the database and computed the percentage of matches and non-matches. For non-matches, sequence was extracted and aligned by SnapGene software (https://www.snapgene.com/) to identify the specific nucleotides that are distinct from sequence in the MPV oligos. If the ‘mismatch’ in the targeted genome region is the correct degenerate codes, we treated it as ‘match’. If the ‘mismatch’ is ‘N’ or multiple ‘N’, we treated as ‘non-match’. The major types of sequence variation (mutation) were listed in Figure 1 and Figure 3A. In total, the original sequence and identified sequence variation account for 99.4% ∼ 100% of the total genomes for each of the MPV real-time assays. Four genomes (0.6%) were excluded in the MPV_F3L assay (forward primer) because non-match was found in the genome. These analyses were performed with customized R script.
(A) Alignment of MPV generic forward primer, reverse primer, and probe sequences (recommended in the CDC test procedure for MPV (9)) to the MPV reference genome (MPV-Zaire-96-I-16). The corresponding oligo sequences with mismatches found in the genome database for forward (MPV-F-mu) and reverse primers (MPV-R-mu) were also added on top of the reference genome. Red nucleotides indicate the variations identified in the 683 genome database and the nonsynonymous mutation is marketed by a red box. (B-C) The abundance of MPV-F (generic) and MPV-F-mu (mutated), MPV-R (generic) and mutated (MPV-R-mu) in the currently available 683 MPV genomes worldwide (B), and in the 58 U.S. genomes (C).
Mutation-based epidemiological analysis
For each MPV assay, we identified the major variation type(s) in the genome database and then assigned an identity as ‘match’ or ‘mutation’ to each of the genome for the specific oligo. Two mutation types were found for the MPV_G2R_WA assay and were labeled as ‘Mutation1’ and ‘Mutation2’, respectively. Merging with the metadata associated with the genome, we summarized the number of genomes for ‘match’ and ‘mutation’ groups based on the sampling collection time (year), sampling location (continent). We also sub-grouped the database by only selecting the year ‘2022’ to compute the percentages of ‘mutation’ in the current MPV outbreak.
Results
Generic MPV forward and reverse primers do not match the majority of reported genomes
We investigated whether forward and reverse primers and probe in the MPV generic assay match to the reported MPV genomes. As of August 5, 683 complete genomes with metadata have been uploaded in the GISAID. Of the 683 genomes, 634 (92.8%) were collected between May 19 and July 29, 2022. We then aligned each of the MPV generic oligo sequences to the genome database. Of the total 683 genomes, the MPV generic forward sequence (MPV-F, Figure 1A) had 100% identity to sequences present in 28 (4%) of the genomes. A single-nucleotide mutation A194165G (synonymous mutation, MPV-F-mu in Figure 1A) was found in 96% of the published MPV genomes (Figure 1B), and 100% in the genomes published from May to July 2022. Similar observations were also found in the generic reverse primer (MPV-R), which only matches 8% of the genomes. A nonsynonymous substitution, C194233T in the reverse primer region, was observed in 92% of the genomes (equivalent to 98.4% of genomes released from May to July 2022). In contrast, the probe sequence (MPV-P) is conserved and matches all of the genomes in the database.
We further compared the MPV generic primers and probe with the sequenced samples in the U.S. Of the total 58 genomes, the A194165G mutation in the forward primer region was observed in all the U.S. genomes, and the C194233T mutation observed in the reverse primer region was found in 93% of the U.S. genomes (Figure 1C), which are consistent with the results in Figure 1B. These results demonstrated that the CDC recommended forward and reverse primers designed for MPV testing do not match the major genome sequences in recent samples with MPV.
Evaluation of the specificity of other primers and probes for monkeypox virus and orthopoxvirus detection
Next, we evaluated seven other developed real-time PCR assays, targeting O2L (13), F3L (14), C3L, and G2R (10) genes for MPV, B2R (15), I7L (16) and F8L (17) genes for OPV detection. The selected targets are generally conserved regions within viral genomes to maximize viral detection (13,15). We aligned primer and probe sequences used in each of the seven assays against the 683 MPV genomes and computed percent identity. As shown in Table 1, the four reported assays designed for MPV detection have varied results. The primers in MPV_O2L have a >99% match with the database, but the probe only matches 4.1% of the genomes. A perfect match was not found for the MPV_C3L assay, which was used for the detection of the MPV Congo Basin strain. Primers and probe sequences used in the MPV_F3L assay showed >99.4% alignment against the genome database, suggesting its suitability for detection of the MPV-2022 outbreak. Primers used in the MPV_G2R assay designed to detect the West African strain showed a 100% match to the database and 95.6% match for the probe thereby validating previous phylogenetic analysis results that the current MPV lineage belongs to the West Africa clade (20).
Evaluation of the specificity of primers and probes for monkeypox virus (MPV) or orthopoxvirus (OPV) detection.
We further evaluated three real-time assays (OPV_B2R, OPV_I7L, and OPV_F8L) for the detection of viruses belonging to the genus orthopoxvirus, which includes monkeypox virus, camelpox virus, cowpox virus, and vaccinia virus (21). OPV_B2R primers highly match the database genomes, but its probe sequence has no match (Table 1). OPV_I7L has good matches for the forward primer and probe, but its reverse primer only matches 8.2% of the genomes. OPV_F8L oligos for the non-variola OPV generic test recommended by CDC has a high match percentage (>99.6%) with the genomes (Table 1). Thus, the MPV_F3L and OPV_F8L assays have the highest match rates to the sequenced MPV genomes.
Genetic variations across real-time PCR assays for MPV or OPV detection and epidemiological analysis
We next investigated genomic sequences in conserved regions that differ between the real-time PCR assays. About 96% of genomes have a G->T mutation in the probe of MPV_O2L assay, and this mutation is present across all MPV-2022 outbreak genomes (Figure 2A). Further analysis using the sample collection date found that this G->T mutation appeared in two samples in 1962 and 1970, was not detected in samples from 1978 to 2007, and re-emerged from 2017 onward (Figure 2B). For the OPV_I7L assay reverse primer, a C->T mutation was observed at the 3’ end in about 92% of all genomes and in 98.7% of genomes sequenced in 2022. It is interesting that this mutation is mostly observed in samples from Europe, South, and North America whereas the original sequence was more common in samples from Africa and Asia (Figure 2C). A G->C substitution in the probe sequence of OPV_B2R was observed in nearly all genomes sequenced in the MPV-2022 outbreak (Figure 2A).
(A) Genetic variations in the MPV genome database compared to primer and probe sequences used in detection assays. (B) Temporal analysis for the mutation in MPV_O2L probe sequence. This mutation is found in all the genomes in 2022. (C) Proportions of genomes for the reverse primer used in the OPV_I7L assay with or without mutation. (D) MPV_G2R_WA probe sequence and its two mutation types across the genomic database. One Mutation1 sample (green dot) was found in 2022 (overshadowed by the blue dot-Mutation2).
The MPV_G2R_WA assay probe has a 95.6% match to the database sequences (Figure 2A). Two mutation types were found for the rest of 4.3% genomes. A 7-nucleotide mutation (Mutation1) was found in 4.1% of genomes from 1978 to 2007 (Figure 2A and 2D). Mutation1 reappears in a sample obtained on June 10, 2022, from Luxembourg. Although 99.7% of the MPV genomes are conserved in this probe region, one new C->T synonymous mutation (Mutation2) (Figure 2A) was identified from a sample obtained on May 18, 2022 from Canada. Those results suggest that the genetic variations on the MPV genomes corresponding with the real-time PCR primer and probe regions could be used to indicate the temporal and spatial emergence pattern of monkeypox disease.
Discussion
Accurate diagnostics are fundamental for successful epidemic control (22). PCR-based molecular methods are widely used for pathogen detection in clinical and environmental samples. As a rapid and reliable tool, TagMan real-time PCR offers high sensitivity and specificity that targets a respective pathogen’s genome by using designed probe, forward and reverse primers. Thus, the exact matches of the three oligos with the target genome are required for sensitive and accurate diagnostics. Here, we analyzed the sequence of primers and probe used in the CDC recommended MPV generic real-time PCR assay as well as seven other MPV or OPV testing assays by aligning respective primer/probe sequences against published MPV full genomes. Our results showed mismatches in more than 92% of genomes for the MPV generic forward and reverse primers (Figure 1). One mutation C194233T is close to the 3’ end of the reverse primer, suggesting it may significantly impact the binding efficiency during amplification. Experimental investigations are underway to quantify the impacts of those mismatches in primers on detection sensitivity and efficiency for the new MPV lineage.
Of the seven other assays evaluated for MPV or OPV detection, MPV_F3L and OPV_F8L are found to have the highest matches (>99.4%) to the global genome database. However, it is necessary to mention that sequence match is one of the parameters determining the detection sensitivity and specificity, other parameters such as primer dimer, GC content, the template DNA, and inhibitors in the sample may also impact the diagnostic efficiency (23,24). Thus, further experimental tests are needed to validate their amplification efficiencies and limit of detections for the present MPV strain.
In summary, our analysis shows that primers and probe in the recommended MPV generic real-time assay harbor wide deviations from the sequenced MPV genomes, especially the genomes released in current monkeypox outbreak. Thus, it is critical to develop updated assays for accurate and specific detection of MPV to inform public health risk mitigation strategies and contain the outbreak.
Declaration of Competing Interest
The authors declare no competing interest.
Code Availability
All data and code produced in the present study are available upon reasonable request to the authors.
Acknowledgement
This work is supported by Faculty Startup funding from the Center of Infectious Diseases at UTHealth, the UT system Rising STARs award, and the Texas Epidemic Public Health Institute (TEPHI) to F.W.