Abstract
To better understand the factors underlying the continued incidence of clinical episodes of falciparum malaria in E-2020 countries targeting elimination, we have characterised Plasmodium falciparum disease transmission dynamics after a clonal outbreak on the northwest coast of Ecuador over a period of two years. We apply a novel, high-resolution genotyping method, the “varcode” based on a single PCR to fingerprint the DBLα region of the 40-60 members of the variant surface antigen-encoding var multigene family. Var genes are highly polymorphic within and between genomes, with var repertoires rapidly evolving by outcrossing during the obligatory sexual phase of P. falciparum in the mosquito. The continued incidence of clinical malaria after the outbreak in Ecuador provided a unique opportunity to use varcodes to document parasite microevolution and explore signatures of local disease transmission on the time scale of months to two years post-outbreak. We identified nine genetic varcodes circulating locally with spatiotemporal parasite genetic relatedness networks revealing that diversification of the clonal outbreak parasites by sexual recombination was associated with increased incidence of clinical episodes of malaria. Whether this was due to chance, immune selection or sexual recombination per se is discussed. Comparative analyses to other South American parasite populations where P. falciparum transmission remains endemic elucidated the possible origins of Ecuadorian varcodes. This analysis demonstrated that the majority of clinical cases were due to local transmission and not importation. Nonetheless, some of the varcodes that were unrelated to the outbreak varcode were found to be genetically related to other South American parasites. Our findings demonstrate the utility of the varcode as a high-resolution surveillance tool to spatiotemporally track disease outbreaks using variant surface antigen genes and resolve signatures of recombination in an E-2020 setting nearing elimination.
Introduction
In 2016 the WHO identified 21 countries, known as the E-2020 countries, having the potential to eliminate local transmission of malaria by 20201, with seven of them in Latin America at the time (Paraguay and El Salvador have since been declared malaria-free). Malaria transmission in many of these E-2020 countries is epidemic/unstable with risks of outbreaks and resurgent malaria 2–4. The elimination target requires a local reduction to zero incidence of indigenous cases. As progress is made towards elimination in these countries, it becomes critical to understand risk factors for disease transmission while maximizing limited funds and resources to examine individual cases. Ecuador is one of the five remaining E-2020 countries in Latin America. National malaria elimination efforts have largely focused on tropical areas, specifically the northwest coast and the Amazon region. These areas border non E-2020 countries, Colombia and Peru, that still have endemic transmission. Thus, rather than low transmission, Ecuador has epidemic/unstable P. falciparum transmission with P. vivax the dominant species causing clinical malaria infections.
Clinical cases caused by P. falciparum in Ecuador are mostly concentrated in the northwest coast, where a steady decrease in annual reported clinical cases was observed from 2009 (161 cases) until late 2012 (44 cases)5–7. A localized outbreak from November 2012 until November 2013 in Esmeraldas City documented 151 P. falciparum cases, the majority of cases (76%) in the northwest coast and Amazon region during this time period8. Population genetic analyses using microsatellite markers demonstrated that this outbreak was caused by the clonal expansion (i.e., single-source) of a residual parasite lineage previously reported on the north coast of Peru and on the coast of Ecuador8. Immediately after the outbreak, 31 cases were reported in 2014 and 189 cases in 2015 5, 9,10. It was unclear whether the clonal outbreak had contributed to P. falciparum clinical cases since 2013, with the factors underlying sustained disease transmission in Ecuador poorly understood. Moreover, Ecuador was categorized as “off-track” in the most recent E-2020 progress report and has not met the elimination target due to malaria resurgence11, 12. This points to an urgent need to better understand disease transmission patterns and whether clinical cases are due to imported or locally-acquired parasites.
Molecular surveillance of malaria in Latin America has focused on non E-2020 countries, e.g. Colombia, Brazil, Peru and Panama where malaria is endemic and transmission is moderate to low8, 13–19. This surveillance has relied on genotyping neutral molecular markers such as microsatellites or single-nucleotide polymorphisms. However, neutral markers, considered relatively slowly evolving, may not provide enough discriminatory resolution to define parasite evolution in relevant epidemiological time (1-2 years) in areas of low or epidemic and unstable transmission20, 21. Genes under selection can provide an alternative but complementary view of microevolution and they are key to examine population adaptation 22.
In this context, the microbiological paradigm for the surveillance of diverse pathogens is to study transmission dynamics using the genes encoding the major surface antigen to reflect pathogen transmission dynamics in relation to immune selection23, 24. The failure to observe this paradigm in the malaria field has stemmed from both the misconception that neutral markers track transmission dynamics in relation to immunity and the fact that the genetics of the var genes encoding the major surface antigen of the blood stages (PfEMP1) are considered “too complex”. Single copy antigen genes under much less selection have become an alternative25. Instead, we have embraced the complexity of the var system to understand the population genetics of var genes23, 26–31 in multiple locations in Africa, South America, and globally to present here a way to explore transmission dynamics through the lens of these immune evasion genes.
Each genomic var repertoire of P. falciparum consists of 40 to 60 members of the var multigene family with evidence that South American parasites may have fewer var genes, e.g. 42 var genes (excluding var2csa) reported in the Honduran laboratory strain HB3 after whole genome sequencing and assembly 32, 33. Our analysis of a South American dataset with a global dataset also showed fewer var genes in South American isolates as did a Brazilian data set 31, 34. As var genes lie on almost all of the 14 chromosomes of the haploid P. falciparum genome35 they undergo independent assortment during conventional meiosis to create new var repertoires as recombinants of parental var repertoires. During infection, differential switching between these genes in a repertoire enables immune evasion in the human host36, 37 thereby regulating parasite density and risk of clinical infection38. Indeed, several serologic studies examining anti-PfEMP1 antibody responses in naturally-infected children have shown that the acquisition of anti-PfEMP1 protective immunity is variant-specific and susceptibility to clinical episodes is dependent on gaps in the individual’s pre-existing antibody repertoire38–47.
A body of work defining the population genomics of the var multigene family by ourselves and others48 has demonstrated that var genes provide a promising biomarker for surveillance over epidemiological time scales (1-10 years). Var DBLα sequences can resolve population structure at global, continental, regional and local scales in areas of varying endemicity in Africa, Oceania, and South America23, 26–31, 49. Characterization of these major surface antigen genes also provides information about circulating repertoires of variants to enable measurement of immune responses. Translation of var population genetics in different epidemiological settings leads us to propose a new surveillance approach we call varcoding. This is essentially a fingerprinting method to describe the diversity of these genes within and between human hosts as well as repertoire diversification by sexual recombination, with Bayesian statistics to account for missing data (Fig. 1).
Defining the 2012-2013 clinical disease outbreak in Ecuador as a baseline, we applied the varcode methodology to characterize the transmission dynamics of clinical P. falciparum cases during and up to two years after the outbreak. Reconstruction of parasite genetic relatedness networks in space and time revealed that parasites with recombinant var repertoires were predominantly associated with the continued incidence of clinical cases after the outbreak. Further comparative analyses to published data from South American isolates28 elucidated possible origins of Ecuadorian parasites, demonstrating that the majority of clinical cases were due to local transmission and not importation. Our findings also demonstrate the translational application of the varcode as a high-resolution tool to detect, spatiotemporally track immune evasion genes as well as resolve signatures of recombination in E-2020 or low transmission settings.
Results
Varcoding was completed for 58 P. falciparum isolates that were collected between 2013 and 2015 in endemic areas of the northwest coast and Amazon region of Ecuador (Fig. 2) from individuals of all ages presenting with malaria symptoms and confirmed P. falciparum-positive by microscopy or rapid diagnostic tests. These isolates represent 21% of the total cases reported in 2013, 61% in 2014 and 3% in 2015 (60% of the cases reported in January, 7% in May and 8% in November). For more details on the sampling scheme and bioinformatics workflow see Methods and Supplementary Fig. S1. Overall, we identified 195 unique var DBLα variants or types from 2,141 DBLα sequences from the 58 isolates, representative of the diversity circulating in Ecuadorian P. falciparum populations, as indicated by accumulation curves approaching a plateau (Fig. S2).
Defining varcodes circulating in Ecuador
To define distinct varcodes circulating locally in Ecuador, we estimated parasite genetic relatedness by calculating the similarity index, Pairwise Type Sharing (PTS) and a Bayesian version of the same (Bayesian pairwise type sharing; BPTS, see Methods). BPTS estimates include statistical uncertainty due to differences in isolate repertoire size (i.e. the number of DBLα types identified in each isolate repertoire), enabling a rigorous examination of relatedness. The median repertoire size in Ecuadorian isolates was 37 DBLα types (range: 11-43 types, Table S1), which is in line with the expected number of var genes in the Honduran laboratory reference strain HB3 (n=42 var genes excluding var2csa, 33). Nonetheless, we used a uniform prior distribution for repertoire sizes between 40 and 50 types, which accounted for the possibility of missing data in total isolate repertoire sizes using degenerate primers for var DBLα PCR. For a further analysis to validate our PCR and sequencing methodology, including a further sub-classification of DBLα types into upsA and non-upsA groups and their respective isolate genomic proportions, see Methods and Supplementary Text 1.
We constructed genetic relatedness networks at a threshold of PTS ≥ 0.90 to identify putatively identical genomes within the margin of error of detection of all DBLα types in an isolate. We confirmed this definition by comparing varcodes derived from PTS to those derived from posterior mean BPTS estimates (Supplementary Fig. S3), and by examining the lower and upper bounds of the 95% highest density posterior intervals (HDPIs). This revealed nine genetically distinct varcodes in this study with varcode4, varcode5, varcode9 seen only once and at the other extreme 36 isolates had varcode1 (Fig. 3a and Supplementary Fig. S4a). We were unable to confirm whether any of the Ecuadorian varcodes were circulating pre-outbreak as samples were not available. As expected, the outbreak varcode1 was clonal (indicated by HDPIs that included 1; Fig. S5d), as previously demonstrated by microsatellite genotyping8. It is worth noting that even though we only sampled 30 P. falciparum isolates in Esmeraldas City from 140 (21%) outbreak cases in 2013, the fact that the disease outbreak was clonal and represented 91% of the 154 total cases on the northwest coast and Amazon region during 2013, means that we characterized a representative sample of the cases from the outbreak and during that year.
For each pairwise comparison, we calculated the HDPI width as an estimate of uncertainty (analogous to calculating the width of a frequentist confidence interval) and found there was high confidence (HDPI width ≤0.2) for 94.6% (632/668) of pairwise BPTS estimates (Supplementary Fig. S4b). Low confidence pairwise BPTS estimates (HDPI width>0.2; max = 0.33) were found only between the two varcode1 isolates with the smallest repertoire sizes (11 and 19 DBLα types; 36/36 comparisons). This lower statistical confidence follows naturally from under sampling of those isolates’ DBLα types. After aggregating the within-varcode pairwise comparisons, there was high confidence in the definition of all varcodes (Supplementary Fig. S5a-b).
For comparison of relatedness by microsatellite genotyping, we calculated Pairwise Allele Sharing (PAS) using previously published microsatellite data for the same isolates8, 50. Not surprisingly, 7 microsatellites did not resolve the nine varcodes but only identified four, three of which were single parasite isolates, at the threshold of PAS ≥ 0.90 (Supplementary Fig. S6a). Because two of the seven microsatellite markers were fixed in the population8, 50, we also generated the relatedness networks at the threshold of PAS ≥ 0.80, which allowed for an error of detection of 1 microsatellite allele. However, the majority of isolates clustered at this threshold and only three varcodes were resolved (Supplementary Fig. S6b). There were several pairwise comparisons (14%, 72/541 by PTS and 6%, 31/541 by BPTS) where P. falciparum isolates were found to be identical by microsatellites (PAS = 1) but much less related by var (PTS or BPTS ≤ 0.50, Supplementary Fig. S7). Nevertheless, the relationship between PTS and PAS scores was positively correlated (Pearson’s correlation coefficient = 0.757, p < 0.001, Supplementary Fig. S7a). Similarly, the relationship between BPTS and PAS estimates was also positively correlated (Pearson’s correlation coefficient = 0.779, p < 0.001, Supplementary Fig. S7b).
Persistent association of clinical cases with the same varcodes over time and large distances
We next explored whether parasites with the same varcodes caused clinical episodes after the outbreak. Persistent disease transmission over time (Fig. 3b) and large distances (Fig. 3e-f) was found as the same varcodes were identified post outbreak and sometimes in different sampling locations. The median time between first and last identification of the same varcodes with any clinical case was 216 days or approximately 7 months (range = 190 – 823 days) during the study period. For example, the outbreak varcode1 identified in Esmeraldas City was also identified in San Lorenzo, Esmeraldas (∼150km from Esmeraldas city) in 2013, then Cascales, Sucumbios (>300km away) in 2014, and then in Tobar Donoso, Carchi (∼150km away) in 2015 (Fig. 3b). This is in line with an earlier study using microsatellites that showed that P. falciparum isolates from the outbreak were genetically related to an Ecuadorian parasite isolate collected around 19908. This result was also consistent with reports in Peru51 and Colombia18, 52 where identical parasites were shown to be circulating up to five and eight years after first identification, respectively.
Spatiotemporal genetic relatedness networks reveal signatures of recombination
We next reconstructed spatiotemporal genetic relatedness networks to explore signatures of local disease transmission and parasite microevolution after the outbreak. We applied a PTS threshold of ≥0.50 to detect recombinant and/or genetically-related varcodes. By using the date and location where each isolate was collected, we animated the network chronologically to examine spatiotemporal disease transmission dynamics during the outbreak in 2013 and post-outbreak in 2014 and 2015 (Fig. 3c, 3f, GIF). This analysis resolved parasite microevolution relevant to disease transmission after the outbreak and revealed that four varcodes (varcode3, varcode4, varcode6, varcode7) were recombinants of the outbreak varcode1 (Fig. 3c). The remaining varcodes did not cluster in the network and were genetically distinct. These patterns were confirmed using BPTS estimates based on posterior means (Supplementary Fig. S8a) and their corresponding 95% HDPIs (Supplementary Fig. S5c-f and S8b-c). For the varcodes that were recombinants and/or genetically-related (i.e., cluster in the networks in Fig. 3c and Supplementary Fig. S8a), 70.9% (433/611) of the HDPIs were confidently above the threshold of BPTS ≥0.50 (Supplementary Fig. S5d). For varcodes that were genetically-distinct (i.e., do not cluster in the networks), 98.4% (368/374) of the HDPIs were confidently below the threshold of BPTS ≤0.50 (Supplementary Fig. S5f). As expected, the relatedness networks based on the 7 microsatellites did not resolve these patterns since all isolates were connected at the threshold of PAS ≥ 0.50 and genetic relatedness was overestimated (Supplementary Fig. S9a). Indeed, we were not always able to discriminate recombinants from genetically distinct parasites since several pairwise comparisons (25%, 307/1232 by PTS and 16%, 196/1232 by BPTS) that resulted in PAS ≥ 0.50 were found to be less related by var (PTS or BPTS ≤ 0.50) (Supplementary Fig. S7). Furthermore, the majority of recombinants were classified as essentially clonal since they clustered together at the threshold of PAS ≥ 0.80 (allowing for differences in one microsatellite allele, Supplementary Fig. S6b).
We were able to detect that varcode3 was in fact a recombinant of the outbreak varcode1, even though it was sampled in San Lorenzo during the same time period as the ongoing outbreak in 2013 (Fig. 3c, 3e). With the available data we were unable to confirm whether varcode3 was ‘imported’ to San Lorenzo from Esmeraldas City. The most plausible explanation is that the recombination event occurred in San Lorenzo during the outbreak. Parasites with recombinant varcode3 caused clinical episodes in multiple locations in 2014 (Fig. 3e). The other recombinant varcodes (varcode4, varcode6, varcode7) were identified after the outbreak in 2014 or 2015. Whether the recombinant varcodes resulted from sexual recombination events between outbreak varcode1 and genetically distinct parasites that were already circulating at low levels and/or in asymptomatic reservoirs in Ecuador (e.g. 5) or those that were previously imported could not be ascertained from the current study population. Interestingly, all the recombinant varcodes were identified in San Lorenzo indicating this area is a transmission hot spot and a reservoir of parasites with diverse var repertoires (Fig. 3e).
To further visualize the genetic profiles of varcodes and confirm the observed recombination signatures, we constructed a clustered heatmap based on the presence and absence of the 195 DBLα types across all isolates, such that isolates with similar genetic profiles cluster together (Fig. 3d). This analysis confirmed that in the case of isolates identified as recombinants of the outbreak varcode1 (i.e., varcode3, varcode4, varcode6, varcode7), a proportion of outbreak DBLα types as well as different DBLα types were present. This is consistent with the generation of a new varcode through outcrossing between two genomes with different var repertoires by conventional meiosis (i.e., approximately ≥50% sharing of DBLα types). By contrast, the genetic profiles of varcodes 2, 5, 8, and 9 had different DBLα types, leading to the definition as parasites with genetically distinct varcodes. When using 7 microsatellites, these recombination signatures in the genetic profiles were not identified (Supplementary Fig. S9b).
Parasites with recombinant varcodes most frequently caused P. falciparum clinical episodes following the 2012-13 outbreak
Prior to the outbreak there had been a steady decline in clinical cases, however, increased incidence of disease occurred after the outbreak. Therefore, we analyzed trends in the epidemiology of P. falciparum cases that occurred post-outbreak to understand if there were any key risk factors. Of the 25 individuals in our study that had a clinical P. falciparum episode after the outbreak in 2014 or 2015, we had age data for 18 individuals (72%, Table 1). There was no significant difference (Kruskal-Wallis test, p = 0.65) in the median age of individuals experiencing clinical episodes caused by parasites with the outbreak varcode1 (19 years, range = 19 – 57 years, N = 3 patients), a recombinant of varcode1 (i.e., varcodes 3, 4, 6, or 7) (25 years, range = 17 – 58 years, N = 13 patients), or by a different varcode (34.5 years, range = 32 – 37 years, N = 2 patients). A greater diversity of varcodes (1-9 inclusive) was associated with incidence of clinical malaria post-outbreak than during the 2013 outbreak (varcodes 1,2,3). Indeed in 2014, 79% of thecases sampled were caused by either parasites with the outbreak varcode1 (21%) or with any of the four recombinants of varcode1 (58%). The trend was similar in 2015 with 83% of the cases sampled caused by either parasites with the outbreak varcode1 (33%) or with any of the four recombinants of varcode1 (50%), although our sampling of reported cases in 2015 after January was limited. Importantly, overall we found that 80% of the cases we sampled after the outbreak were caused by either parasites with the outbreak varcode1 (24%) or with any of the four recombinants of varcode1 (56%), especially with varcode7. Thus, disease transmission after the outbreak in 2014 and into 2015 was predominantly associated with parasites with recombinant var repertoires of the outbreak clone.
Comparative analyses to South America: elucidating possible origins of Ecuadorian varcodes
We next examined the possible origins of the varcodes circulating locally in Ecuador by comparing to published data28 from 128 P. falciparum isolates collected from neighboring South American P. falciparum populations (Fig. 4a). When we combined the sequences from Ecuadorian isolates with those previously sequenced, we identified 543 unique var DBLα types in South America (compared to 458 in 28). This is largely representative of the var diversity circulating in South American P. falciparum populations, as indicated by sampling accumulation curves approaching saturation (Supplementary Fig. S10). Varcoding resolved a total of 97 varcodes in South America (PTS ≥ 0.90), with those identified in each country ranging from 9 in Ecuador and Venezuela, to 56 varcodes in French Guiana (Supplementary Fig. S11). The number of var DBLα types per isolate repertoire was low in all countries and indicative of only one P. falciparum genome infecting an individual (i.e. repertoire size ≤ 60, Fig. 4b, Supplementary Table S1). We found that the median number of var DBLα types in each isolate ranged from 36.5 in Venezuela to 48.0 in French Guiana and the maximum number of var DBLα types per isolate was 42 or 43 in all countries except French Guiana that appeared to have more multi-genome infections (max = 92). The median repertoire size was in the range of the number of var genes seen in the genome of the Honduran laboratory reference strain HB3 (N=42 excluding var2csa, 32, 33). An unrooted phylogenetic neighbor-joining tree revealed distinct clusters of genetically-related isolates in South America generally clustering by country (Fig. 4c), consistent with previous analyses in the region demonstrating geographic population structure28.
To examine whether any of the parasites with Ecuadorian varcodes were genetically related to other South American parasites, we constructed regional spatiotemporal relatedness networks and applied our threshold of PTS ≥ 0.50 (Fig. 4d). We identified a genetically-related Peruvian P. falciparum isolate that clustered with the outbreak and recombinant varcodes (PTS = 0.66-0.75 with varcode1 isolates). This is consistent with previous analyses using microsatellites showing the outbreak source was possibly a residual parasite lineage circulating in Peru in 1999-2000 and in Ecuador in the early 1990s8, 13. The fact that these isolates did not cluster with anything else suggests the outbreak may have been caused locally due to circulating parasites in unidentified reservoirs. Varcode5, which was sampled in the Amazon had a very different genetic profile to the other varcodes (≥83% unique types, Fig. 3d). It is worth noting that the parasite isolate with varcode5 also had a different genetic background with respect to its microsatellite haplotype as compared to the rest of the Ecuadorian isolates (3 of 7 unique microsatellite alleles). The epidemiological data for the putative infection location was recorded as possibly imported from Peru (EC40, Table 1). However, our analysis would suggest that this varcode is more related to P. falciparum populations from French Guiana and Venezuela. Nonetheless, Peruvian isolates also clustered with P. falciparum isolates from French Guiana even though they did not cluster with varcode5 and we found that 53% of the types identified in varcode5 were also identified in Peruvian parasites (Fig. 4e). We were able to determine a putative importation of a Colombian parasite (varcode9, EC53, Table 1, Fig. 4d). The varcodes 2 and 8 did not cluster with any other South American isolate suggesting parasites with these varcodes may be local to Ecuador. The epidemiological data for the putative infection location for varcode8 was recorded as Colombia (EC49 and EC50, Table 1). Although it did not cluster with Colombian P. falciparum isolates in our network analysis, we found that 67% of the types identified in varcode8 (collapsing all 3 isolates) were also identified in Colombian parasites (Fig. 4e). For varcode2, we found that 44% of the types identified in varcode2 were also identified in Colombia (Fig. 4e), providing evidence that parasites with varcode2 and varcode8 may represent residual parasites previously or recently imported from Colombia.
Conservation of individual DBLα types in space and time
Of significance, we found the same var DBLα types (56% of types identified in Ecuador, N=110/195, Fig. 4f) and varcodes that were genetically related to other South American parasites and sampled up to 13 years before (Fig. 4d-e) were contributing to local transmission in Ecuador. Indeed, conservation of the DBLα types in all varcodes was observed in space and time to varying degrees (Fig. 4e). We also found conservation of DBLα types on a continental scale with the number of the same types identified in any two or three countries being 124 (23%) and 59 (11%), respectively (Fig. 4f). Moreover, even DBLα types that were identified in relatively low frequencies in Ecuador (i.e., identified in a few P. falciparum isolates) were also identified in the South America dataset, often at relatively high frequencies in other countries (Supplementary Fig. S12). These patterns are consistent with some parasite varcodes being putative importations and/or highly-related to parasites in other countries, as described previously.
No evidence of recombination in any DBLα type over the course of the outbreak
The fact that the outbreak varcode1 was clonal provided a unique opportunity to examine the conservation of its 47 individual DBLα types after transmission to multiple human hosts over the course of the outbreak and up to two years after (N=36 infections). DBLα types are very variable, with the average pairwise identity of the sequences encoding this domain being approximately 42%33. The cut-off of 96% sequence identity in our pipeline has been routinely used to identify the same DBLα types within the limits of sequencing errors and PCR artifacts (27, 53, Methods). Our data show that the same DBLα types were conserved in isolates with varcode1 over the course of the outbreak (N=30 infections) and up to two years after the outbreak in the other six cases with varcode1 (Supplementary Fig. S13a). In total, we report 1,283 independent observations of the same 41 DBLα types identified in 36 cases with varcode1 assessed over 2 to 3 years (any type was identified in a median of 34 infections, ranging from 1 to all 36 infections) (Supplementary Fig. S13a). Given our threshold of PTS ≥0.90 to identify putatively identical var DBLα repertoires, only 10 of the 47 DBLα types found in the outbreak clone were not found in the majority of the isolates with varcode1 (median frequency = 1, max = 6, Supplementary Fig. S13a). Four of these were identified in the cases with recombinant varcodes (median frequency = 17.5, range = 6-19), two of which were also identified in the Peruvian isolate that clustered with the outbreak/recombinants in Fig. 4d, suggesting they are also conserved. The other 6/10 were singletons in the entire dataset (including the South America data) and could be a consequence of the PCR with degenerate primers and/or low-quality DNA.
Among the 15 cases with recombinant varcodes (varcodes 3,4,6, and 7), we report 339 independent observations of 39 of the 47 outbreak DBLα types. There were 26-27 outbreak types shared between varcode1 and the recombinant varcodes 3,4, and 7, and 17 types between varcode1 and varcode6 (Supplementary Fig. S13b). These findings highlight the conservation of DBLα types of the outbreak clone in space and time, a pattern further supported by the fact that 31 of 47 outbreak DBLα types were found among previously recorded South American isolates (in 2002 to 200828; see Supplementary Fig. S13c). This evidence points to conservation of these types despite the many transmission events that must have occurred between 2002 and 2013-15 across South America. We found that the highest number of outbreak DBLα types were identified in Peru (N=28, 60%) followed by Colombia (N=14, 30%) and Venezuela (N=8, 17%), and the lowest in French Guiana (N= 2, 4%, Supplementary Fig. S9c).
Discussion
Our investigation of the spatiotemporal incidence of clinical episodes of P. falciparum in Ecuador by varcoding supports the view that disease transmission after the 2012-2013 outbreak was sustained predominantly by Ecuadorian parasites. We observed persistence of the outbreak clonal lineage (identified with the same varcodes) and outcrossing of this lineage with other locally circulating parasites causing clinical cases, rather than recent importation of clonally transmitted parasites. Our results point to the need for resources to be focused locally in Ecuador to uncover the circulating asymptomatic reservoirs of infection identified by varcode recombination events. A role for human mobility must also be considered in the spread of P. falciparum as parasites with the same varcodes were also observed across large distances (∼150-300km).
San Lorenzo was found to be a transmission hotspot likely due to mining activities and occupation-related travel in these areas, as well as its proximity to the Ecuador-Colombia border3, 50, 54–56. These findings point to San Lorenzo for both genetic surveillance and targeted interventions. Importantly, our results provide strong evidence for ongoing transmission in Ecuador and provide the first baseline characterization of P. falciparum antigenic diversity and parasite varcodes circulating in the northwest coast and Amazon regions of Ecuador. It is worth noting that several recent outbreaks have occurred in the same areas of Ecuador (in 201610, 201857, 20196 and 202058) with malaria transmission remaining unstable rather than endemic.
We demonstrate that 58% of clinical cases sampled in 2014 immediately after the outbreak in Ecuador were caused by parasites with recombinant varcodes. The same trend was observed in 2015, although our sampling was limited relative to the number of reported cases for the months sampled. We ask is this a chance finding or a consequence of immune evasion? We hypothesize that these parasites with recombinant varcodes have new combinations of var DBLα types that may provide an immunological advantage in a population with variable levels of immunity, as would be expected during elimination campaigns59.
Network analyses and computational models combining evolution and epidemiology point to variant-specific immune selection defining var population structure60, 61. In scenarios where transmission is low and there is limited var diversity, the chance of recombination is also relatively low and thus immune selection is weaker. Selection in the context of an E-2020 country like Ecuador is likely variable due to the recent transition from moderate to epidemic transmission over the last 15 years1 with declining population immunity as a consequence of approaching elimination. Low population immunity would select for circulating clones leading to high similarity between var repertoires. Here, the signatures of recently recombined genomes with the outbreak clone that we detect in four varcodes were clearly associated with the continued incidence of clinical disease after the outbreak. Thus parasites with recombinant varcodes may present a risk factor for disease. We hypothesize that they may be better able to evade variant-specific immunity to PfEMP1 variants expressed by the outbreak clone by sharing 50% or less of the DBLα types of the outbreak clone. The availability of var DBLa sequences provides the potential to test this hypothesis by serological methods that detect variant-specific immunity. In this context, inferences based on neutral markers do not give information about immune evasion genes and variant-specific immunity, whereas varcodes identify circulating variants to which individuals can become immune. e.g. as was done in serologic studies in Papua New Guinea62 and the Brazilian Amazon63.
A previous study that included some of the same P. falciparum isolates from Esmeraldas City and San Lorenzo in the northwest coast described three main genetic clusters based on microsatellite genotyping50. In contrast, when considering the same isolates, varcoding resolved six varcodes circulating in the northwest coast. In this study, as expected comparative analyses to “gold-standard” microsatellite genotyping demonstrated that varcoding provided higher discriminatory resolution to detect fine-scale genetic variation and describe parasite microevolution related to disease transmission in epidemiological time. These findings were in accordance with the rapidly evolving var multigene family due to immune selection30, 60, 64, the overall number of microsatellite loci examined and the fact that two loci were fixed in the population8, 50 decreasing the denominator of polymorphic microsatellite markers from seven to five. Similar observations have been described in an earlier study in Venezuela where sympatric parasites identical at neutral microsatellite loci were shown to have very different var repertoires65. These South American genetic epidemiology studies can be compared to those of a peri-urban area in Senegal where reduced transmission has resulted from intense vector and antimalarial control. In Senegal, highly related genotypic clusters of P. falciparum isolates were resolved by both var and use of a 24-SNP barcode as neutral loci rather than microsatellite loci66. Genotyping var loci under immune selection in conjunction with neutral loci proves to be informative to validate clonality and avoids the need for whole genome sequencing as a potential rapid response to an outbreak investigation. Even in areas where WGS is routinely conducted, varcoding has the potential to identify relevant samples for downstream WGS as part of a larger and complementary genomic surveillance framework.
In contrast to the microevolution of parasite varcodes, we documented conservation of individual var DBLα types over space and time. This is similar to Africa27, 29 where we see highly variable but many conserved DBLα types yet rapidly evolving var repertoires or varcodes. Here we observed conservation of the outbreak varcode1 clone DBLα types after transmission within and between multiple human hosts on the scale of months to two years in Ecuador (1,283 independent observations of the same types when examining 36 cases with varcode1). Moreover, 110 of the 195 (56.4%) Ecuadorian DBLα types were identified across South America over 5 to 13 years (2,130 independent observations of the Ecuadorian DBLα types in 128 different infections from 28). These observations are consistent with balancing selection67 and in line with other studies demonstrating high levels of var DBLα sequence conservation on the time scale of up to 23 years in Brazil34, 68. Another independent example of conservation of DBLα sequences over time is found in a peri-urban area of Senegal over 7 years66. This pattern of conservation of var DBLα types observed in diverse sites confirms the potential of this genetic marker for malaria surveillance.
Of note, the observed conservation of DBLα types in natural infections in humans in Africa27, 29, 69 and South America would not be predicted by data from in vitro passage of long-term cloned lines of P. falciparum showing high rates of mitotic recombination within var genes70–72. Indeed, Claessens et al 71 reported a very high rate of 2-8 x10-3 mitotic recombinants created per erythrocytic cycle for three of four long-term in vitro cultured lines, “with DBLα domains showing the most recombinations”. The clone tree of HB3, selected for passage through mosquitos, did not show any recombinants suggesting that the high rates observed may relate to passage in vitro. While we identified conservation rather than diversification of var DBLα types in space and time, it is possible that mitotic recombinants are generated on the observed time scale but not transmitted. Such is the case in HIV (e.g. reviewed in 73). Further exploration of within and between host diversification of var genes in natural infections requires a more detailed genomic approach analyzing recently adapted parasite lines.
This is the first case study demonstrating the translational application of the varcode for outbreak surveillance in epidemic/unstable malaria transmission such as E-2020 settings. Specifically, data presented here have shown that varcoding was able to resolve the population genetics of sustained disease transmission after an outbreak in Ecuador. We conclude that varcoding requiring a single PCR with degenerate primers has great potential as a simple, cost effective and high-resolution approach to examine P. falciparum antigenic diversity in relation to population immunity as well as genome diversification by recombination. This will prove particularly useful in the context of changing patterns of human mobility and gene flow in the Americas where there is high demand for such a diagnostic surveillance method. The tool will also be useful in other epidemic or low transmission settings targeting elimination across the globe.
Here we show that var repertoire evolution via sexual recombination was associated with sustained transmission resulting in clinical infections. This is in line with our previous predictions of epidemic disease transmission in South America with parasites with novel var repertoires, or varcodes28, and serves as a warning of what may come in terms of monitoring the complexity of malaria resurgence in Latin America with both importation and microevolution of P. falciparum. Surveillance with neutral markers alone cannot investigate this potential risk related to recombination of variant antigen gene repertoires. The widespread migration of Venezuelans has already led to massive increases in P. falciparum cases in Brazil and Colombia74, as well as in the Peru-Ecuador border3. Such migration points to the possibility of importation of diverse parasites with antigenically novel var genes as we predicted 28, and is now also worrisome in the context of increased transmission of malaria in tropical regions due to the diversion of healthcare resources to the COVID-19 pandemic, as recently warned by the WHO75. Going the distance to elimination must be supported by appropriate molecular surveillance.
Methods
Ethics statement
The P. falciparum isolates collected from individuals of all ages presenting with uncomplicated malaria cases were obtained from the malaria surveillance protocol approved by the Ethical Review Committee of Pontificia Universidad Católica del Ecuador. Ethical approval for this study was obtained from Pontificia Universidad Católica del Ecuador (Quito, Ecuador) and University of Melbourne (Melbourne, Australia). Written informed consent was provided by study participants and/or their legal guardians.
Study design, study area and sample collection
In this molecular epidemiological study, we examined samples that were collected passively from 2013 to 2015 in endemic areas of Ecuador, namely the northwest coast and the Amazon region, as part of ongoing malaria surveillance by the Ecuadorian Ministry of Health and patients showing at the local clinics. During 2013, the majority of samples were collected from an ongoing outbreak that occurred in Esmeraldas city, Esmeraldas province. Details of the sample collection are described in Sáenz et al8. In 2014 and 2015, the “post-outbreak” samples were mostly collected from across the northwest coast with few from the Amazon region, reflective of the epidemiology of clinical P. falciparum cases in Ecuador (Northwest coast: 20 and 173 cases; Amazon: 11 and 16 cases in 2014 and 2015, respectively). Details of the sample collection are described in Vera-Arias et al 50. It is worth noting that although cases were reported throughout 2015, samples were not available for many of them due to logistical challenges. Briefly, subjects who were diagnosed with falciparum malaria either by light microscopy or a rapid diagnostic test were asked if they wanted to participate in the study and signed an informed consent form. The participants had to: 1) live in the areas where the samples were taken, 2) be over 2 years old, 3) agree to participate, 4) sign an informed consent form, 5) give a blood sample, 6) be able to understand the informed consent. From each participant a blood sample (either venous blood or dried blood spot) was collected as well as their responses to a basic demographic questionnaire (including places recently travelled and their address). The positivity of all samples was confirmed in the laboratory by microscopy and PET-PCR 76.
DNA extraction
Genomic DNA was extracted from venous blood or dried blood spot samples using a QIAamp DNA MINI KIT (QIAGEN, Germantown, MD, USA) using the protocol recommended by the manufacturer.
Microsatellite genotyping
The same P. falciparum isolates in this study were previously genotyped for seven putatively neutral microsatellite markers (TA1, POLYα, PfPK2, TA109, 2490, C2M34, C3M39) as described in8, 50. These previously published microsatellite genotyping data8, 48, 75 were used in the present analyses.
Var genotyping and related data processing
For var genotyping, the DBLα domains of antigen-encoding var genes were amplified and sequenced on a MiSeq 2×300bp paired-end Illumina platform as described in 29, 60. We obtained 6,929,470 raw sequence reads, which we cleaned and processed using the DBLaCleaner pipeline (60, http://github.com/Unimelb-Day-Lab/DBLaCleaner). Our customized bioinformatic pipeline has been described in detail in60. Briefly, we de-multiplexed and merged the reads as well as removed low-quality reads and chimeras using several filtering parameters (see Supplementary Fig. S1 for details). This resulted in 2,141 cleaned DBLα sequences (Supplementary Fig. S1). To identify distinct or unique DBLα types (i.e., unique genetic variants), we clustered the DBLα sequences from Ecuadorian P. falciparum isolates with 5,699 previously published28 DBLα sequences from other South American countries (Colombia (N=21 isolates), French Guiana (N=76 isolates), Peru (N=21 isolates), and Venezuela (N=10 isolates) at the standard 96% sequence identity27 using the clusterDBLalpha pipeline (http://github.com/Unimelb-Day-Lab/clusterDBLalpha). All the cleaned DBLα sequences in this study have been submitted to the DDBJ/ENA/GenBank (BioProject Number: PRJNA642683).
We further curated our dataset by translating the DBLα types into amino acid sequences using the classifyDBLalpha pipeline (http://github.com/Unimelb-Day-Lab/classifyDBLalpha) and removing any DBLα types that were non-translatable (N=4). In addition, any P. falciparum isolate with low sequencing quality (< 10 DBLα types) was removed from the analysis. Varcoding was completed for a total of 70 P. falciparum Ecuadorian isolates, however, 12 P. falciparum isolates with low DNA quality and/or poor sequencing quality were removed and we obtained var DBLα data for 58 isolates (82.9%). A total of 543 unique DBLα types identified in the 186 South American P. falciparum isolates (N = 58 Ecuadorian P. falciparum isolates from this study, N = 128 South American P. falciparum isolates previously published in 28) were used for subsequent var analyses at the regional-level, and only the 195 unique DBLα types identified in Ecuador were used for Ecuador-specific analyses.
A further validation of our sequencing and genotyping methodology was undertaken by analyzing the isolate genomic proportions of DBLα types classified as upsA and non-upsA using our classifyDBLalpha pipeline29 and are described in Supplementary Text 1.
Genetic relatedness analyses
Measuring pairwise type or allele sharing
To estimate genetic relatedness among isolates, we calculated the similarity index Pairwise Type Sharing (PTS) 26, as adapted by He et al60 to account for differences in var DBLα sampling across isolates in the case of var, and Pairwise Allele Sharing (PAS)77 in the case of microsatellites and constructed similarity matrices. Because P. falciparum is haploid we refer to var repertoires and microsatellite haplotypes in the same manner, i.e. a var repertoire of DBLα types or a multilocus haplotype of microsatellite alleles transmitted together. Every parasite isolate was compared to every other parasite isolate in the population to determine the proportion of shared loci (i.e. var types or microsatellites), but only “retrospective” pairwise comparisons were analyzed as a conservative approach to account for time order for the Ecuador-specific analyses. In other words, since we were interested in using the outbreak as a baseline this approach allowed us to look at genetic relatedness of an isolate with everything else circulating before its identification. For the comparative analyses with South American data we included all pairwise comparisons and did not correct for time order due to the differences in sampling time points (five to thirteen years before the Ecuadorian isolates were collected).
Measuring Bayesian pairwise type sharing
Genetic relatedness (or repertoire overlap) can also be explored more rigorously with unbiased Bayesian pairwise type sharing (BPTS, Johnson et al in preparation), which accounts for uncertainty in PTS estimates due to differences in isolate repertoire size. This approach uses Bayesian inference methods, which estimate repertoire overlap and uncertainty78, and uses them in a subsequent PTS calculation, carrying that uncertainty forward. The prior distribution for repertoire size, used in inference, was informed by observations as follows. First, the median observed repertoire size in Ecuadorian isolates was 37 types, ranging from 11 to 43 (Table S1). Second, the number of expected var genes with DBLα domains from whole genome sequencing data of the Honduran laboratory reference strain HB3 was 42 33 and 50 var genes based on long-read PacBio sequencing of HB379. And third, based on our sequencing data of 37 technical replicates of HB3, the median repertoire size or number of DBLα types per isolate was 39 (range 36-41 types), with 40 types consistently identified in the majority of replicates (range 21-37 replicates) (Supplementary Text 1). We therefore used a uniform prior on repertoire sizes between 40 and 50 types, combined with the general Bayesian repertoire overlap framework78 to produce unbiased estimates (posterior means). These were used to confirm our PTS estimates. As expected, the PTS and BPTS estimates were positively correlated (Pearson’s correlation coefficient = 0.919, p < 0.001, Supplementary Fig. S3). To measure uncertainty in central estimates, we computed a 95% highest density posterior interval (HDPI), a Bayesian version of confidence intervals, for each pairwise estimate. Like a frequentist confidence interval, the width of the HDPI provides a measure of uncertainty of each pairwise comparison. All posteriors were sampled using MCMC.
Interpretation of genetic relatedness measures
Since P. falciparum undergoes sexual recombination (i.e. meiosis) in the mosquito, conventional genetic interpretations can be applied to PTS and PAS. Therefore, pairwise isolate comparisons resulting in a PTS or PAS of 0 (0% shared loci), 0.5 (50% shared loci), and 1 (100% shared loci) would indicate genetically distinct isolates, recombinant isolates, and clones or genetically-identical isolates, respectively. We applied this approach to define “varcodes” as groups of isolates sharing ≥90% of their DBLα types (PTS ≥ 0.90) to identify putatively identical genomes within the margin of error of detection of 1-5 DBLα types in an isolate (i.e., within the margin of error of detection of all DBLα sequences in an isolate using degenerate primers for var DBLα PCR). We confirmed our interpretations of varcodes, recombinants and genetically distinct isolates by comparing to unbiased BPTS estimates (posterior means) and examining the lower and upper bounds of the 95% HDPIs as a measure of the statistical uncertainty of each pairwise comparison.
In the case of microsatellite genotyping, we applied a threshold of PAS ≥ 0.90 (sharing ≥90% of their microsatellite alleles) to define clones and also applied a threshold of PAS ≥ 0.80 to identify putatively identical genomes within the margin of error of detection of 1 microsatellite allele in an isolate (for comparative purposes to the threshold for var, Supplementary Fig. S2). It is important to note that 2 out of the 7 microsatellite markers genotyped were fixed in the population8.
Visualization of genetic relatedness networks
To visualize the genetic relatedness among isolates as determined by PTS or PAS, we constructed networks using the R packages ggraph80 and tidygraph81 where isolates are depicted as nodes and edges as the PTS or PAS values at a given threshold. The R package ggspatial82 was used to plot spatial networks using latitude/longitude coordinates for sampling location. To visualize genetic relatedness of parasites over time we used the R package gganimate83 to construct spatiotemporal relatedness networks. We generated a clustered heatmap based on the presence/absence matrix of DBLα types to visualize the genetic profiles of each isolate in Ecuador and each country in South America using the R package pheatmap84 and the “complete” clustering method. Unrooted neighbor-joining phylogenetic trees based on pairwise genetic distance (calculated as 1-PTS) were constructed using the R packages ape85 and ggtree86, 87.
Statistical analysis
We used R version 3.5.2 for all analyses88. The package dplyr89 was used for further data curating. We used base R and the R package epiR90 to perform chi-squared tests for univariate analyses of categorical variables to compare proportions and for non-parametric tests to compare distributions of continuous variables between two groups (Mann-Whitney U test) or among k groups (Kruskal-Wallis test) with a Bonferroni correction for multiple comparisons. To evaluate how well we sampled the true pool of var DBLα diversity (i.e. the true number of genetic variants circulating) in Ecuador and in South America, we used the R package vegan91 to generate species accumulation curves.
Data availability
The cleaned DBLα sequences generated in this study from Ecuadorian P. falciparum isolates have been submitted to the DDBJ/ENA/GenBank (BioProject Number: PRJNA642683). The python code for the DBLαCleaner pipeline is available on GitHub at https://github.com/Unimelb-Day-Lab/DBLaCleaner. The python code for the clusterDBLalpha pipeline is available on GitHub at https://github.com/Unimelb-Day-Lab/DBLalpha. The python code for the classifyDBLalpha pipeline is available on GitHub at https://github.com/Unimelb-Day-Lab/classifyDBLalpha. All other deidentified data and analysis code are available on the open-source GitHub repository at https://github.com/shaziaruybal/varcode-ecuador.
Data Availability
The cleaned DBLα sequences generated in this study from Ecuadorian P. falciparum isolates have been submitted to the DDBJ/ENA/GenBank (BioProject Number: PRJNA642683). The python code for the DBLαCleaner pipeline is available on GitHub at https://github.com/Unimelb-Day-Lab/DBLaCleaner. The python code for the clusterDBLalpha pipeline is available on GitHub at https://github.com/Unimelb-Day-Lab/DBLalpha. The python code for the classifyDBLalpha pipeline is available on GitHub at https://github.com/Unimelb-Day-Lab/classifyDBLalpha. All other deidentified data and analysis code are available on the open-source GitHub repository at https://github.com/shaziaruybal/varcode-ecuador.
Author contributions
S.R-P. and K.P.D. conceived and designed the study; F.E.S. and C.A.V-A. carried out field work to obtain P. falciparum isolates and epidemiological metadata; S.R-P. and K.E.T. developed the varcoding methodology; S.R-P. and S.D. performed the molecular experiments and sequencing; S.R-P., F.E.S. and C.A.V-A. curated the clinical and epidemiological metadata; S.R-P. performed the formal data analysis and visualization; E.K.J. and D.B.L. designed and conducted the Bayesian statistical analysis; S.R-P., F.E.S., K.E.T. and K.P.D. contributed to the interpretation of the data; K.P.D. supervised the work; S.R-P., F.E.S. and K.P.D. acquired funding; S.R-P. and K.P.D. wrote the paper with contributions from all authors.
Supplementary Information
Supplementary Figures
Supplementary Tables
Supplementary Text 1
As described in the Methods, the degenerate primers we use amplify the DBLα domain of var genes. We performed an additional validation of our PCR (using these degenerate DBLα primers) and sequencing methodology to understand the margin of error of detection of all DBLα types in an isolate. DBLα types were translated into amino acid sequences and classified as upsA or non-upsA, using the classifyDBLalpha pipeline (Ruybal-Pesántez et al. 2017) to examine whether the expected genomic proportions of upsA/non-upsA were obtained in each isolate.
Given our study was conducted in South America, we used the Honduran laboratory reference strain HB3 as a benchmark for the expected number of total DBLα types (i.e. repertoire size), as well as the proportion of upsA/non-upsA. Whole genome sequencing of HB3 has identified 44 var genes, with 8 upsA DBLα types (defined by DBLα domain 1), 34 non-upsA DBLα types (DBLα domains 0 and 2) and 2 upsE (var2csa genes, defined by DBLpam domains and do not have DBLα domains) (Rask et al. 2010). We independently verified this using 37 technical replicates of var DBLα PCR amplification and illumina sequencing of our HB3 laboratory isolate. It is worth noting that since HB3 has two var2csa genes that do not have DBLα domains, these will not be amplified by our degenerate primers, so we expect that only 42 of the var genes of HB3 could be amplified.
HB3 technical replicates
From the data obtained from 37 HB3 technical replicates, we identified the expected repertoire sizes with a median of 39 DBLα types (range: 36-41). A median of 7 upsA DBLα types (range: 6-8) and a median of 33 non-upsA DBLα types (range: 30-34) were identified (Figure 1). The median genomic proportion of upsA DBLα types was 17.5% (range: 15.4-19.5%) and 82.5% (range: 80.5-84.6%) for non-upsA DBLα types (Figure 2). We found that of the 46 identified in the 37 technical replicates, 40 of them were consistently identified in the majority of replicate isolates (range 21 to 37 replicates). Of these 40 types, 7 were ups-A and 33 were non-upsA. All of these findings are in line with what is expected from whole genome sequencing data (Rask et al. 2010).
Field isolate genomic proportions of upsA and non-upsA DBLα types
The 543 unique DBLα types identified in the 186 South American P. falciparum isolates (N = 58 Ecuadorian P. falciparum isolates from this study, N = 128 previously published P. falciparum isolates from Colombia, French Guiana, Peru and Venezuela) were translated into amino acid sequences and classified as upsA or non-upsA, using the classifyDBLalpha pipeline (Ruybal-Pesántez et al. 2017). There were 79 upsA and 464 non-upsA types.
Looking first at the 195 types (26 upsA and 169 non-upsA) identified in Ecuadorian isolates, we obtained a median genomic proportion of upsA of 10.8% (range: 5.1-18.2%) and 89.2% (range: 81.8-94.9%) of non-upsA types for all isolate var codes (Figure 3).
To confirm the patterns we observed in Ecuadorian isolates, we also compared them to the other South American isolates. In the other South American isolate var codes the genomic proportions of upsA/non-upsA were similar, with a median proportion of upsA of 9-14% and 86-91% for non-upsA types (Table 1).
With regards to the number of upsA identified in all the South American isolate var codes, we identified a median of 4-5 upsA types and 31-42.5 non-upsA types (Table 2).
Inheritance of DBLα types in the recombinant varcodes
For the outbreak var code1, its 47 DBLα types were classified as 5 upsA and 42 non-upsA. In Figure 5 we look specifically at the “inheritance” of upsA (blues) vs non-upsA (greens) types from the parental outbreak varcode1 in the case of the parasites with recombinant var codes (var codes3,4,6,7). This provides a proxy to examine inheritance of types with regards to their chromosomal location. The proportion of the outbreak types that were inherited in the recombinant parasite var codes is indicated in the darker shades of blue or green, showing that the proportion of inherited types varied both by var code and upsA/non-upsA. Overall, the DBLα type sharing patterns in parasites with recombinant varcodes are consistent with inheritance of 50% types, with a higher proportion of non-upsA inherited types (∼40-70%, i.e. 17 to 30 of the 42 types) vs upsA (∼20-50%, i.e. 1 to 3 of the 5 upsA types). The exception was var code7 where 60-80% of upsA were inherited, i.e. 3-4 of the 5 upsA types. The lighter shades correspond to those types that were not inherited from the outbreak clone but from the other parent.
Acknowledgements
The authors would like to thank the patients who contributed samples and the field teams who were involved in sample collections. We thank the Ecuadorian Ministry of Health, especially Luis E Castro, Julio Valencia and Javier Gomez Obando. We appreciate the support of the Australian Genome Research Facility for illumina sequencing of the samples. This research was financially supported by the Pontificia Universidad Católica del Ecuador (grant numbers: L13058, L13248, M13416, N13416, O13087[QINV0084] to F.E.S), the Fogarty International Center at the National Institutes of Health [Program on the Ecology and Evolution of Infectious Diseases (EEID), Grant number: R01-TW009670 to K.P.D.], and the National Institutes of Allergy and Infectious Diseases, National Institutes of Health (grant number: R01-AI084156 to K.P.D.). S.R-P. was supported by a Melbourne International Engagement Award from the University of Melbourne and gratefully acknowledges the J.D. Smyth Travel Award from the Australian Society for Parasitology that enabled her to travel to Ecuador to establish this collaborative project.
References
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- 12.↵
- 13.↵
- 14.
- 15.
- 16.
- 17.
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.
- 40.
- 41.
- 42.
- 43.
- 44.
- 45.
- 46.
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.↵
- 55.
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵
- 85.↵
- 86.↵
- 87.↵
- 88.↵
- 89.↵
- 90.↵
- 91.↵