Abstract
We enrolled arriving international air travelers in SARS-CoV-2 genomic surveillance, using molecular testing of pooled nasal swabs, and sequencing positive samples for viral lineage. Traveler-based genomic surveillance provided early warning variant detection; we reported the first U.S. Omicron BA.2 and first BA.3 in North America, weeks before next reported detection.
Background
Despite layered mitigation measures, international travel during the COVID-19 pandemic continues to facilitate global spread of SARS-CoV-2, including novel variants of concern (VOCs). On November 26, 2021, B.1.1.529 (Omicron) was designated as a VOC by the World Health Organization [1]. On December 6, 2021, as part of measures to reduce the introduction and spread of Omicron, the requirement for a negative SARS-CoV-2 test taken before air travel to the United States was shortened from three days to one day pre-departure [1]. Although SARS-CoV-2 genomic sequencing has increased significantly during the pandemic [2], there is still a gap in early detection of emerging variants among arriving travelers.
In September 2021, the Centers for Disease Control and Prevention (CDC), in collaboration with private partners, implemented a voluntary SARS-CoV-2 genomic surveillance pilot program.
Initially we enrolled arriving air travelers from India. On November 28, we expanded the program to include travelers arriving from countries with high travel volumes, including those where Omicron was first detected.
Methods
Design, Setting, and Participants
During September 29–November 27, 2021, the surveillance program included travelers arriving on seven direct flights from India at three international airports: John F. Kennedy, New York (September 29), Newark Liberty, New Jersey (October 4), and San Francisco, California (October 12). During November 28–January 23, Hartsfield-Jackson Atlanta International Airport, Georgia was added, and participation was offered to travelers from South Africa, Nigeria, the United Kingdom, France, Germany, and Brazil, arriving on approximately 50 flights per day. Participants were 18 years or older, provided informed consent, and completed demographic, clinical, and travel history questions.
Sample Collection
Participants could opt-in for either, in-airport dry nasal swab sample self-collection, at-home saliva sample collection 3-5 days after arrival, or both (Supplementary Figure 1). For in-airport sampling, travelers were asked to self-collect a lower nasal swab sample. Samples were placed in pooled collection tubes with 5–25 other samples, with size adjusted based on flight volume, and shipped to the Concentric Laboratory Network in viral transport media. During September 29–November 27, pooled samples were grouped together based on arriving flight. During November 28-January 23, pooled samples were grouped by country of origin. For the at-home kit, travelers were asked to collect a saliva sample on day 3–5 after arrival and send it to the laboratory.
Laboratory Testing
All samples underwent SARS-CoV-2 reverse transcription polymerase chain reaction (RT-PCR). After November 27, samples were tested for S-gene target failure (SGTF) using TaqPath COVID-19 assay [3]. All positives underwent whole genome sequencing and variant lineage determination. Reverse transcribed RNA was amplified using the ARTICv3 protocol [4]. Amplicons were pooled and prepared using standard protocols. For Illumina sequencing, samples underwent tagmentation and were sequenced on NovaSeq 6000 (2×50 bp; Illumina). For rapid lineage identification, a ligation-based library was prepared and sequenced on GridION (Oxford Nanopore) as described in supplemental methods.
Reporting
Travelers participating in pooled testing were advised to submit their at-home kit for individual testing. Individual test results were reported to participants via a secure digital portal and to public health authorities per CDC reporting guidelines [5]. Sequencing data were provided to CDC for viral culture and characterization and uploaded to the Global Initiative on Sharing Avian Influenza Data (GISAID) public database.
Statistical Analysis
For this analysis we focused on the effectiveness of pooled testing for early variant detection and thus included pooled testing results only. Using Fisher’s exact tests, we assessed differences in pooled positivity rates across two collection periods (September 29–November 27 and November 28–January 23) and by flight country of origin. Statistical analysis was conducted in R 4.0.3. This activity was reviewed by CDC and was conducted consistent with applicable federal law and CDC policy.1
Results
During September 29, 2021–January 23, 2022, we enrolled 16,149 (∼10%) of an estimated 161,000 eligible travelers, yielding 1,454 sample pools. Overall, 221 (16%) of 1,367 pooled samples (average pool size 11 swabs) tested were SARS-CoV-2-positive. The median turnaround time from sample collection to sequencing was 11 business days (range, 5 - 20). For select samples, we performed expedited sequencing within 48 hours to confirm validity of SGTF as an early indicator for Omicron. Positivity increased significantly from 1.8% (6/338) during September 29–November 27 to 20.9% (215/1029) after November 27 (p<0.001). Percent positivity of pooled samples varied significantly by country of flight origin, ranging from 43.5% (40/92) in South Africa, to 9.3% (68/733) in India (p < 0.001) (Supplementary Table 1). Before November 28, all 6 lineages were Delta (B.1.617-like), apart from one undetermined lineage. During November 28–January 23, 67% (145/215) of positive pooled samples collected were Omicron (B.1.1.529-like), 5% (11/215) were Delta (B.1.617-like), and the remaining 27% (59/215) lineages could not be determined due to low sample sequencing coverage (low number of sample nucleotide bases aligned to a reference genome) (Figure 1a and Supplementary Table 2). Of 145 Omicron sequences, 112 exhibited complete or partial SGTF; BA.2 sub-lineages did not. Omicron sub-lineages included BA.1 (100), BA1.1 (12), BA.2 (26), and BA.3 (1), BA.2 + Orf1a:M85 (1), and BA.2 + S:R346K (1). Four samples were identified as Omicron, but sublineage could not be determined due to low sequencing coverage. A sample collected on December 14 was the first reported BA.2 in the United States, 7 days earlier than any other U.S. report (Figure 1b). Similarly, a sample collected on December 3 was the first reported BA.3 in North America, 43 days before the next report [6].
Discussion
This traveler-based SARS-CoV-2 genomic surveillance program was able to identify importation of variants, including Omicron sub-lineages BA.2 and BA.3 at 7 and 43 days before they were reported elsewhere in the United States and North America, respectively. Overall, 16% of pooled tests were positive, with 21% positivity following Omicron emergence. We detected an increased number of positive post-arrival pooled samples after December 6, when air passengers were required to have a negative sample collected within one day before flight departure.
Possible reasons for high pooled test-positivity on arrival despite negative pre-departure testing include timing of infection (i.e., before infection can be detected by available testing), use of certain testing modalities with lower sensitivity [7], or infection soon after pre-departure sample collection [8, 9]. If passengers had infections that were not yet detected in pre-departure testing, longer flight times may have led to detection of more positive tests on arrival [9]. Many of our positive pools were from flights over 16 hours long. Finally, it is possible that fraudulent test results were presented to meet the pre-departure testing requirement [10].
Pooled testing in this program is advantageous as it enables efficient, large volume sampling and increases testing throughput while conserving resources. This can be valuable for continued detection when prevalence of SARS-CoV-2 infection is low. The pooled testing design minimizes dilution and reduces loss of sensitivity by pooling dry swabs at the point of collection Each Concentric network laboratory is validated to ensure molecular assay sensitivity of 1,500 viral copies/mL. The disadvantage of pooled testing is an inability to directly link test results with individual-level data. Follow-up individual testing, such as the at-home test kits collected in our program (data not presented), provide an additional opportunity to capture linkable meta-data.
With ∼ 10% participation rate, we were able to detect the first Omicron sub-lineages BA.2 and BA.3 not previously reported in the United States or North America, weeks before they were reported by other US and North American sequencing efforts. The country-level proportions of variants that we identified were consistent with those reported by national and global sequencing programs [2]. Our study suggests that when COVID-19 rates are high, as during the Omicron surge, a 10% participation rate would be sufficient to detect relatively rare variant sublineages. As COVID-19 rates decline in the post-Omicron period, the program could shift to population-level genomic surveillance techniques that are not reliant on passenger participation such as wastewater or air sampling from aircraft environments to increase variant detection.
Detection of imported emerging infectious diseases has traditionally focused on travelers presenting to health clinics after symptom onset [12]. COVID-19 presents unique challenges since transmission often occurs before symptom onset or in asymptomatic infected persons [7]. By the time variants are detected there is often widespread community transmission. Many countries have required testing for arriving travelers to limit introduction and spread of SARS-CoV-2 [11] yet few utilize traveler-based viral genomic surveillance to detect new variants and provide more detailed epidemiological data. Earlier detection of new SARS-CoV-2 variants of interest and concern allows researchers and public health officials the needed time to gather information about transmissibility, virulence, and vaccine effectiveness to enable adjustments to treatment and prevention strategies [2].
This traveler-based genomic surveillance program underscores the importance of public-private partnerships in achieving public health priorities in an ever-changing pandemic, and the utility of surveillance tools beyond traditional individual testing. The program’s scalability and adaptability, including the ability to rapidly add locations and expedite sequencing, were key factors for success. Traveler-based SARS-CoV-2 genomic surveillance provides a model of pathogen detection that can be used as an early warning, sentinel system for future outbreaks affecting international travelers or pandemics.
Data Availability
All data produced in the present study are available upon reasonable request to the authors
Funding for this manuscript
This work was supported by the Centers for Disease Control and Prevention contract award number 75D30121C12036.
Potential conflicts of interest
The authors acknowledge no conflicts of interest, relevant financial interests, activities, relationships, and affiliations exist.
Disclaimer
The findings and conclusions of this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention. This activity was conducted in partnership with commercial partners, Xprescheck and Concentric by Ginkgo. The use of products’ or services’ names is for identification purposes and does not mean endorsement by the Centers for Disease Control and Prevention.
Supplementary Data
Supplementary Methods
Enrollment Process
Passengers and flight crew arriving from pre-selected international destinations were invited to participate in the voluntary program at one of four US international airports. During September 29–November 27, flight crew announced the program during selected international flights from India using a provided script and directed passengers to informational brochures. During November 28-January 23, in-flight announcements were not used as pooled samples were grouped by country of origin rather than flight. During both enrollment periods, program information and booths were placed in a strategic location at each airport, in the arrivals area near baggage claim, to engage interested arriving travelers. Guidance for collection of dry nasal swab and saliva self-collection samples were provided in technical instructions [1-2].
We estimated the total number of eligible participants on pre-selected international flights using the following methodology: for a sample week, we reviewed the flight numbers indicated on the program pamphlet by the participants and estimated the total number of eligible passengers assuming ∼200 passengers per flight.
Viral Whole Genome Sequencing
Reverse transcribed RNA was amplified using the ARTICv3 protocol [3]. Amplicons were pooled and prepared for library preparation using standard protocols. For Illumina sequencing, samples underwent tagmentation and were sequenced on NovaSeq 6000 (2×50 bp; Illumina). For rapid lineage identification, a ligation-based library was prepared and sequenced on GridION (Oxford Nanopore). With SARS-CoV-2 genome MN908947.3 as reference, Illumina data was aligned with BWA-MEM and Guppy-called Nanopore data with minimap2 (using ont-map option). Variant Call Format (VCF) files were generated with FreeBayes [4] (Illumina) or medaka-variant (Nanopore) and these VCF files used in both cases to generate consensus sequences with bcftools. Lineages were called from consensus sequences with pangolin v3.1.16 [5] using the sarscov2_lineages workflow on the Terra platform [6]. Variant lineages were not assigned for sequences that did not meet pangolin minimum length of 10,000 bases and <50% N (unassigned) bases. Clade assignments, reference-based QC calls, mutational profiles, and associated visualizations for analyses were produced through the Nextclade webserver v1.8.1 [7]. The presence or absence of minor variant strains was assessed through analysis of alternate allele frequencies in the Illumina data set generated by both FreeBayes (v1.3.1) and iVar (v1.3.1). For mixed samples, lineage deconvolution was performed using two independent workflows: LCS (https://arxiv.org/abs/2110.01117v1) and Freyja (https://www.medrxiv.org/content/10.1101/2021.12.21.21268143v1.full.pdf).
Acknowledgements
The authors would like to thank the entire XpresCheck (Erica Mares, Lesley Shirley, Rob Stein, Henry Streich, Miguel Yapor), Concentric by Ginkgo (Thomas Aichele, Dan Bayley, Juskarun Cheema, Tyler Clarkson, Aakash Desai, Joseph Fridman, Alix Hamilton, Corey Hoehn, Erica Jackson, Hannah Knoll, Frank Langston, John Mcbride, Justin Montgomery, Jason Ng, Rich Nordin, Sarah Rush, Zach Smith, Sativa Turner, Erika Gute, Maria Pis-Lopez, Cherish Weiler), the CDC Division of Global Migration and Quarantine Travelers’ Health (Teresa Smith, Jessica Allen, Laura Leidel, Igor Ristic, Robin Rinker) and Quarantine and Border Health Services Branches (Clive Brown, Tai-Ho Chen, Alida Gertz, Matthew Palo), CDC COVID-19 Response Health Department Liaisons, US Customs and Border Protection, and state and local public health authorities - who supported the operational implementation of the program on the ground in airports.
Footnotes
↵1 See e.g., 45 C.F.R. § 46.102(l)(2); 21 C.F.R. part 56; 42 U.S.C. §241(d); 5 U.S.C. §552a; 44 U.S.C. §3501 et seq
References
Supplemental References
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.