Abstract
Background Nasal pathogen detection sensitivities are often as low as 70% despite advances in molecular diagnostics. It has been suggested that this is linked, in part, to the choice of sampling method.
Methods A diagnostic test accuracy review for sensitivity, using recently developed Cochrane methods for conducting rapid reviews, and the PRISMA protocol was undertaken, with QUADAS-2 risk of bias assessments and meta-analysis of included studies. Sensitivities were calculated by a consensus standard of positivity by either method as the gold standard. Insufficient and/or inaccurate, cross sectional or anatomical site pooling methodologies were excluded.
Results Of 13 included studies, 8 had ‘high’ risk of bias, and 5 had ‘high’ applicability concerns. There were no statistical differences in pooled sensitivities between collection methods for 8 different viruses, and neither with use of PCR, Immunofluorescence nor culture. In a single study, Influenza H1N1 favoured nasopharyngeal swabs, with aspirates having 93.3% of the sensitivity of swabs (p>0.001). Similar equivocal sensitivities were noticed in detecting bacteria.
Conclusions The chain of sampling, from anatomical site to laboratory results, features different potential foci along which sensitivity may be lost. A sufficient body of evidence exists that use of a different sampling method will not yield more respiratory pathogens. The new Cochrane Rapid Reviews guidance helped rapidly answer this relevant and timely clinical question.
Background
Accurate laboratory-confirmed diagnoses aid both timely treatment and surveillance of respiratory infections at global levels, through rapid detection1,2,3. The frustration of false negative results for specific pathogen carriage, experienced by clinicians thus escalating treatment upon clinical suspicion alone, predates SARS-CoV-19, and leads to reliance on repeat tests and imaging4,5. Suboptimal sensitivity has persisted for viruses despite the adoption of Polymerase Chain Reaction (PCR) as the gold standard above viral culture and direct immunofluorescence (DIF), where increased specificity of PCR has in itself decreased sensitivity in comparison to other methods due to false positives no longer being incorporated6. PCR avoids the decreased sensitivities in patients over 5 years of age seen with DIF, and is further advantaged in being able to quantify viral load (qPCR)7,8,9. Higher viral loads present in the early course of a viral infection predominate in the nose than in the throat, and slightly predominate in the nasopharynx than the anterior nasal cavity10,11,12. For bacteria, there is a related but distinct microbiome between the anterior nares and nasopharynx13. The nasopharynx is the uppermost portion of the throat lying at the back of the nasal cavity and accessible horizontally along the nasal floor past occasionally obstructing turbinates and deviations of the septum. A nasopharyngeal swab (NPS) is inserted to a depth equal to the distance from the nostril to the earlobe (nasotragal length, NTL), or until the nasopharynx is felt (a depth of up to 14cm), with less deep swabs >=5cm sampling the middle meatus or anterior nares14,15,16,17. The NTL in children and infants is shorter, but remains considerably longer than the 2cm depth occasionally cited in NPS studies, and is well described in approaches to paediatric intubation18,19. Lack of clarity over such definitions and ensuing heterogeneity has frustrated meta-analysis of this area20. A review of methods for Influenza detection found increased yield when pairing combinations of diverse methods21. Combined oropharyngeal and anterior nasal swabs are shown to be comparable in sensitivity to a single sample of the nasopharynx whilst benefiting from higher patient satisfaction22,23,24. These combined throat/nose swabs have become recommended practice for self-administration of the test25. Paired oropharyngeal/NPSs convey increased sensitivity compared to NPS alone26,27. The swab type used is an important consideration, with greater yield of respiratory epithelial cells and greater patient satisfaction with a flocked swab (akin to a miniature toilet brush) than a rayon-tipped swab (resembling a long cotton “ear” bud) but pathogen detection rate is equivocal28,29. Other important pre-laboratory variables such as pre-impregnation of swabs with transport media, immediate placing in medium following collection or refrigeration of the sample appear to add little to the diagnostic yield30,31. Nasal aspiration (NA) involves the removal of mucous from the nasal cavity with a suction catheter, which is then removed with a mucous trap and subsequently flushed with saline or transport medium. The similar but distinct nasal wash (NW), also called the Naclerio method, is obtained by the drainage without suction of a small volume of saline flushed into the nose32,33. Samples obtained by nose blowing are not widely used, and the high prevalence of Staphylococcus aureus in these samples suggests microbial contamination from more keratinised topographies than just the internal nose34,35.
Whether or not a paired test is used, whether tests are repeated, and whether or not other variables exist, it remains a potent question as to which is the optimal nasopharyngeal sampling method – aspiration, washing or swabbing. In the absence of an objective, categorical gold standard, a dual composite reference standard combining these two imperfect tests can be used to create a “consensus standard” or “either positive” rule, against which to compare sensitivities36,37. Such a reference standard of course overestimates true sensitivity, with even 100% sensitivity representing an optimal fraction achievable by that limited method, but allows a head-to-head comparison of the two techniques with eyes open to this innate underestimation of viral presence38. This approach has the failing of having one-sided false negatives i.e. all the pathogens detected in one sample were also detected in the other, which thus has a potentially misleading sensitivity of 100%. This issue can be addressed through risk of bias assessments. If mild increases in pathogen yield suffice to influence best practice, then any similar small gains by changing collection method type require analysis of available evidence39,40. The current true sensitivity of nasopharyngeal swabbing is estimated at 71-89%41. An increase in sensitivity from 70-90% is enough to decrease by more than half the pre-test probability of a true infection up to which it would be deemed safe to act as if a negative test was indeed so42. This rapid review is designed to compare the available evidence per pathogen, laboratory method and collection method using new streamlined systematic review methods43,44.
Methods
This systematic review was conducted along the Interim Guidance from the Cochrane Rapid Reviews Methods Group for conducting rapid reviews, following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement, and with reference to the Cochrane handbook for diagnostic test accuracy reviews45,46,47. The protocol was registered with the International Prospective Register of Systematic Reviews (PROSPERO) (registration no. CRD42020189577) prior to formal screening of search results against eligibility criteria48. The protocol was developed in consultation with knowledge users in respiratory medicine, paediatrics, microbiology and otolaryngology as well as a focus group with members of the public to promote fitness for purpose.
Inclusion criteria
All studies comparing sensitivity in microbiological sampling for the upper respiratory tract were included in the search. Variations in viruses, bacteria and fungi detected and differing laboratory techniques were included and stratified by these categories in the reported results. The populations included healthy patients, those with respiratory disease (including inoculation of healthy patients), infants populations to adults, and studies in different seasons and climates. These however were not compared, being considered to influence carriage and detection rates, but not influence sensitivity discriminately between collection methods. Studies were limited to those in humans, and those since publication of WHO guidelines for the collection of human specimens for laboratory diagnosis of avian influenza infection (2005)32. Pooled samples such as swabbing of the nose and throat by the same swab or where samples were not paired from the same patient were excluded. Where a consensus standard was not derivable from the data, and this information could not be obtained, these studies were also excluded.
Nasopharynx definition
For the purposes of this review, swabs of the nasopharynx were defined as swabbing to depth of >5cm in adults and >3cm in children or with relevant reference to surface anatomy, by following WHO guidelines, or citing other well documented studies in the methods section. Where this data was lacking, the authors were contacted for clarification, and/or judgments made on inclusion based on use of appropriate anatomical nomenclature and documented training of staff. Broad interpretation of the term “nasopharynx” elsewhere in the literature to include the middle meatus and anterior nares, led to the inclusion of ‘nasopharyngeal’ studies with sparse methodological detail, but graded as ‘high’ for applicability concerns.
Search strategy
Literature search strategies were developed with a life sciences librarian. Cochrane CENTRAL, MEDLINE and Embase were searched, followed by supplemental exploration of reference lists of other reviews for focused optimization of search saturation. In the interests of time, other databases and grey literature were not included, and studies were restricted to those in English. Peer review of search strategy was not undertaken. The search was conducted on 09/06/2020.
Study selection
A title and abstract screening form was piloted using 30 abstracts and adopted without modification to dual screen 20% of abstracts with conflict resolution. Remaining abstracts were screened by one reviewer (MF) and the second reviewer screened all excluded abstracts. A full text screening form was piloted using 5 full text articles with the same process followed as title and abstract screening. Web based review portal Rayyan QCRI was used to streamline the selection process while allowing screening to be conducted in parallel49. Search results are summarized in the PRISMA flow diagram [figure 1] and search terms are accessible via the registered protocol on PROSPERO.
RT-PCR: Reverse Transcriptase Polymerase Chain Reaction
† Studies may fall in more than one category
Data Extraction
The following data were collated onto a Microsoft Excel spreadsheet: Publication year, number of participants, collection type, separate nostrils used, age of population, transport medium, analysis technique, swab type and type and quantity of microbiota detected. Variations in viruses, bacteria and mycobacteria and differing laboratory techniques were included and stratified by these categories in the report. Over and above the two designated methods described by the WHO of aspirates (extraction of fluids by suction catheter) and washes (free drainage without suction of instilled saline into a receiving bowl), there was a third hybrid method, where flushed water was then aspirated32.” For the purposes of this study, aspirates obtained after instilling water into the nose and those aspirated “dry” have been grouped together under “Aspirate”, and washes not involving suction termed “Washes.” Where studies included other collection methods, compared sensitivities from multiple anatomical sites, or compared sensitivities of different laboratory techniques all calculated with against a gold standard from one collection method alone, only data meeting the inclusion criteria above and demonstrating a head to head analysis of NP swabs and nasal aspirates/washes were extracted. In cases where the consensus standard was equal to the sensitivity of one of the sampling methods i.e. there were no false negatives for this collection method, this data was still included but acknowledged as having ‘high’ risk of bias under the reference standard domain.
Data Analysis
Where there were a sufficient number of studies, results formed a proportional meta-analysis, else a narrative was used. Sensitivity analysis and McNemars’ test for paired samples were derived using Medcalc and Scistat online statistical software respectively50,51.
Risk of Bias assessment
The QUADAS-2 risk of bias tool for diagnostic test accuracy reviews was used to grade risk of bias and applicability concerns by one reviewer (MF) with verification by second viewer (JD)52. This was modified to include the signalling questions “Were false negatives two-sided?” under reference standard, and “Were separate nostrils used?” under flow and timing. Risk of bias tables were collated using RevMan53. Studies funded by, or with material help from the company supplying the tests were noted.
Results
The initial search identified 253 articles [figure 1]. 13 were added from searching reference lists of key papers and reviews. After screening titles and abstracts, the full texts of 186 publications were reviewed. Of the 54 of these which met the eligibility criteria, 13 were included in the final review, including three from supplemental searching and one published abstract. No studies detected fungi or mycobacteria. One study was an abstract published within a conference proceedings supplement.
Risk of Bias assessment
Using the QUADAS-2 tool, overall risk of bias was high: 8/13 studies with “high” risk of bias. Applicability concerns were “low” in 8/13 studies [figure 2, 3]. Lack of information on the patient selection process in 11/13 led to “unclear” risk of bias under patient selection. There were “low” applicability concerns due to patient selection, but this was a reflection of the review question including any and all populations, there was considerable homogeneity of age, ethnicity or disease status. The high frequency of false negatives being one sided (8/13), led to high risk of bias under the reference standard domain. Three studies declared material support from companies manufacturing the testing kits.
Heterogeneity
The available literature was limited by methodological heterogeneity of participant age, participant health, laboratory methods and collection methods, even within collection methods given the same name [table 1].
Characteristics of Included studies
Laboratory methods
Methods used to detect pathogen carriage varied across studies. Eleven employed species-specific molecular methods: seven using PCR and six using direct immunofluorescence, including one using both. Five different immunoassay kits were used for immunofluorescence. Three cultured on inoculated Skim-milk-tryptone-glucose-glycerin for bacteria and one used the “R-mix” rapid culture method for viral growth. Where recorded, the cycle threshold for PCR was >40 with the exception of one which was >35. Six used saline to transport the samples, six used viral transport medium, one used Guianidinium Thiocyanate Buffer and one was unspecified.
Virology
PCR analysis of NA and NPS for group A pathogens, i.e those associated with hospitalising illness, included Respiratory Syncytial Virus, Parainfluenza virus, Metapneumovirus and Influenza A+B [figure 4]. Sensitivities as a fraction of consensus standard (positive for either collection method) ranged between 84% and 96% for these pathogens by both collection methods. Similar lack of statistical dissimilarity was found when stratifying for group B viruses Rhinovirus, Adenovirus, Coronavirus*, Enterovirus (not normally associated with severe disease), but the range of sensitivities were greater [figure 5]54,55,56,57,58. When immunofluorescence was utilised for the same, there was no improved sensitivity for NA and NPS, with the exception of Influenza A (H1N1) where swabs had a greater sensitivity (p<0.001)59,57,60. Nasal swabs for influenza was similarly demonstrated to have greater sensitivity for detecting Influenza in one study of 122 participants [figure 6]61. Another study of 89 paired samples, in which the exact nature of nasal washes was not described in detail, found the exact same RSV and Parainfluenzavirus Influenza sensitivity, and a near equivocal influenza swab/wash sensitivity of 100/97.1 (p-value =1)62 Many studies described findings seeming to advantage one collection method or other as a standalone, but this disappeared when pooled with others testing for the same pathogen by the same techniques. Indeed, when combined, the sensitivities of both collection methods for Parainfluenzavirus was the same (84.7%/84.7%).
RSV: Respiratory Syncytial Virus
CI: Confidence Interval
C+: Total consensus positive
No. of Pts.: Number of Patients
RT-PCR: Reverse Transcriptase Polymerase Chain Reaction
CI: Confidence Interval
C+: Total consensus positive
No. of Pts.: Number of Patients
RT-PCR: Reverse Transcriptase Polymerase Chain Reaction
RSV: Respiratory Syncytial Virus
CI: Confidence Interval
C+: Total consensus positive
No. of Pts.: Number of Patients
DIF: Direct Immunofluorescence
Bacteriology
Neither aspirate-wash versus swab for Bordetella pertussis PCR, nor non-typable Haemophilus influenzae in culture, yielded a significant advantage63,64. Collated sensitivities of the Naclerio method vs. NFS for a variety of species in 24 healthy British adults favoured NFS for Neisseria (60.2/100%), Diptherioids (66.7/100%), and Alpha-haemolytic streptococci (18.8/100%, p <0.001), the Naclerio method for Staphylococcus aureus (100/66.7%), and equivocal for Moraxella catarrhallis 33. A similar number of Kenyan infants presenting to hospital with mild illness not requiring hospitalisation, and having a suction catheter passed to the nasopharynx grew strep. pneumoniae in 55 samples. In comparison, 47 (85.0%) of these grew the pathogen on their NFS (p=0.005)65. These high yields may reflect the later adoption of the pneumococcal vaccine in Kenya in 201166.
Discussion
This systematic review found a moderate body of evidence comparing nasopharyngeal swabs with aspirates and washes with no significant difference in sensitivity. These findings were predominantly with PCR, comparing swabs with suction-using aspirates, and covered a range of potential viruses. Beyond these strata data were sparse, particularly for purely wash-based methods, and for detection of bacteria. Statistical significance of higher NP swab sensitivity was high for H1N1, but as this same study found no clear advantage this method for Influenza A nor B, this could be a statistical outlier. Some of the studies included were from an era of DIF and culture, which will have less relevance in future with the predominance of genomic diagnostics. Furthermore, the mechanical understanding at a microscopic level is poorly understood – although swabbing and brushing are more abrasive and likely to access deeper layers of the mucosal barrier. The inherent difference that bacteria and viruses are extra- and intracellular respectively, and the adhesive properties of biofilm also require consideration67.
The abundance of confounding variables can only be accounted for in part by the risk of bias assessments. Variations in transport, time in storage and operators persisted, though not deemed to skew the results. Similarly, the strict inclusion criteria for recording of sampling methodology still allowed for much operator dependent intra-and inter-study variability. While a true 100% sensitivity test is elusive, the approximation of similar sensitivity rates for multiple different method comparisons implies saturation of this diagnostic chain: only on the smaller studies were large differences in sensitivities seen, and these not only disappeared when pooling studies, but comparing with better powered studies. This implies a limiting common denominator to all. On the diagnostic chain from mucosa to lab bench, the step least likely to be controlled is the specific anatomical sampling technique. Single operator sampling under direct vision and controlled conditions have been described elsewhere and are required for confidence in the niche sampled68. Nevertheless the methodological heterogeneity was controlled by only including case matched control studies, with variables more likely to confound overall than favour one or other sampling method.
Ergonomics also merit consideration: the washing method has been described as more comfortable for adults than a nasopharyngeal swab, and in children anterior nasal swabbing results in a significantly lower infant distress score than an aspirate33,23. The perceived and achieved discomfort must also be presumed to affect the thoroughness and accuracy in a linear fashion. In the current pandemic, concern persists around suctioning as an aerosol generating procedure, however swabbing carries its own risk of induced coughing, sneezing or vomiting69,70.
Limitations of current literature
The Cochrane rapid review protocol proved a portable and efficient mode of prompt evidence synthesis for this timely clinical question, purely with the use of open access freeware. This protocol maintains a moderate degree of quality assessment while removing full search saturation and streamlining study selection and data extraction. As rapid reviews are an evolving methodology, it is unclear the extent to which methodological omissions compromise the saturation of these results71.
The clinical application of viral detection is not straightforward. Such techniques detect only pathogen carriage and do not demonstrate fulminant respiratory disease. Variations in viral shedding in the upper respiratory tract include: a shorter time to peak viral concentrations in saliva in SARS-CoV-19 compared to severe acute respiratory syndrome (SARS) (5 days vs 7-10 days), and completed viral shedding of Influenzavirus in adults is only completed around 5-7 days compared to infectivity persisting beyond 10 days in infants72,73,74. The Attributable Fraction, namely the percentage of times a disease is caused by a detected virus ranges from 12% for Rhinovirus to 93% for Respiratory Syncytial Virus75. Thus, even truly reliable results only contribute to a dynamic clinical picture.
Need for better understanding of anatomy of the nose in the literature is also called for. “Nasopharyngeal swabs” are not purely so, as the NP cannot be reached except via contact with the turbinates and septum, thus would be more appropriately named a “pan-nasal” swab. Interdisciplinary collaboration between otolaryngologists, pneumonologists and microbiologists would aid robust education of non-specialists in this regard.
Future Directions
How else might the sensitivity be improved? Moistening of the swab appears to add little sensitivity to dry swabs76,77. Cross sectional audit of health professionals swabbing techniques against WHO guidelines in current pandemic climate could reveal variations in practice and drive quality improvement. A true benchmark for 100% sensitivity will remain elusive, but repeated titres in patients with known disease can give a retrospective estimate of the sensitivity of initial samples.
Future accuracy and resolution in the area of 16S-RNA and whole genome sequencing are vulnerable to conflicting results and applicability concerns from any sampling errors. If there is no clear advantage between collection methods for dichotomous carriage/non-carriage, this still leaves the issue of the best method for describing ecology as a whole. For instance, increased diversity of microbiota are removed from brushing the inferior turbinate compared with nasal washing78. Given the aforementioned niche-specific diversity, it is difficult to assess if such variations denote a different topographical area being sampled or a different constellation of organisms preferentially collected by that method, perhaps by being less mucosally adhesive. Detailed delineation of the correctly sampled microbiome is a prerequisite to the future application of such technologies to diagnostics and therapy. Measurements of overall nasal diversity or key operational taxonomic units under controlled conditions may aid establishment of symptom scores for upper respiratory infections which would bypass the need for accurate sampling in the clinic79,80. These in turn could aid safe de-escalation of treatment81,82. Emerging point of care diagnostics are in readiness to accelerate all accurate and reliable respiratory pathogen sampling to guide timely treatment, and surveillance on a global level83,84.
Data Availability
Additional data such as search terms and protocol can be accessed on PROSPERO https://www.crd.york.ac.uk/prospero/ reference number CRD42020189577
DECLARATIONS
Competing interests
None to declare
Funding
None to Declare
Authors’ contributions
MF Conceived the study, registered the protocol, conducted the literature search, screened titles abstracts and full texts, conducted the risk of bias assessments, performed the statistical analysis and authored the manuscript.
MK Helped provided advice on inclusion and exclusion criteria, provided clinical relevance advice, and reviewed the final manuscript
JD Dual screened a proportion of titles, abstracts and full texts, validated the risk of bias assessments and reviewed the final manuscript
All authors have read and approved the final manuscript
Availability of data and materials
Full data, including inclusion and exclusion criteria and search terms can be accessed on the PROSPERO register registration no. CRD42020189577 https://www.crd.york.ac.uk/PROSPERO
Acknowledgements
Many thanks to Mr Lucasz Zygan for lending his expertise in Otolaryngology
Footnotes
Flynn-m5{at}ulster.ac.uk
↵* These studies predate the SARS-COVID-19 strain
List of Abbreviations and Acronyms
- PRISMA
- Preferred Reporting Items for Systematic Reviews and Meta-Analyses
- QUADAS-2
- Quality Assessment of Diagnostic Accuracy Studies -2
- (RT-)(q-)PCR
- (Reverse Transcriptase)(quantitative)Polymerase Chain Reaction
- SARS(-CoV-19)
- Severe Acute Respiratory Syndrome(–coronavirus-2019)
- DIF
- Direct Immunoflourescence
- NPS
- Nasopharyngeal Swab
- NTL
- Naso-tragal Length
- NA
- Nasal Aspiration
- NW
- Nasal Wash
- PROSPERO
- International prospective register of systematic reviews
- WHO
- World Health Organisation
- Rayyan QCRI
- Rayyan Qatar Computing Research Institute
- NP
- Nasopharyngeal
- RevMan
- Review Manager
- 16S-RNA
- 16S - Ribonucleic Acid