ABSTRACT
Background The traditional diagnosis of sepsis has always been based on microbial blood culture (BC). However, BC suffers from (1) long culture cycle, leading to delay in results, and (2) low diagnostic yields. Metagenomic next-generation sequencing (mNGS) has been proposed as an efficient and agnostic option that potentially overcomes these issues. In this study, a mNGS workflow utilizing a novel filter to specifically capture white blood cells and deplete host DNA background, was evaluated against BC results, as well as mNGS without host depletion, for pathogen identification.
Materials and Methods Patients admitted to Taipei Veterans General Hospital (TVGH) with suspected sepsis were recruited to the study approved by the IRB. Blood sample was taken for BC (designated as BC1) before any antibiotic exposure. Upon patient enrolment, blood was taken again and divided in 3 portions with one used for the 2nd BC (BC2). The other two were used for mNGS with one processed with the filter and the other without filtering, to assess the effectiveness of host-depletion by the filter.
Results A total of 50 patients were recruited among which 45 had results for all 4 tests. mNGS with filter had the highest positive rate of 74.4%, followed by mNGS without filter and BC1 (51.1% and 50.0% respectively), while the 2nd BC had the lowest positive rate of 22.0%. Further, mNGS was less sensitive to antibiotics exposure as compared to BC. The overall correlation between samples with vs without filtration (R2=0.96) confirmed that filtration does not affect microbial composition in a sample. For the BC positive samples, the effect of host depletion by filtration increased microbial target reads/million QC reads from 46 reads to 243 reads on average. Microbial reads enrichment by the filter appeared to be more effective for the samples with lower microbial concentration, thus increasing the test sensitivity over mNGS without filter. Using the 2nd BC results as reference, mNGS with filter and mNGS without filter exhibited sensitivities of 81.8% and 63.6%.
Conclusion The mNGS with filter was able to recover most of the pathogens identified by clinical BC and achieved the highest diagnostic yield. With the clinical implementation to complete the workflow within 24 hours, it has the potential to overcome slow turnaround and low diagnostic yield issues of traditional BC.
Introduction
Sepsis is a life-threatening condition arising from the human immune response to the infection of the bloodstream [1]. Sepsis can progress into septic shock, organ failure and death if diagnosis and correct treatment are not timely made [2]. Morbidity and mortality rates of blood stream infections (BSIs) are especially high in the intensive care units (ICU) and neonatology units of hospitals [3]. The traditional workflow to diagnose sepsis in clinical microbiology laboratories starts with blood culturing (BC), necessitating between 1 ml and 10 ml of blood per BC bottle, often in automated systems such as the BACTECTM (Becton Dickinson-BD, Maryland, USA) or BacT/Alert (bioMérieux, Marcy l’Etoile, France. A positive BC is followed by Gram staining. Bacterial genus and species identification can be performed on the cultured strain with biochemical assays or MALDI-TOF mass spectrometry (MS). Antibiotics susceptibility testing of the cultured strain is generally performed with microbiological assays.
The short-comings of traditional methods have been expounded [4]: they do not detect viruses and parasites; some bacterial pathogens are difficult to culture; the culture cycle is long, so the results are delayed; and it is difficult to identify mixed infections (due to single colony picking). One of the major shortcomings of traditional methods is its low diagnostic yield (positive rates). In one study [4], the positivity rates of blood, sputum, and BALF using traditional detection methods were 14% (13/90), 38% (34/90), 22% (10/45), respectively. In immunocompromised and immunocompetent groups of patients, 47% (14/30) and 27% (16/60) of patients, respectively, were positive by NGS but negative by traditional detection methods.
Despite the availability of rapid molecular assays, most early antibiotic treatment is still empirical. According to one study [5], approximately 46% of early empirical antibiotic treatment was inappropriate, directly leading to a sepsis-related mortality rate of almost 35%. Approximately 50% of antibiotics administered are unnecessary or too broad-spectrum, which increases the toxicity of drugs and the incidence of bacterial resistance [5]. Therefore, early identification of pathogens is particularly important to enable targeted antibiotic therapy. Several reports have presented the potential advantages of molecular diagnostics over BC, such as a shorter turnaround time, detection of difficult to grow bacteria or detection after prior intake of antibiotics which inhibits bacterial growth [6]. Caliendo et al. reported that molecular diagnosis of BSI can reduce hospitalization rates and length of ICU stays, and decrease mortality due to BSI [7].
In recent years, metagenomic NGS has been proposed as a more efficient and accurate means for pathogen diagnosis and expanded the options of diagnostic strategies for pathogen identification [4] [8] [9] [10]. mNGS is based on the “agnostic” extraction and sequencing of all nucleic acids in a patient’s specimen in parallel, so as to obtain the sequence of host and microorganisms. mNGS can be applied to a wide range of specimen types (sputum, throat swab, blood, alveolar lavage fluid, pleural fluid, cerebrospinal fluid, pus, tissue specimens, etc), and it can directly detect nucleic acids in clinical samples without biasness or selectivity. Viruses, bacteria, fungi, and parasites can be detected in an unbiased manner. mNGS could detect and identify multiple pathogens simultaneously.
NGS was used for the clinical diagnosis of neuroleptospirosis in a 14-year-old critically ill boy with meningoencephalitis; this case was the first to demonstrate the utility of metagenomic NGS (mNGS) in providing clinically actionable information, as successful diagnosis prompted appropriate targeted antibiotic treatment and eventual recovery of the patient [11]. Huang et al. showed that in patients with pulmonary infection, the positive-rate of mNGS (88.30%) for pathogen detection in pulmonary infection was much higher than that of traditional detection methods (25.73%), while the specificity of mNGS (81.16%) was slightly lower than that of traditional detection methods (88.41%) [9]. Cheng and Yu [4] showed that, in a cohort of immunocompromised and immunocompetent sepsis patients, 77 (86%) were positive for 1 or more pathogens using NGS, and 50 (56%) were positive using traditional detection methods.
One widely recognised major hurdle in mNGS of blood samples is the overwhelming presence of human DNA, leading to vast wastage of sequencing real estate [11]. A few methods have been proposed for removing host DNA background, including differential lysis of human cells [12] and methylated human DNA removal [13]. Recently, host cell depletion device such as Devin fractionation filter has been made available for removing human nucleated cells. Incorporating such device into mNGS workflow imposes an efficient way to enrich microorganisms in the sample, to increase the sensitivity of the assay or decrease the total cost of the assay. The objective of this study is to evaluate the efficacy of the Devin fractionation filter in host depletion and enrichment of microbial genomic sequences in the mNGS workflow used in this study. Another objective of this study is to evaluate the clinical performance of the mNGS workflow incorporating Devin filtration, using a cohort of patients with suspected sepsis who had been admitted to Taipei Veterans General Hospital (TVGH).
MATERIALS AND METHODS
Patient samples
Blood samples were recruited from patients admitted to Emergency Department of Taipei Veterans General Hospital (TVGH) from April 2021 to September 2022 who were suspected of sepsis. The study has been approved by the Institutional Review Board (Ethics Committee) of TVGH (IRB number, 2021-03-013AC, approval date: March 22, 2021). As illustrated in Fig. 1, Blood sample for routine clinical blood culture was taken upon admission as the 1st blood culture (designated as BC1) which may be followed by antibiotic exposure for most of the cases. Blood sample for this study was then taken after consent was obtained. It was divided in 2 portions. One portion was used for blood culture designated as 2nd blood culture (BC2) while the other portion of approximately 8 mL was used for mNGS.
Experimental plan for this study.
Host depletion using Devin™ fractionation filter
In order to assess the effectiveness of host-depletion by the Devin™ filter, the ∼8 mL whole blood sample was further divided into two portions, one processed for host-depletion using the Devin™ fraction filter (Micronbrane Medical, Taiwan), and the other processed without filtering. The Devin™ fractionation syringe filter series exploits the antifouling Zwitterionic coating technology to specifically capture the white blood cells (>99%) in the whole blood, while allowing other blood components to flow through the membrane. The filter was securely attached to a syringe on the one side. Approximately 4 mL of whole blood sample was transferred into the syringe. The syringe plunger was gently pressed down to push the sample through the filter to a 15 mL falcon tube. Both the filtered and unfiltered blood samples were used for the downstream assays.
Sample processing and DNA extraction
As illustrated in Fig. 1, ZymoBIOMICS Spike-in Control I (High Microbial Load) (Zymo Research, Irvine, CA) which included two extremophiles bacterial species (Imtechella halotolerans and Allobacillus halotolerans) was added to all samples including NTC (No Template Control) at the concentration of 10^4 Genome Copies/mL to act as internal reference controls. Both the filtered and unfiltered blood samples were centrifuged at low speed (400g) for 15 min at room temperature to obtain the plasma. The plasma was then centrifuged at high speed (16,000g) to obtain the sample pellet for DNA extraction using Devin™ Microbial DNA Enrichment kit (Micronbrane Medical, Taiwan). For selected samples, the supernatant after the high-speed centrifugation was also obtained for cell-free DNA (cfDNA) extraction using iCatcher® Circulating cfDNA1000 Kit (CatchGene Co., Ltd., Taiwan).
Library construction and NGS
Library preparation with DNA extracted from the pellet was carried out using Illumina DNA Prep Library Kit (Illumina, USA), according to manufacturer’s instructions. After DNA tagmentation, PCR amplification was carried out using the following conditions: (Top lid at 100°C); 68°C, 3min; 98°C, 3min; 15 cycles of 98°C, 45sec/ 62°C, 30sec/ 68°C, 2min; 68°C, 1min; 10° C, ∞. After amplification, the reaction was purified using Sample Purification Beads and eluted with 21 µl of Resuspension Buffer (Illumina, USA). 20 µl supernatant was transferred to a new Lobind Eppendorf tube and stored at -20°C. The constructed libraries were sent to a service provider for sequencing on Illumina NovaSeq platform to obtain a minimum of 20 million reads per sample at 150bp. NGS Library preparation for cfDNA from plasma were carried out using xGen™ ssDNA & Low-Input DNA Library Preparation Kit (Integrated DNA Technologies, USA) following manufacturer’s protocol.
Bioinformatics pipeline and results interpretation
All sequencing reads were trimmed for adapter sequences and poor-quality bases (<Q30) using fastp v0.23.2 [14]. Reads mapped to human genome (GRCh38) were performed with bwa 0.7.17 (bwa-mem algorithm) [15]. The remaining reads were aligned to microbial database with bwa 0.7.17. A set of representative genomes for microorganisms (bacteria, viruses, fungi, protozoa, and other multicellular eukaryotic pathogens) from the NCBI Nucleotide and Genome databases was used for microbial alignment. The final database consisted of about 1400 genomes. For each sample, the % of each microorganism (microbe %) was calculated as the % of the classified reads in the total microbial reads. The absolute classified reads were normalized to one million QC reads named as RPM (Reads per Million) for each microorganism. A non-template control (NTC) was mandated for each reagent batch. The Microbe % and RPM was calculated in the same way for NTC as for the samples. The NTC was used to filter out contaminants from the laboratory and reagents. Sample to NTC ratios were computed using % of reads/total microbial reads (microbe %SAMPLE : microbe %NTC). Microorganisms were kept only if their % were found ≥ 5-folds in samples than in controls and their RPM ≥ 5. mNGS results were primarily compared with microbial identification results from BC2 (the reference result), since both mNGS and BC2 resulted from the same blood draw. Subsequently, BC1 results as well as other culture results if available were also used for evaluating the overall performance of mNGS. Prior to finalization of read-length of analysis, a subset of 11 BC1 positive samples were subject to analysis based on 100bp, 120bp, and 150bp (Supplemental Table 1). Analysis using various read length did not affect the recovery of positive sequences. As such, 100bp-based analysis was selected to be used in this study as the sequencing and analysis time of 100bp read length is the most feasible condition to finish the whole metagenomic NGS workflow within 24 hours for real clinical application.
Comparison of target reads from analyses based on 100bp, 120bp and 150bp read lengths.
RESULTS
Clinical Performance of mNGS with and without host depletion
Total of 50 test subjects (identified by sample number in Table 1) were recruited for this study. These were patients admitted into Emergency Department with suspected sepsis. The microbial culture results from the first blood draw (BC1) were the routine clinical results. This may be followed by antibiotic exposure in most cases. Blood sample for this study (BC2) was then taken after consent was obtained. For testing the efficacy of host depletion, each sample was processed with and without Devin filtration followed by the same DNA extraction, NGS library construction process and sequenced at 150bp. Three cases (#25, #31 and #37) had library preparation failure for both with and without filter samples. Two cases (#27 and #38) failed library preparation failure for the without filter sample only. Hence, the success rate of library prep was 92% (92/100).
Summary of mNGS results correlated with clinical microbial culture and reference microbial culture results.
As the turn-around time (TAT) is one of the crucial parameters for pathogen detection method, we first evaluated the microorganism classification performance by different sequencing read lengths in the 11 samples having positive culture results for both BC1 and BC2. As shown in Supplementary Table 1, the sequencing analysis results were compared among different read length of 150, 120 and 100bp in both mNGS with and without Devin™ filter. Reads were normalized to million QC reads (RPM) for comparison among samples. Overall, a highly comparable results were observed for different read length among all 11 samples. Most importantly, when applying the tentative cut-off criteria for potential pathogen identification, viz fold change (microbe %SAMPLE : microbe %NTC) ≥ 5 and microbial RPM ≥ 5, the results were unaffected by different read length. Hence, analysis with 100 bp read lengths offers the most efficient option, and was selected to be used in this study. The summary of the reads classification results at 100 bp for all samples was shown in the Supplementary Table 2.
Sample reads from all samples.
Both fold change (microbe %SAMPLE : microbe %NTC) and individual species RPM (Reads per Million) are direct indicators of the amount of species in the sample. Using BC results as a comparison, setting both indictors at a threshold of ≥ 5 for pathogen calling appeared to be reasonable criteria to achieve good sensitivity and specificity. The microorganisms identified passing these criteria for all samples were listed in Supplementary Table 3. The identification results of all samples were then compared to both the BC1 and BC2 results and summarized in Tables 1 and 2. As shown in Table 1, when comparing to BC2 positive results, mNGS with filtration was able to detect the pathogens consistent with the microbial culture in 9 out 11 samples while mNGS without out filter was able to detect 7 out of 11 samples. Hence the sensitivity of mNGS with or without filtration was 81.8% and 63.6% respectively. Filtration let to higher sensitivity performance through increasing overall number of microbial reads and target species reads in most of the samples. In sample #17, both mNGS methods failed to detect MRSA. In sample #9, both mNGS failed to identify E. coli but were able to detect other species passing the criteria set. Further, mNGS (with filtration) provided the highest diagnostic yield (35/47; 74.4%), followed by mNGS (without filtration) (23/45; 51.1%), BC1 (25/50; 50%) and BC2 (11/50; 22%). Out of 12 samples which were positive for BC1 but negative for BC2, mNGS detected pathogen in 8 (with filter) and 4 (without filter) samples consistent with the culture results of BC1. Hence, this data also suggested that mNGS is less sensitive to antibiotic treatment as compared to culture methods.
Complete list of potential pathogens Identified by mNGS (Fold change (% Microbialsample/% MicrobialNTC >=5 and RPM >=5)
Potential pathogens identified by mNGS in the 22 blood culture negative samples.
In general, mNGS detected more microorganisms per sample as compared to microbial culture (see columns labelled as “Others” in Table 1 and “# Potential Pathogens Identified” in Table 2). Microbial detection by mNGS from blood culture negative samples, where both BC1 and BC2 were negative, is shown in Table 2. Interestingly, amongst the 22 blood culture negative samples (both BC1 and BC2 negative), microbial detection in 5 samples were correlated to additional testing results – mNGS results in 4 samples included a pathogen that correlated with other clinical results; 1 sample was negative for mNGS and had negative results from all routine tests. This suggested that although mNGS detected more pathogens as compared to microbial culture, the additional pathogens called may be correlatable to other clinical results, and may be clinically useful.
Effect of Devin filtration on number and ratio of microbial reads
Although human reads remained the predominant class of sequence from whole blood samples despite the employment of host cell depletion by Devin™ filter, total microbial RPM increased considerably in 38 out of 45 pairs of samples having data from both mNGS with and without filter. For the BC positive samples, the effect of host depletion by filtration increased microbial target reads/million QC reads from 46 reads to 243 reads on average. A plot of the microbial RPM before and after filtration (Fig. 2a) showed that a wide range of enrichment (fold change in microbial RPM upon filtration compared to that without filtration) occurred amongst the samples. Fold-enrichment ranged from less than 1-log to more than 3-log (Fig. 2a, Supplementary Table 2). Particularly, the enrichment of microbial reads appears to be more effective with lower number of microbial reads entering the filter. In a small fraction of samples, enrichment was unsuccessful and number of reads declined slightly upon filtration. The reasons could be various and would be further discussed in the later session. In a fraction of samples, enrichment appeared insignificant, with no enrichment or even decrease in number of microbial reads upon filtration (samples 10, 12, 21, 22, 40, 48, 50). Interestingly, 3/7 of these were negative for BC1 and BC2, and 3/7 of these were negative for BC2, suggesting that there might be no significant level of pathogen content in these samples.
Comparisons of mNGS results from “With Filter” and “Without Filter” samples. (a) Comparison of sample reads (in Reads per Million) with and without filtration, arranged in order of decreasing reads for “Without Filter” samples. Y-axis (Sample output in RPM) is in log scale. (b) Plot of microbial genome % from “Without Filter” samples vs “With Filter” samples.
The % of individual species reads (microbial%) was computed by dividing each species reads to the total microbial reads (after removing human and unclassified reads). During infection, microbial% of the pathogen would be higher than that in normal individuals or a pre-defined baseline [16]. As such, we use the fold-enrichment of individual species microbial% in a sample over that in the NTC as one of the criteria to identify potential positive pathogens. Hence, it is important that the process of Devin filtration should not randomly alter the microbial% for each microorganism in a sample. To verify this, the microbial % reads for each microorganism in all samples with filter was correlated with that from the same samples without filter (Fig. 2b). Results showed that there was a tight correlation (R 2=0.96) in microbial % reads with and without filtration, and the correlation was close to 1:1. Hence, the process of Devin filtration does not affect proportion of microbial species in general.
Comparison with cfDNA detection
Although this study was based on the use of gDNA for pathogen detection, a number of mNGS studies employs cfDNA as the target for pathogen detection. The strength and weakness of a cfDNA approach is discussed below. To investigate the correlation between gDNA and cfDNA approaches, cfDNA was harvested from the plasma from selected BC positive samples and subject to mNGS. Results showed that most of the expected pathogens were also detectable by cfDNA (Table 3). In the absence of filtration, the microbial RPM was similar between gDNA and cfDNA which was 280 and 140, respectively. However, with the use of filter, gDNA-based mNGS obtained significantly increased microbial RPM of an average of 2,359, while cfDNA-based method only achieved and average microbial RPM of 95 by filter, which was even slightly less than the process without filter. Hence, gDNA but not cfDNA was amenable to host-depletion and enrichment in microbial sequences. This suggests that signal derived from cfDNA would be closer to baseline noise, imposing a greater tendency for calling false positives.
Comparison between genomic DNA-based mNGS and cell-free DNA-based mNGS in selected samples.
DISCUSSION
Enhancement of mNGS sensitivity by host depletion
One long-standing challenge of mNGS is the overwhelming presence of host DNA background in human biospecimens especially the whole blood. Although different host depletion methods have been tested, they are not practical in the clinical workflow in addition to widely varied host depletion effect. The host cell/DNA depletion by filtration tested in this study showed significant level of enrichment of microbial reads comparing to the sample processed without filter, especially for the samples with low microbial content. The host depletion indeed enhanced the sensitivity from 63.6% for mNGS without filter to 81.8% to mNGS with filter, suggesting that filtration can enrich microbial content in the gDNA portion of the sample using the process tested in this study. On the other hand, cfDNA was compared to this gDNA based mNGS approached in this study because most of the reported studies were based on cfDNA. No significant enrichment of microbial content in the cfDNA portion was observed between samples with and without filtration. Thus, we demonstrated in this study that gDNA based mNGS was amenable to host-depletion and enrichment in microbial sequences by filtration. With the easy process of filtration together with 100bp sequencing and analysis, it is possible now to establish a clinical applicable workflow can be completed within 24 hours with enhanced sensitivity comparing to other mNGS methods.
Criteria for pathogen identification by mNGS
Another major challenge of mNGS is that currently it lacks consensus of the definitive criteria for distinguishing positive pathogen from background. It’s documented that most of the reagents used for mNGS have been found to also introduce foreign DNA during the sequencing process. This phenomenon has been described as “kit-ome”, which will seriously confound the sample result [8]. Therefore, it is important to capture the nucleic acid background of the environment and reagents used for sequencing, which can help filter out contaminated background readings during the interpretation of the results. In this study, a template-free control (NTC) was used in the mNGS analysis to set the baseline for making positive calls. We used the tentative cut-off criteria fold change (microbe %SAMPLE : microbe %NTC) ≥ 5 and microbial RPM ≥ 5 for potential pathogen identification to achieve sensitivity as well as remove potential false positive from the background as much as possible. However, the current criteria may miss some potential positive. For expample, in sample 33 of which the urine culture is positive for Aerococcus urinae, this species showed significant fold change of microbe %SAMPLE : microbe %NTC) but microbial RPM of 3.18 didn’t pass this species as positive. Further refinement of such criteria need to be tested in a much larger sample size. On the other hand, in some of the samples especially those processed at the same time, there were some identified positives may contributed by the process background such as sample 3 and 4, sample 7 and 8, sample 13 and 15. Further refinement of this would be 1) to apply NTC to every batch of sample processing and sequencing; 2) to build a database of all NTCs of the same batch of reagents.
Clinical accuracy and diagnostic yield
Karius offers a commercial cfDNA based mNGS laboratory developed test (LDT) for sepsis diagnosis. The test received CLIA and New York State approvals, with a claimed sensitivity of 92.9% and a claimed specificity of 62.7%. Working on pathogen detection in cerebrospinal fluid (CSF) by mNGS, Miller et al [16] reported a claimed sensitivity of 73% and a claimed specificity of 99%. In some study [17], sensitivity and specificity were not explicitly stated. In this study, a sensitivity of 81.8% was reported.
Karius reported 2.7x more causal pathogens than initial blood culture and 1.3x more causal pathogens than all microbiology tests. Cheng and Yu [4] reported in their study that 77 (86%) were positive for 1 or more pathogens using NGS, compared to 50 (56%) which were positive using traditional detection methods. In this study, the diagnostic yield by mNGS with host depletion by filtration (90%) was significantly higher than clinical microbial culture (56%). Hence, mNGS was able to provide more positive results than conventional pathogen detection.
A previous study showed that the rate of positive mNGS results was constant over the different time points after sepsis developed, while the positivity of blood culture decreased at later time points [10]. In another study, cell-free DNA sequencing could still identify fungus such as P. jirovecii or Aspergillus species among patients receiving effective antifungal agents [18]. In this study, the second blood culture done after antibiotic treatment had significantly lower sensitivity compared to mNGS, and there were several cases where an organism is detected consistent with BC1 but was negative by BC2. Hence, our findings were consistent with other studies in that mNGS possessed greater sensitivity for those who had blood sampled prior to effective antimicrobial agent exposure, suggesting mNGS could be a more effective detection method for patients undergoing antibiotic treatment.
Future prospects and concluding remarks
The target independent identification of potential pathogen by mNGS has made it a very promising tools for clinical microbiology comparing to other molecular tests. It can detect different microorganisms simultaneously, for example, Escherichia coli and Candida albicans in sample 43. It can detect microorganisms unculturable by routine culture method, for example, Torque teno virus in sample 41. In conclusion, mNGS was able to [19] identify majority of the organisms detected by microbial culture and showed a good correlation with routine clinical results. In addition, it provided more results, potentially able to assuage the short-coming of poor diagnostic yield of conventional culture. However, the demonstration of clinical utility is required for such test to be applied for clinical diagnostics. Stanford Healthcare set out to determine the real-world clinical utility of the Karius test in a subset of immunocompromised patients [20], with the conclusion that, in 48 cases, positive impact was observed in 6 (7.3%) cases, negative impact observed in 3 (3.7%) cases, and no impact in 71 (86.6%) cases. More studies need to be performed to assess clinical utility in different group of patients.
The other important questions to consider are whether mNGS has the diagnostic accuracy required to complement currently available diagnostics, and how best to integrate mNGS into current testing algorithms. One possibility is to reserve mNGS as last resort for cases for which conventional microbiological testing has failed to provide an answer, or to include mNGS more proximally in testing in parallel to conventional tests. These are currently an active subject of discussion in the microbiology field that needs to be determined empirically.
Data Availability
All data produced in the present work are contained in the manuscript