Abstract
Reasons for the spectrum of severity of active tuberculosis (TB) disease are incompletely understood. We sought to identify master regulators of host immune responses that determine disease severity in pulmonary TB. We performed molecular profiling of human in vivo immune responses to discover associations with the extent of radiographic TB disease in a patient cohort. We then undertook mechanistic studies to test causality for the observed associations using the zebrafish larval Mycobacterium marinum infection model. Transcriptional profiling of human immune recall responses to tuberculin skin test (TST) challenge, a surrogate for immune responses to TB in the lung, revealed that type I interferon activity in the TST transcriptome was inversely associated with radiological disease severity. Abrogation of type I interferon signalling, achieved by CRISPR-mediated mutagenesis of stat2, led to increased susceptibility of zebrafish larvae to M. marinum infection, as a result of reduced recruitment of myeloid cells required to restrict mycobacterial growth, to the site of disease. Our data support a host protective role for type I interferon responses in mycobacterial infection, with potential applications for risk-stratification of adverse outcomes and development of a host-directed therapy to mitigate against severe disease.
One Sentence Summary Type I interferon responses contribute to host immune protection in mycobacterial infection by recruitment of myeloid cells to the site of disease.
Introduction
Mycobacterium tuberculosis (Mtb) infection results in diverse clinical manifestations, ranging from asymptomatic infection to active tuberculosis (TB), which comprises an enormous spectrum of disease severity (1, 2). Although we know that immune responses contribute to the risk of disease, current understanding of the role of immune response variation in disease severity is limited. Identification of immune correlates of disease severity may offer new insights into the immune pathways that contribute to protection and pathogenesis. This is necessary to enable novel vaccine design and evaluation, stratify disease risk in people who become infected and inform development of novel host-directed therapies to shorten treatment regimens, mitigate against antimicrobial resistance, and against the chronic sequelae of tissue injury in TB (3).
Variation in tissue immunology of human TB at the site of established disease is likely confounded by differences in the chronicity of the disease process. To overcome this limitation, we evaluated in vivo immune responses at the site of a standardised antigenic challenge using the tuberculin skin test (TST). We have previously shown that transcriptional profiling of skin biopsies from the site of the TST provides comprehensive molecular and systems level evaluation of human in vivo immune responses to Mtb with significantly greater sensitivity than conventional clinical and histological assessments (4, 5). The TST gene signature is enriched within pulmonary TB granulomas and accounts for genome-wide differences in the transcriptomes of TB infected lung compared to healthy lung tissue (5, 6). Therefore, the TST provides a faithful and readily accessible surrogate to study immune responses at the site of TB disease.
In the present study, we identified quantitative variation in immune pathways in the TST response correlated with the severity of human TB disease, and then tested the hypothesis that these pathways were causally related to outcomes of Mycobacterium marinum infection in zebrafish larvae. This natural host-pathogen interaction generates granulomatous inflammation that recapitulates key aspects of human TB pathology and in particular, provides unparalleled opportunities to study innate immune responses in isolation, given the absence of functional adaptive immunity until 3-4 weeks (7). Our innovative strategy revealed an inverse relationship between type I interferon activity and radiographic severity of human TB, suggesting a host-protective role for type I interferons, that we confirmed in zebrafish larvae by showing that mutagenesis of stat2 to model abrogation of downstream type I interferon signalling led to increased severity of M. marinum infection.
Results
Outlier analysis of the TST transcriptome defines human in vivo immune responses to active tuberculosis
We supplemented data from our previously reported TST transcriptional responses (8) to provide a total sample size of 51 people with microbiologically confirmed pulmonary TB who were within one month of commencing anti-tuberculous chemotherapy (Table 1, Fig. S1A, B). We evaluated disease severity using a published radiographic scoring system which incorporates extent of disease and the presence of cavitation on chest x-ray (9). In order to capture the inter-individual variation in immune response to the mycobacterial challenge, outlier profile analysis (10) was used to identify transcripts in the TST of each individual that were significantly enriched compared to gene expression in control data from skin biopsies at the site of saline injections. The integrated list of outlier genes statistically enriched in TST samples compared to control saline samples from all 51 individuals defined an active TB TST transcriptome comprised of 3222 transcripts. This analysis revealed that although some aspects of the TST response are consistent in all people, there is also considerable variability between individuals (Fig. S1C). The 3222 transcripts identified by this approach, were enriched for canonical pathways involved in cell mediated responses, consistent with our previous description of the TST gene signature identified by conventional differential gene expression analysis (5, 8) (Fig. S1D).
Reduced type I interferon activity in the TST transcriptome is associated with increased severity of human TB disease
Next, we sought to identify immunological pathways correlated with disease severity. We identified individual genes in the TST transcriptome that showed statistically significant correlations with the radiographic severity score. We subjected these genes to Ingenuity Pathway Analysis upstream regulator analysis to collate groups of co-regulated genes (Fig. 1A and Fig. S2A). These were annotated by their upstream regulators at the level of cytokines, transmembrane receptors, kinases and transcriptional regulators, representing the key components of signalling pathways which mediate immune responses. To provide increased confidence that these modules represented co-regulated genes in each molecular pathway, we retained only those that had significantly greater co-correlated expression than modules of genes randomly selected from our TST transcriptomes (11). 111 modules were identified among genes inversely correlated with radiographic TB severity, suggesting they may contribute to host protection against severe disease (Fig. 1A-C and Fig. S2B). These included transcriptional modules representing pro-inflammatory cytokine activity (tumour necrosis factor alpha and interleukin (IL)1 beta), interferon gamma and T cell activity, consistent with existing evidence predominantly supporting their protective roles in TB (12) (Fig. 1B). In addition, we found enrichment of multiple co-regulated gene networks attributed to type I interferon signalling, suggesting that type I interferon responses may also contribute to host protection in mycobacterial infection (Fig. 1B, C). Interferon gamma was also predicted to regulate one of only two small putative networks within genes positively correlated with disease severity; the other driven by TCL1A, an AKT kinase activator that promotes cellular survival (13) (Fig. S2C).
To validate the immunological responses predicted to be associated with disease severity by upstream regulator analysis, we quantified the expression of two independently derived and largely non-overlapping type I interferon inducible gene modules reflecting the biological activity of type I interferons (Table S1) (5, 8), within the TB TST transcriptome (Fig. 1D). This provided independent validation of the inverse relationship between type I interferon responses and radiographic severity of TB (Fig. 1E), suggestive of a host-protective role. By contrast, the expression of the entire TST transcriptome and that of three distinct independently derived interferon gamma inducible gene modules (5) (Table S1) did not correlate with disease severity (Fig. S3).
stat2 CRISPant zebrafish larvae exhibit increased susceptibility to M. marinum infection
The TST transcriptome does not solely reflect anti-mycobacterial T cell memory (4) and type I interferons are primarily induced following activation of innate immune recognition pathways (14). Therefore, to evaluate whether impaired type I interferon responses cause more severe mycobacterial disease, we exploited the zebrafish larval M. marinum infection model, which allows exclusive assessment of innate immunity (7). The mammalian canonical type I interferon signalling pathway is highly conserved in zebrafish (15, 16) (Fig. 2A). First, we confirmed that intravenous M. marinum infection of wild type zebrafish larvae induces a type I interferon response, as evidenced by enrichment for two distinct independently derived transcriptional modules reflecting type I interferon activity (11, 16) (Table S1) within genome-wide transcriptomic data obtained after four days of mycobacterial growth (Fig. 2B, C). To block downstream type I interferon signalling we used CRISPR-mediated mutagenesis of stat2, a subunit of the interferon stimulated gene factor (ISGF)3, the principal transcription factor complex which induces expression of type I interferon stimulated genes (ISG) (17–19), for which there is a single zebrafish orthologue (15) (Fig. 2A). Injection of three guide RNA/Cas9 ribonucleoprotein (RNP) complexes each targeting a distinct stat2 exon, into one cell stage wild type zebrafish embryos led to >90% efficient mutagenesis, confirmed by next-generation sequencing (Fig. S4A and Fig. S5A). We verified loss of functional downstream type I interferon signalling by showing that induction of the classical ISG Mxa (16, 19) by recombinant interferon phi 1, measured by expression of a reporter mxa:mCherry transgene, was significantly attenuated in stat2 CRISPants compared to siblings injected with “scrambled” (negative control) RNPs designed to lack genomic targets (Fig. S4B and Fig. S5B-C). stat2 CRISPants developed normally up to five days post-fertilisation (Fig. S5B).
There was no difference in survival of M. marinum infected stat2 CRISPants and scrambled RNP injected siblings (Fig. 3A, B) within the timeframe of our experiments. However, mycobacterial burden, the number of bacterial foci and the spatial dissemination of bacteria, determined by quantitation of fluorescent foci of M. marinum, were all significantly higher in intravenously infected stat2 CRISPants compared to control RNP injected siblings, indicative of more severe disease (Fig. 3A, C-E). These findings are consistent with a host protective role for type I interferons in mycobacterial infection, as suggested by our observation of an association between reduced type I interferon activity and increased disease severity in human TB (Fig. 1).
Steady state neutrophil numbers are reduced in stat2 CRISPant zebrafish larvae
Macrophages are critically important for control of M. marinum infection in zebrafish larvae (20–22) and neutrophils may also contribute to mycobacterial killing (23, 24). Therefore, to investigate the mechanisms by which stat2 deficiency leads to worse mycobacterial infection we used transgenic zebrafish lines with fluorescent immune cell lineages to generate stat2 CRISPants that have fluorescent macrophages or neutrophils. First, we asked whether stat2 disruption impacted steady state macrophage or neutrophil numbers. We demonstrated by quantitative fluorescence microscopy, using integrated fluorescence as a surrogate for cell number, that stat2 deficiency did not affect baseline macrophage numbers (Fig. 4A, B). However, it was associated with reduced neutrophil numbers (Fig. 4C, D).
Recruitment of macrophages but not neutrophils to the site of sterile injury is stat2 dependent
Type I interferons have previously been shown to influence cellular migration (14, 25). Therefore, we tested macrophage and neutrophil recruitment in stat2 CRISPants using a tailfin transection model of sterile injury, most commonly used to evaluate neutrophil recruitment (26) (Fig. S4C). The timing of neutrophil homing to the site of sterile tail fin transection was normal in stat2 CRISPants, with maximal accumulation at six hours and dissipation of recruited cells becoming evident by 24 hours, similar to that of control RNP injected embryos (Fig. 5A, B) and in keeping with previous studies (26, 27). Despite baseline neutropenia, the number of neutrophils at the wound site at the point of maximal recruitment in stat2 CRISPants exceeded that of control RNP injected embryos. But by 24 hours the number of neutrophils at the injury site was significantly reduced in stat2 CRISPants, likely reflecting the attenuated increase in total neutrophil numbers in response to the acute inflammatory challenge (Fig. 5C, D).
As previously reported, macrophage recruitment to the tail wound site was slower than that of neutrophils (27, 28). Very few macrophages were present at the tailfin transection site by one hour in either stat2 disrupted or control transgenic larvae, but recruitment progressively increased until 24 hours in both groups (Fig. 6). Fewer macrophages were found in the vicinity of the wound site at all time points post-injury in stat2 CRISPant transgenic embryos compared to control RNP injected siblings, the majority being located outwith the tailfin (Fig. 6). Taken together these data suggest that stat2 is necessary for macrophage recruitment but non-essential for neutrophil recruitment to a sterile injury site. In addition, baseline neutropenia did not impact the number of cells localised to the wound at peak neutrophil recruitment.
Recruitment of myeloid cells to the site of M. marinum infection is stat2 dependent
Next we focused on macrophage recruitment to the site of mycobacterial infection because this has been reported to be crucial for control of M. marinum growth in zebrafish larvae (21). We found a significant reduction in the number of macrophages recruited to the site of localised hindbrain M. marinum infection in stat2 deficient transgenic zebrafish larvae compared to their counterparts injected with scrambled RNPs (Fig. S4D and Fig. 7A, B). We then evaluated whether stat2 disruption affected recruitment of neutrophils to the site of mycobacterial infection. We found that in contrast to their normal peak recruitment to a sterile wound, the number of neutrophils attracted to the site of mycobacterial disease was also significantly diminished in stat2 CRISPants (Fig 7C, D). Collectively, these results demonstrate that myeloid cell recruitment to the site of M. marinum infection is stat2 dependent.
In total, the recruitment studies suggest that stat2 disruption leads to a broad defect in macrophage recruitment to both infectious and non-infectious inflammatory foci. By contrast, stat2 is non-essential for neutrophil migration to sterile injury. The latter finding is consistent with defective migration, rather than neutropenia, being the principal mechanism for reduced neutrophil numbers at the site of M. marinum infection.
Discussion
We have taken an innovative approach, combining molecular and systems level evaluation of human immune responses to a standardised experimental challenge, to identify master regulators of biological pathways associated with TB disease severity. We experimentally validated host factors responsible for differences in phenotype using a relevant, tractable animal model which represents a natural host-pathogen pairing (7). Molecular interrogation of human in vivo responses to Mtb revealed that reduced type I interferon activity within the TST transcriptome was associated with worse pulmonary disease. Deficient type I interferon signalling, achieved by genetic disruption of stat2, led to more severe mycobacterial infection in zebrafish larvae, via a mechanism involving impaired recruitment of macrophages and neutrophils to the site of disease and reduced neutrophil numbers at baseline. Taken together our results demonstrate an important role for type I interferon responses in contributing to innate immune protection against mycobacterial disease.
Peripheral blood interferon-inducible gene expression that diminishes with successful treatment is detectable in human active TB (29–31). These transcriptional signatures comprise both type I and type II interferon inducible genes but have mainly been interpreted as type I interferon responses, fostering the view that type I interferons are detrimental to the host in human TB. Mouse data predominantly suggest that type I interferon responses contribute to increased susceptibility to mycobacterial infection by mechanisms including antagonism of IL1 mediated protection (32–34) and reduced responsiveness of macrophages to interferon gamma (35). However, this phenotype has not been universally observed, likely due to differences in host and pathogen genetics (36). Moreover, impaired type I interferon responses are generally associated with relatively minor differences in bacterial burden or survival (36). Of note, Mtb is not a natural mouse pathogen, and this approach may not model host-pathogen interactions in human TB faithfully (37).
In support of a protective role for type I interferons, adjunctive interferon alpha therapy in addition to conventional anti-tuberculous antibiotic treatment led to symptomatic and radiological improvement, with a reduction in bacterial burden in respiratory tract samples, in small-scale human studies (38–40). In keeping with this, an early peripheral blood type I interferon-inducible gene signature predicts successful BCG-induced protection of macaques from subsequent Mtb challenge (41). Type I interferon mediated restriction of mycobacterial growth in in vitro granulomas comprising collagen impregnated microspheres containing Mtb infected human peripheral blood mononuclear cells (PBMC) (42) also suggests that type I interferon responses benefit the host.
Our observation of reduced recruitment of macrophages to the site of disease is likely the most important factor driving increased susceptibility to M. marinum infection in stat2 CRISPants. Although macrophages have been implicated in the dissemination of M. marinum in zebrafish larvae (20, 43), reduction of total macrophage numbers, impaired macrophage migration to the site of mycobacterial infection or failure to control intracellular bacterial growth within macrophages have all been associated with poor outcome (20–22, 44, 45). In contrast, increasing macrophage numbers via augmentation of M-CSF signalling either pre-or post-granuloma formation, promotes resistance to M. marinum infection by reducing granuloma necrosis, with consequent curtailment of extracellular mycobacterial growth (21). The recent report that human genetically inherited CCR2 deficiency is associated with reduced pulmonary recruitment of monocytes, resulting in depleted numbers of functionally normal alveolar macrophages and increased susceptibility to mycobacteria, is also consistent with the notion that adequate numbers of macrophages are important for protective anti-mycobacterial immunity (46).
The reduction in neutrophils present at the site of M. marinum infection in stat2 CRISPants may have arisen due to baseline neutropenia, defective migration or both combined. On the basis that steady state neutropenia did not affect the timing and magnitude of peak neutrophil recruitment to sterile injury, impaired migration is probably the main driver of this phenotype, which likely also contributed to increased severity of mycobacterial disease. Previous data show that reduced access of neutrophils to mycobacteria leads to increased bacterial burden in a zebrafish line in which neutrophils express mutant cxcr4, resulting in peripheral neutropenia and impaired neutrophil recruitment to sites of inflammation, due to their retention in haematopoietic tissue (23, 47).
Our study has some limitations. We were not able to address the reasons for variation in type I interferon activity in our human cohort. Serum samples for measurement of autoantibodies to type I interferons are not available and our sample size is insufficient for robust detection of associations between genetic variation and the expression levels of type I interferon signalling pathway genes. We were also unable to completely disentangle whether reduced neutrophil recruitment to the site of M. marinum infection was a consequence of neutropenia or impaired migration. We have not determined the mechanism by which attenuated type I interferon signalling leads to reduced myeloid cell recruitment to M. marinum. The expression of several chemokines is type I interferon inducible (25, 48, 49) and the CCR2-CCL2 and CXCR3-CXCL11 chemokine axes are known to be involved in macrophage recruitment to M. marinum (43, 50). Future studies are required to test the hypothesis that stat2 mutagenesis disrupts chemokine-mediated cellular recruitment to the site of mycobacterial disease. Nonetheless, our work provides new insights into type I interferon mediated host protection against mycobacterial infection.
Variation in type I interferon responses may arise due to genetically or epigenetically encoded variation in the immune response to Mtb, neutralising type I interferon auto-antibodies, burden and virulence of the mycobacterial inoculum and stochasticity in the immune response pathway following exposure (51–55). A central tenet of all these possibilities is that differences in the host immune response to Mtb lead to differences in disease severity. Our findings have potential implications for risk stratification of individuals in whom any of these mechanisms predisposing to low type I interferon activity are identified. In active TB, such individuals could be targeted for longer or more intensive drug regimens, or adjunctive type I interferon therapy delivered to the respiratory tract, to mitigate against severe disease. Early intervention with type I interferon treatment, before destruction of the lung parenchyma, is likely to be crucial to augment macrophage containment of Mtb, but timing and duration of therapy will require evaluation in relatively large-scale experimental medicine trials.
Materials and Methods
Ethics statement
The study was approved by the London Bloomsbury Research Ethics Committee (16/LO/0776). Written informed consent was obtained from all participants.
Study design
We aimed to identify master regulators of human immune responses that correlate with severity of TB disease, using transcriptional profiling of biopsies from the TST as a surrogate for immune responses to TB in the lung and to test causation for observed associations by genetic manipulation of identified upstream regulator molecules in the zebrafish larval mycobacterial infection model. To encompass a broad spectrum of disease severity we recruited 51 adults (>18 years) from four London hospitals, newly diagnosed with smear or culture positive pulmonary TB, who were within four weeks of starting treatment. This is in keeping with the median sample size of previous studies which have successfully demonstrated a relationship between severity of TB disease and immunological parameters (29, 56–59). Individuals with coincident human immunodeficiency virus infection, malignancy, taking immunomodulatory therapy, a history of immunization within the preceding two weeks or previous keloid formation, or unable to give informed consent were excluded. Study participants underwent intradermal injection of 0.1 ml of two units of tuberculin (AJ Vaccines) or saline in the volar aspect of the forearm as previously described (4, 5, 8). At 48 hours clinical induration at the injection site was measured, immediately prior to sampling by 3mm punch biopsy as previously described (4, 5). Additional saline samples from individuals with cured or latent TB, BCG vaccine recipients and healthy volunteers have been described previously (8).
RNA purification from skin biopsy and zebrafish embryo samples
Skin biopsies were homogenized as previously described (8). Pooled zebrafish embryos (25-30 per experimental condition) collected in Qiazol (Qiagen) were homogenized in CK14 lysing kit tubes (Bertin Instruments) using a Precellys Evolution homogenizer (Bertin Instruments) for 10-20 seconds at 5500 rpm. Total RNA from skin biopsies and zebrafish samples was purified using the RNEasy mini and micro kits (Qiagen), respectively, according to the manufacturer’s instructions. Genomic DNA was removed using the TURBO DNA-free kit (ThermoFisher).
Transcriptional profiling of skin biopsy and zebrafish larval samples
Genome-wide mRNA sequencing was performed as previously described (8, 60). Complementary DNA (cDNA) libraries were prepared using the KAPA HyperPrep kit (Roche). Sequencing was performed in paired-end mode with the NextSeq 500 High Output 75 Cycle Kit (Illumina) using the NextSeq500 system (Illumina) according to the manufacturer’s instructions, generating a median of 22 million (range 12.3-27.9 million) 41 base pair reads per human sample and 16.8 million (range 15.1-18.1 million) reads per zebrafish sample. Transcript alignment and quantitation against the Ensembl GRCh38 human genome assembly release 100 or GRCz11 zebrafish genome assembly release 104 was performed using Kallisto (61). Transcript-level output counts and transcripts per million (TPM) values were summed on gene level and annotated with Ensembl gene ID, gene name, and gene biotype using the R/Bioconductor packages tximport and BiomaRt (62, 63). Downstream analysis was performed using R 3.6.3 on log2 transformed TPM values for protein-coding genes only. TPM values <0.001 were set to 0.001 prior to log2 transformation. To determine the need for and effect of batch correction in the human data, principal component analysis (PCA) was performed using the prcomp R function on the integrated list of 6802 genes with the 10% lowest variance in expression for each library preparation date. This revealed separation of three samples in PC1, likely accounted for by lower RNA concentrations or RIN values compared to the rest of the dataset (Fig. S1A). After batch correction performed using the ComBat function of the sva R package (64), allocating samples with PC1 score <0 and >0 to separate batches, this was no longer evident (Fig. S1B).
Radiographic quantitation of TB severity
Disease severity was evaluated by estimating the extent of radiographic abnormalities and whether cavitation was present on chest x-ray at the time of diagnosis, as previously described (8, 9).
Outlier analysis
The BioBase Bioconductor R package was used to perform outlier analysis based on the z-score outlier detection (ZODET) algorithm (10), to identify transcripts in each TST sample whose expression was significantly higher than the mean expression in control saline samples (>2SD, p value <0.05), with a fold difference filter of >1 log2. Chi-squared tests implemented in R were then used to identify outlier genes statistically enriched in TST samples compared to saline samples, using a p value <0.05. To avoid type 2 error no correction for multiple testing was used in this analysis. In instances where multiple Ensembl gene IDs were associated with a single gene symbol, duplicate gene symbols were removed to retain the replicate with the highest expression for each sample.
Pathway enrichment analysis
The biological pathways represented by the TST transcriptome were identified by Reactome pathway enrichment analysis using XGR as previously described (65, 66). For visualization, 15 pathway groups were identified by hierarchical clustering of Jaccard indices to quantify similarity between the gene compositions of each pathway. For each group the pathway term with the largest number of annotated genes was then selected as representative of enriched biology.
Upstream regulator analysis
TST transcriptome genes whose expression was significantly correlated with radiographic severity of TB disease were identified by Spearman rank correlation, p value <0.05, without correction for multiple testing, to prevent type 2 error. Genes positively and negatively correlated with disease severity were separately subjected to Ingenuity Pathway Analysis (Qiagen) to identify upstream molecules responsible for their transcriptional regulation. This analysis was restricted to molecules annotated with the following functions: cytokine, transmembrane receptor, kinase and transcriptional regulator, representing the canonical components of pathways which mediate transcriptional reprogramming in immune responses. Enriched molecules with an adjusted p value <0.05 were considered statistically significant.
Upstream regulator functional modules
TST gene modules representing the functional activity of their upstream molecule were identified as previously described (11). In brief, we calculated the average correlation coefficient for pairwise correlations of the expression levels of the target genes associated with each predicted upstream regulator. We compared these to the distribution of average correlation coefficients obtained from 100 iterations of randomly selecting equivalent sized groups of genes from the TST transcriptome. Only target gene modules with average correlation coefficients higher than the mean of the distribution of equivalent sized randomly selected groups by ≥2 SD (Z score ≥2) with false discovery rate <0.05, were retained. We visualised these upstream regulator modules as network diagrams using the Force Atlas two algorithm in Gephi v0.9.2 (67), depicting all statistically over-represented molecules (false discovery rate <0.05) predicted to be upstream of ≥4 target genes.
Transcriptional modules
Interferon inducible transcriptional module gene lists are provided in Table S1. Independently derived human type I and type II interferon inducible gene modules were described previously (5, 8, 11). Two further modules were derived to distinguish type I or type II interferon inducible gene expression using published transcriptional data for primary human keratinocytes stimulated with a selection of cytokines (Gene Expression Omnibus accession: GSE36287) (68). Stimulus-specific modules for type I interferon (interferon alpha) and type II interferon (interferon gamma) were derived by selecting genes overexpressed >4-fold in the cognate cytokine condition relative to all non-cognate conditions combined (paired t test with α of p<0.05 without multiple testing correction), excluding genes also induced by each non-cognate cytokine condition, using the same criteria. The babelgene R package (https://igordot.github.io/babelgene/) was used to convert human gene symbols to their respective zebrafish orthologues to generate the zebrafish Stat1 module, excluding zebrafish genes reported to be an orthologue of more than one human gene. The zebrafish type I interferon inducible gene module comprises 360 genes whose expression was significantly induced (≥2 fold, adjusted p value <0.05) six hours after intravenous injection of three day post-fertilisation (dpf) larvae with 1 nl of recombinant interferon phi 1 protein (1 mg/ml) as previously described (16).
Venn diagrams
Area-proportional Venn diagrams visualizing the overlap between constituent genes in transcriptional modules were generated using BioVenn (69).
Zebrafish husbandry
Zebrafish were raised and maintained at 28.5°C following a 14/10 light/dark cycle according to standard protocols (70) in University College London (UCL) and Neuro-PSI Zebrafish Facilities. All work was approved by the British Home Office (Project License PP8780756) and the French Ministry for Research and Higher Education (Project Licence 39238). All experiments were performed in accordance with the relevant guidelines and regulations, on larvae up to five days post-fertilisation, before they are protected under the Animals (Scientific Procedures) Act. Adult zebrafish were spawned to collect embryos for experiments. Table S2 lists the lines used in this study (26, 71, 72). Embryos were maintained at 28.5°C in egg water containing 60 μg/ml Tropic Marin Sea Salt (Norwood Aquarium) and anaesthetised with egg water containing 200 µg/ml buffered tricaine (3-aminobenzoic acid ethyl ester) (Sigma-Aldrich) during injection, tail fin transection and imaging. Larvae used for in vivo imaging were maintained in egg water supplemented at 24 hours post-fertilisation (hpf) with 0.003% PTU (1-phenyl-2-thiourea) (Sigma-Aldrich) to inhibit melanisation.
CRISPant Generation
To maximise the likelihood of loss of functional protein we employed a previously described triple-exon targeting approach (73), with the modification that Cas9 protein was not diluted before use. CRISPR RNA (crRNA) (Alt-R CRISPR-Cas9 crRNA), tracrRNA (Alt-R CRISPR-Cas9 tracrRNA) and recombinant Cas9 protein (Alt-R Streptococcus pyogenes Cas9 Nuclease V3) were obtained from Integrated DNA Technologies (IDT). Three pre-designed crRNAs targeting distinct exons were selected for the stat2 gene, prioritizing those that targeted early, asymmetric (not divisible by three) exons common to both protein-coding transcripts, with low off-target and high on-target scores and in a genomic position which allowed design of primers flanking the target site to generate a ∼200 bp amplicon, for verification of mutagenesis by next generation sequencing (MiSeq). crRNA sequences and genomic locations are provided in Table S3. To generate gRNAs equal volumes of crRNA and tracrRNA diluted to 61 μM in Duplex buffer (IDT) were annealed at 95°C for 5 minutes. Equal volumes of gRNA (61 μM) and Cas9 protein (61 μM) were incubated at 37°C for 5 minutes to assemble separate 30.5 μM ribonucleoproteins (RNPs) for each target. Equal volumes of the three RNPs were pooled, prior to injection giving a final concentration of 10.2 μM for each. Negative control “scrambled” crRNAs computationally designed to be non-targeting (Alt-R CRISPRCas9 Negative Control crRNA #1, #2, #3) were prepared and pooled as described above. 0.5 nl of pooled stat2 RNPs or scrambled RNPs was injected into the yolk sac of single-cell stage wild type or transgenic zebrafish embryos before cell inflation as previously described (73). After injections embryos were transferred into sterile petri dishes with fresh egg water and maintained at 28.5°C to the appropriate age for experiments, as described below.
Interferon phi 1 stimulation
Three dpf stat2 CRISPant Tg(mxa:mCherry) larvae and scrambled RNP injected siblings were injected in the coelomic cavity with 1 nl of recombinant zebrafish interferon phi 1 protein (1.25 mg/ml), produced as previously described (74). stat2 and control CRISPants were shuffled so that injections were performed blindly. Bovine serum albumin (BSA) (1.25 mg/ml) was injected as a control. Interferon phi 1 and BSA solutions were supplemented with 0.2% phenol red (Sigma-Aldrich) to visualize the inoculum. Induction of Mxa reporter expression was determined by fluorescence microscopy 24 hours post-injection.
M. marinum infection
M. marinum M strain expressing the pmsp12 mWasabi (75) or psMT3 mCherry plasmid (76) was cultured using Middlebrook 7H10 agar (Becton Dickinson and Company) supplemented with 0.5% oleic acid/albumin/dextrose/catalase (OADC) (Becton Dickinson and Company), 0.5% glycerol (Sigma-Aldrich) and hygromycin (50 μg/ml) (Fisher Scientific). To generate bacterial suspensions for injection, M. marinum from agar plates was cultured statically at 28.5°C for 24-36 hours in 10 ml Middlebrook 7H9 broth (Becton Dickinson and Company) supplemented with 10% albumin/dextrose/catalase (ADC) (Becton Dickinson and Company) and hygromycin (50 μg/ml) (Fisher Scientific). Bacteria were harvested at mid-log growth (optical density (OD)600nm 0.7–1), (OD)600nm 1 representing 108 colony forming units (cfu)/ml, washed three times in phosphate buffered saline (PBS) (Gibco) and resuspended in 2% polyvinylpyrrolidone (PVP)40 (Sigma-Aldrich)/PBS containing 10% phenol red (Sigma-Aldrich) to aid visualisation of injections (77).
Zebrafish embryos were manually dechorionated using jeweller’s forceps (Dumont #5, World Precision Instruments) prior to bacterial infection. To generate systemic infection, 400 cfu M. marinum in a volume of 1 nl were microinjected into the caudal vein of embryos staged at 28-30 hpf. Four days post-infection, embryos were either subjected to fluorescence microscopy to measure disease severity or 25-30 per condition pooled and harvested in Qiazol (Qiagen) for evaluation of immune responses by RNAseq. To generate localised infection, 200 cfu M. marinum in a volume of 1 nl were injected into the hindbrain ventricle (HBV) of two dpf zebrafish larvae. Cellular recruitment was evaluated by fluorescence microscopy at 6-18 hours post-infection.
Tailfin transection
Tailfin transections were performed as described previously (26, 78). Briefly, live anaesthetised three dpf zebrafish larvae were transferred with a 1 ml Pasteur pipette (Scientific Laboratory Supplies) to a sterile petri dish containing 2% agarose (Bioline). Transections of the caudal fin were performed using a sterile microscalpel (World Precision Instruments) by applying steady downward pressure to create the incision at the boundary with the notochord. Following injury, larvae were rinsed in egg water, and maintained in individual wells of a 96-well tissue culture plate (TPP) at 28.5°C until live imaging to quantify neutrophil or macrophage recruitment to the site of injury at 1, 6 and 24 hours-post-wound (hpw).
Fluorescence microscopy
All fluorescence microscopy was performed on live, anasethised larvae. For quantitation of bacterial burden and dissemination four days post-infection, steady state neutrophil numbers at two dpf and cellular recruitment at 2-4 dpf, larvae were imaged using an M205FA stereofluorescence microscope (Leica) with a 1x objective. Larvae were imaged on a flat 1% agarose plate for quantitation of severity of mycobacterial infection and steady state neutrophil numbers and a 96-well tissue culture plate (TPP) in which the wells were coated with 1% agarose, for cellular recruitment to tailfin transection. Micro-wells were created in 1% agarose using a Stampwell Larvae 1 device (Idylle) for optimal positioning of 2-3 dpf larvae for imaging of cellular recruitment to hindbrain ventricle M. marinum infection. Brightfield and fluorescence images were captured using a DFC365 FX camera (Leica) and exported as 16-bit Tagged Image Format (TIF) files for analysis. For quantitation of steady state macrophage numbers three dpf larvae were imaged in 96-well alignment plates (Funakoshi) containing 75 μl egg water per well using a high-content wide-field fluorescence microscope (Hermes, IDEA Bio-Medical) with a 4x objective, capturing five z planes 50.6 μm apart in four contiguous fields of view per embryo. We created an ImageJ macro to semi-automate generation of single maximum intensity projection z-stack montages from the Hermes images, which were saved as 16-bit TIF files for analysis (Table S4. Mxa reporter expression was recorded with an EVOS M7000 microscope (ThermoFisher) equipped with a 2x/NA0.06 objective and Texas Red fluorescence cube. Larvae were positioned laterally in micro-wells created in 2% agarose with an in-house-printed stamp. A single fluorescence plane was recorded as the depth of field is sufficient to capture the entire larva. Transmitted light images were also recorded and both were exported as 16-bit TIF files.
Image analysis
To quantify bacterial burden, number of bacterial foci and spatial distribution of bacteria, images were analysed using custom QuantiFish software, previously developed in our group, to enable rapid quantitation of fluorescent foci in zebrafish larvae (79). Mxa reporter fluorescence was measured with ImageJ. Transmitted and fluorescence images were stacked. Images were slightly rotated to horizontality where necessary. A 600×100 pixel rectangular region of interest was defined on the transmitted light image to encompass the entire gut and liver areas. Mean fluorescence intensity was then measured on the fluorescence image. For quantitation of total cell number and recruitment of fluorescent immune cells to the site of tailfin transection we developed a new custom Python script which enables quantitation of integrated fluorescence within concentric zones (known as Sholl circles) (80). To achieve this, the script leverages the OpenCV, NumPy (81), Matplotlib, SciPy (http://www.scipy.org), scikit-image (82) and pandas Python libraries to facilitate image processing and data extraction (Table S5. First the user is prompted to select a folder containing images for analysis, as well as the file extensions corresponding to the brightfield and fluorescence images, which are essential for the subsequent steps. The script then iterates through each pair of brightfield and fluorescent images prompting the user to define two points on the brightfield image, which serve as the start and end positions between which to create concentric circular regions (80) on the corresponding fluorescence image. The script employs the image processing techniques described in Table S5 to quantify integrated fluorescence of the entire image and that within each region, as a surrogates for the total number of immune cells and the number at each spatial location, respectively and generates output files containing this information. Integrated fluorescence within the first zone (the wound site) and distance from the site of tailfin transection of the zone in which the maximal integrated fluorescence (a surrogate for the highest cell numbers) was detected (identified manually) were used as measures of immune cell recruitment for each zebrafish larva. The number of cells recruited to the site of hindbrain M. marinum infection was quantified by manual counting.
Genotyping by next generation sequencing
Genomic DNA extraction
Genomic DNA was extracted from anasethetised individual zebrafish larvae using the modified hot sodium hydroxide and Tris (HotSHOT) method as previously described (73, 83). Briefly, larvae in 96-well plates were incubated in 50 μl base solution (25 mM KOH, 0.2 mM EDTA in water) at 95°C for 30 min then cooled to room temperature, before addition of 50 μl neutralization solution (40 mM Tris-HCL in water).
PCR
Extracted DNA (2 μl per 50 μl reaction) was subjected to PCR using Phusion High-Fidelity DNA Polymerase (New England Biolabs) (98°C for 30 seconds (s), once only, 40 cycles of 98°C for 10 s, 59°C for 30 s, 72°C for 20 s, 72°C for 10 minutes, once only). Sequences and genomic positions of the primers used are provided in Table S3. Successful amplification of a single product of approximately 300 bp was verified by 2% agarose (Bioline) Tris-borate-EDTA (TBE) containing Redsafe (Chembio) gel electrophoresis. PCR products were purified using ExoSAP-IT Express (ThermoFisher), concentration of purified DNA determined by Nanodrop spectrophotometer (ThermoFisher) and samples diluted to 15-25 ng/μl for sequencing.
Next generation sequencing and data analysis
Sequencing was performed by the UCL Fish Genomic Service using the MiSeq Reagent Kit v2 (300-cycles) (Illumina) and the MiSeq system (Illumina) in paired end mode, according to the manufacturer’s instructions. Sequencing data were received as two FASTQ files (forward and reverse) for each sample. Visual sequence analysis was performed using Geneious 10.2.6 (https://www.geneious.com) for initial verification of successful genome editing close to the PAM site. Objective quantitation of the proportion of reads mutated and predicted to be frameshifted was achieved using the ampliCan R package (84), which utilizes the FASTQ files and a metadata file providing the amplicon, protospacer adjacent motif (PAM) and primer sequences (Table S3) and sample information, as previously described (73). AmpliCan was run with default parameters, with the exception that min_freq = 0.005 (mutations at a frequency below this threshold were considered a sequencing error) and average_quality = 25. Mutations present in uninjected or scrambled RNP injected control samples were not included as Cas9 induced mutations.
Statistical analysis
Statistical analysis was performed using GraphPad Prism 9.3.1 for Windows (GraphPad Software) and R3.6.3. Chi-squared tests were used to identify outlier genes statistically enriched in transcriptomic data from TST samples compared to saline samples, with no correction for multiple testing to avoid type 2 error. Spearman rank correlation was used to determine the strength of relationship between TST transcriptome gene expression and TB disease severity (two-tailed), transcript levels of upstream regulator target gene modules and TST random gene modules (one-tailed, positive correlation only) and mean expression of transcriptional modules in TST transcriptomic data with severity of TB disease (two-tailed). The Benjamini-Hochberg correction method for multiple comparisons was used to identify upstream regulator target gene modules with average correlation coefficients significantly higher than the mean of the distribution of equivalent sized randomly selected gene modules by ≥2 SD (Z score ≥2) with false discovery rate <0.05. Survival of stat2 CRISPant and control zebrafish larvae was compared using a log-rank (Mantel-Cox) test; a p value <0.05 was considered significant. On scatter plots horizontal lines represent median values and error bars indicate the interquartile range. On box and whisker plots horizontal lines represent median values, box limits indicate the interquartile range and whiskers extend between the 5th and 95th percentiles of the data. Two-tailed Mann-Whitney tests were used for two group comparisons of unpaired data; a p value <0.05 was considered significant. Where p values are not presented on graphs, * denotes p<0.05.
Data Availability
Human RNAseq data are already available in the ArrayExpress repository under accession number E-MTAB-6816. Zebrafish RNAseq data will be made available in ArrayExpress at the time of peer reviewed publication of the manuscript. The source code and manual for QuantiFish software used for analysis of bacterial burden and dissemination in zebrafish larvae are available at https://github.com/DavidStirling/QuantiFish. The ImageJ macro used to generate montages from the Hermes images and source code and manual for the Python script used for analysis of steady state cell numbers and cellular recruitment in zebrafish are available at https://github.com/AndSzyShe/particle_analysis. Source data for figure panels with fewer than 20 data points are available in Data file S1.
https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-MTAB-6816
Funding
UK Medical Research Council Clinician Scientist Fellowship MR/N007727/1 (GST)
Wellcome Trust Investigator Award 207511/Z/17/Z (MN)
Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society 105570/Z/14/Z (PME)
Fondation pour la Recherche Medicale grant EQU202203014646 (J-PL)
National Institute of Health Research Biomedical Research Centre Funding to University College Hospitals NHS Foundation Trust and University College London (GST, MN)
This research was funded in whole, or in part, by the Wellcome Trust [207511/Z/17/Z, 105570/Z/14/Z]. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Author contributions
Conceived the study: GST, MN
Collected the data: ASS, BSMDDE, JR, ET, GP, J-PL, GST
Provided resources or analysis tools: ASS, BSMDDE, JR, CTT, MCIL, HK, GP, PME, EMP, J-PL, MN, GST
Performed the data analysis / interpretation: ASS, BSMDDE, JR, CTT, GP, PME, EMP, J-PL, MN, GST
Prepared the manuscript draft: GST, ASS, BSMDDE
All authors reviewed and contributed to the final manuscript.
Competing interests
The authors declare that they have no competing interests.
Data and materials availability
Human RNAseq data are already available in the ArrayExpress repository under accession number E-MTAB-6816. Zebrafish RNAseq data will be made available in ArrayExpress at the time of peer reviewed publication of the manuscript. The source code and manual for QuantiFish software used for analysis of bacterial burden and dissemination in zebrafish larvae are available at https://github.com/DavidStirling/QuantiFish. The ImageJ macro used to generate montages from the Hermes images and source code and manual for the Python script used for analysis of steady state cell numbers and cellular recruitment in zebrafish are available at https://github.com/AndSzyShe/particle_analysis. Source data for figure panels with fewer than 20 data points are available in Data file S1.
Acknowledgements
We thank Michelle Berin and Victoria Dean (Clinical Research Nurses, UCL Division of Infection and Immunity) and Mylah Ramirez (Research Nurse, Barts Health NHS Trust) for support with research governance, participant recruitment and sample collection. We thank the UCL Zebrafish Facility staff, particularly Heather Callaway, Jenna Hakkesteeg, Karen Dunford, Carole Wilson and Elise Hitchcock for maintenance of the zebrafish lines used in this study, especially during the first wave of the COVID-19 pandemic. We thank UCL Genomics for library preparation and sequencing of samples for transcriptional profiling (www.ucl.ac.uk/child-health/research/genetics-and-genomic-medicine/ucl-genomics). We thank Alex Lubin and Katie-Jo Miller for next generation sequencing of zebrafish samples for genotyping. This work has benefited from the facilities and expertise of CELPHEDIA-TEFOR, UAR2010 TEFOR Paris-Saclay (TEFOR Infrastructure – Investissement d’avenir – ANR-II-INBS-0014) and from the Recombinant Protein Platform of the Institut Pasteur, Paris.
References and Notes
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.↵
- 9.↵
- 10.↵
- 11.↵
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- 18.
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.
- 31.↵
- 32.↵
- 33.
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.↵
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.
- 53.
- 54.
- 55.↵
- 56.↵
- 57.
- 58.
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.↵