Upper airway gene expression reveals a more robust innate and adaptive immune response to SARS-CoV-2 in children compared with older adults =========================================================================================================================================== * Eran Mick * Alexandra Tsitsiklis * Natasha Spottiswoode * Saharai Caldera * Paula Hayakawa Serpa * Angela M. Detweiler * Norma Neff * Angela Oliveira Pisco * Lucy M. Li * Hanna Retallack * Kalani Ratnasiri * Kayla M. Williamson * Victoria Soesanto * Eric A. F. Simões * Amy Kistler * Brandie D. Wagner * Joseph L. DeRisi * Lilliam Ambroggio * Peter M. Mourani * Charles R. Langelier ## Abstract Unlike other respiratory viruses, SARS-CoV-2 disproportionately causes severe disease in older adults and only rarely in children. To investigate whether differences in the upper airway immune response could contribute to this disparity, we compared nasopharyngeal gene expression in 83 children (<19-years-old; 38 with SARS-CoV-2, 11 with other respiratory viruses, 34 with no virus) and 154 adults (>40-years-old; 45 with SARS-CoV-2, 28 with other respiratory viruses, 81 with no virus). Expression of interferon-stimulated genes (ISGs) was robustly activated in both children and adults with SARS-CoV-2 compared to the respective non-viral groups, with only relatively subtle distinctions. Children, however, demonstrated markedly greater upregulation of pathways related to B cell and T cell activation and proinflammatory cytokine signaling, including TNF, IFNγ, IL-2 and IL-4 production. Cell type deconvolution confirmed greater recruitment of B cells, and to a lesser degree macrophages, to the upper airway of children. Only children exhibited a decrease in proportions of ciliated cells, the primary target of SARS-CoV-2, upon infection with the virus. These findings demonstrate that children elicit a more robust innate and adaptive immune response to SARS-CoV-2 infection in the upper airway that likely contributes to their protection from severe disease in the lower airway. Keywords * COVID-19 * SARS-CoV-2 * coronavirus * age * children * adults * transcriptomics * nasopharyngeal ## Introduction One of the defining features of the COVID-19 pandemic has been the striking relationship between disease severity and age1–3. While infection with other respiratory viruses, such as influenza or respiratory syncytial virus (RSV), causes significant morbidity and mortality in both young children and older adults3–9, severe COVID-19 occurs disproportionately in older adults and only very rarely in children1,3,10–14. A recent comprehensive modeling study estimated that the infection fatality rate is lowest for children ages 5-9 (∼0.001%) and that even adults in their 40s may already be at 100-fold greater risk of death from COVID-191. The age-dependent effect on disease severity and mortality has been shown even when accounting for age-associated comorbidities15. A few studies have examined differences in systemic immunological profiles of hospitalized children and adults with COVID-19, revealing greater breadth and neutralizing activity of SARS-CoV-2 specific antibodies as well as stronger CD4+ T cell responses to viral spike protein in adults16,17. These studies suggested that poorer outcomes in adults are not due to failure to engage a systemic adaptive immune response, and that children may exhibit a less pronounced adaptive response because their infection course is milder. One study reported higher serum levels of IL-17A and interferon-γ (IFNγ) in children early in their hospitalization17, suggesting a more robust innate immune response may contribute to the milder disease course. Our understanding of age-related differences in the immune response at the site of initial infection, the upper airway, remains limited. Several large-scale studies have found no systematic differences between children and adults in the distribution of SARS-CoV-2 viral load measured in nasopharyngeal (NP) swabs18,19, suggesting children are not generally better able to control viral replication in the upper airway. However, differences in the upper airway microenvironment and immune response could still contribute to protection from severe disease in children, for example, by limiting migration of the virus into the lower airway. Small-scale studies have begun to compare the upper airway immune response to SARS-CoV-2 in children and adults, with some contradictory results. One study found that children with COVID-19 expressed higher levels of genes associated with innate immune pathways, including interferon-stimulated genes and genes related to NLRP3 inflammasome signaling20. Another study, however, found no age-related differences in interferon-stimulated gene expression and reported globally similar host transcriptional responses between adults and children with diverse types of viral infections21, highlighting the need for further investigation. Importantly, neither study directly controlled for SARS-CoV-2 viral load when comparing immune-related gene expression between children and adults. We previously used metagenomic RNA-sequencing of NP swabs to compare upper airway gene expression in adult patients with COVID-19, other viral acute respiratory illnesses or non-viral illnesses22. Our analysis revealed a pronounced interferon response in COVID-19 patients, proportional to SARS-CoV-2 viral load, but attenuated activation of additional innate immune and pro-inflammatory pathways compared to patients with other viral infections22. Here, we report new sequencing data from a relatively large pediatric cohort to enable assessment of age-related differences in the upper airway transcriptional response to SARS-CoV-2 infection and shed further light on these outstanding questions. When controlling for viral load, our results suggest that differences in the overall magnitude of interferon-stimulated gene expression in the upper airway of children and adults with COVID-19 are relatively subtle and seem unlikely to explain their distinct clinical outcomes. However, we also find clear evidence of more robust pro-inflammatory and adaptive immune responses in the upper airway of children, which may contribute to their protection from severe disease. ## Results and Discussion To compare the upper airway gene expression response to SARS-CoV-2 infection in children and adults, we utilized our previously published dataset of NP swab RNA-sequencing from an adult cohort alongside newly sequenced swabs from a pediatric cohort. All samples in both cohorts were obtained in the course of clinical testing for SARS-CoV-2 by reverse transcription polymerase chain reaction (RT-PCR) at the University of California San Francisco or Children’s Hospital Colorado. We included patients up to 19 years of age in the pediatric cohort and restricted the adult cohort to those 40 years of age or older to impose clearer age separation. As in our previous analysis, we divided each age cohort into three viral status groups: 1) patients with PCR-confirmed SARS-CoV-2 infection (“SARS-CoV-2” group), 2) patients negative for SARS-CoV-2 by PCR with no other pathogenic respiratory virus detected by metagenomic RNA sequencing (“No Virus” group), and 3) patients negative for SARS-CoV-2 who had another respiratory virus detected by sequencing (“Other Virus” group). Finally, we limited the samples in the SARS-CoV-2 group to those with at least 10 viral reads-per-million (rpM), comparable to PCR Ct values below 3022, which have been associated with recovery of actively replicating virus from respiratory specimens23–25. The final dataset included 83 children (38 SARS-CoV-2, 34 No Virus, 11 Other Virus; median age 4, IQR 2-12) and 154 adults (45 SARS-CoV-2, 81 No Virus, 28 Other Virus; median age 62, IQR 47-71) (**Figure 1A,B; Supplemental Table 1A-C**). There were no statistically significant differences between children and adults with respect to the site of clinical testing (**Supplemental Table 1A**). Most of the patients in the SARS-CoV-2 group in both age cohorts were tested as outpatients, indicative of an early/mild stage of disease (**Supplemental Table 1A**). Samples in the SARS-CoV-2 group in both age cohorts spanned several orders of magnitude of viral load, and while viral load trended higher in the children this did not reach statistical significance (**Figure 1C**). Rhinovirus was the most prevalent other respiratory virus detected in both age cohorts and a diversity of other viruses were also identified (**Figure 1D**). ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/07/19/2021.07.15.21260285/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/F1) Figure 1: Cohort characteristics **A)** Number of patients in the SARS-CoV-2, No Virus and Other Virus groups in the adult and pediatric cohorts. **B)** Age distribution across the three viral status groups in the adult and pediatric cohorts. **C)** Distribution of SARS-CoV-2 viral load, measured in reads-per-million (rpM), in adult and pediatric SARS-CoV-2 patients. P-value derives from a two-sided Mann-Whitney test. **D)** Distribution of viruses in the Other Virus groups in the adult and pediatric cohorts. Absolute numbers are provided above each bar while the y-axis indicates percentage out of each cohort’s Other Virus group. One child had both influenza and rhinovirus and two adults had both RSV and rhinovirus. We began by performing differential expression (DE) analyses between the SARS-CoV-2 and No Virus groups within each age cohort separately. This approach minimizes confounding by age-related gene expression differences unrelated to SARS-CoV-2 infection and any potential batch effects, though it could be affected by differences among the patients in each cohort’s No Virus group. The analyses yielded 1,961 and 1,216 differentially expressed genes at an adjusted p-value < 0.1 for the pediatric and adult cohorts, respectively (**Supplemental Data 1**). We then performed gene set enrichment analyses26 (GSEA) using Gene Ontology (GO) biological process annotations27 on the DE results in each cohort and compared the enriched pathways. As expected, a range of immune related pathways were upregulated in both adults and children with SARS-CoV-2 compared to those with no virus (**Figure 2A; Supplemental Data 2**). Pathways related to the interferon response appeared as a whole to be strongly induced in both children and adults. Children, however, demonstrated stronger upregulation of the B cell and T cell activation pathways, macrophage activation and phagocytosis, and several cytokine production pathways, including IL-2, IL-4, TNF and IFNγ. A few immune pathways showed inconsistent expression changes compared to the respective No Virus groups, including neutrophil mediated immunity and mast cell mediated immunity. Intriguingly, both children and adults with SARS-CoV-2 infection exhibited downregulation of olfactory receptor gene expression (sensory perception of chemical stimulus pathway), consistent with the loss of sense of smell clinically observed in both age cohorts28,29. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/07/19/2021.07.15.21260285/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/F2) Figure 2: Pathways activated in children and adults upon SARS-CoV-2 infection. **A)** Normalized enrichment scores of selected GO biological process terms that reached statistical significance (Benjamini-Hochberg adjusted P-value < 0.05) in the GSEA using DE genes between the SARS-CoV-2 and No Virus groups in either the adult or pediatric cohort. Statistical significance is denoted by a black outline around the circle. **B)** Normalized enrichment scores for the same GO terms as in (A) in the GSEA using DE genes between children and adults with SARS-CoV-2. Dark color bars represent pathways that reached statistical significance (Benjamini-Hochberg adjusted P-value < 0.05). We complemented the analyses comparing SARS-CoV-2 and No Virus patients in each cohort separately by directly comparing gene expression between SARS-CoV-2 infected children and adults, controlling for viral load. We identified 5,352 differentially expressed genes at an adjusted p-value < 0.1 (**Supplemental Data 3**), and GSEA of the DE results yielded overall similar patterns to those described above (**Figure 2B; Supplemental Data 4**). In particular, B cell related pathways (B cell activation, humoral immune response), T cell related pathways (T cell activation, IFNγ production, IL-2 and IL-4 production) and chemokine/cytokine signaling were more highly expressed in children with COVID-19. While some immune pathways did not reach statistical significance in the direct comparison between children and adults with SARS-CoV-2, they typically trended in the same direction (**Figure 2B**). On the other hand, the stark disparity in neutrophil activation observed in the comparison to the No Virus groups was only weakly supported in the direct comparison, likely reflecting differences among the No Virus patients themselves. The direct comparison clearly revealed lower expression of cilia-associated genes in children with SARS-CoV-2 and also suggested a trend toward lower expression of interferon-stimulated genes, though the pathway just missed the statistical significance cutoff. As expected, many developmental processes unrelated to infection also differed in the direct comparison between children and adults (**Supplemental Data 4**). Many of the pathways identified in the GSEA results as differentially expressed between children and adults with COVID-19 were tightly related to particular cell types. We therefore applied *in silico* estimation of cell type proportions30 based on marker genes derived from an airway single-cell study31 as an additional approach to contextualize our findings (**Figure 3, Figure 3–figure supplement 1; Supplemental Data 5**). Consistent with the GSEA results, we found that SARS-CoV-2 infection triggered significantly greater recruitment of B cells to the upper airway in children compared to adults, which was also evident in the comparison between children and adults with other respiratory viruses (**Figure 3A**). In contrast, differences in estimated T cell proportions were much subtler (**Figure 3–figure supplement 1**), suggesting the GSEA results reflected distinctions in T cell identity and regulation rather than cell number. ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/07/19/2021.07.15.21260285/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/F3) Figure 3: Cell-type proportion differences between children and adults. *In silico* estimation of cell-type proportions in the bulk RNA-sequencing using single-cell signatures. Black lines denote the median. The y-axis in each panel was trimmed at the maximum value among all groups of 1.5*IQR above the third quartile, where IQR is the interquartile range. For each cell type, we formally compared each viral status group between the two age cohorts as well as the No Virus and SARS-CoV-2 groups within each age cohort. Pairwise comparisons were performed with a two-sided Mann-Whitney test followed by Holm’s correction for multiple testing. We previously observed in our adult study that infection with SARS-CoV-2 was associated with blunted recruitment of macrophages and neutrophils to the upper airway as compared to other respiratory viruses22, and these trends appear to re-capitulate in children despite the limited sample size of the pediatric Other Virus group (**Figure 3B,C**). Nevertheless, macrophage proportions trended higher in children with SARS-CoV-2 compared to adults (**Figure 3B**), as did those of dendritic cells (**Figure 3D**). Intriguingly, while proportions of ciliated cells did not differ between children and adults in the No Virus group, children with SARS-CoV-2 exhibited a marked decrease in ciliated cell proportions that was entirely absent in the adults (**Figure 3E**), consistent with the GSEA findings. This was accompanied by greater proportions of basal cells in children compared to adults with SARS-CoV-2 (**Figure 3F**). Recent studies have found that ciliated cells are a major target for SARS-CoV-2 at the onset of infection32,33, and a possible interpretation of our results is that children are better able to turn over and clear infected ciliated cells in the nasopharynx, potentially helping them more effectively control the infection. Finally, we examined the effect of SARS-CoV-2 viral load on gene expression in pathways of interest within each age cohort using robust regression. Expression of interferon-stimulated genes (ISGs) frequently correlates with viral load, as we observed in our previous analysis in adults22. Children and adults exhibited similar trends for several of the ISGs that most strongly correlated with viral load in our previous analysis, such as *CXCL11* and *OASL*, though the slope trended higher in children (**Figure 4A**). However, some of the ISGs that were strongly differentially expressed in adults with COVID-19 compared to those with non-viral illnesses exhibited much more gradual activation in the pediatric cohort, such that only children with high viral load achieved comparable expression to that of adults (e.g., *IFI6, IFI27*; **Figure 4B**). The “lagging” response of these ISGs in children with lower SARS-CoV-2 viral load explains the trend toward lower pathway expression observed in the GSEA results (**Figure 2B**). These findings suggest differences in ISG-specific regulation and/or cellular origins between children and adults that defy simple generalization. While the significance of such subtle distinctions is unclear, they appear less likely to drive the dramatic outcome disparities. ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/07/19/2021.07.15.21260285/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/F4) Figure 4: Relationship of selected immune pathway marker genes to SARS-CoV-2 viral load in children and adults. Scatter plots of normalized gene counts (log2 scale, y-axis) as a function of SARS-CoV-2 viral load (log10(rpM), x-axis) in each age cohort. The viral status group is indicated by the dot color. Robust regression was performed on SARS-CoV-2 patients to characterize the relationship to viral load. Shaded bands represent 95% confidence intervals. Numerical results listed for each gene refer to, from top to bottom: the regression slope, the p-value for the difference of the slope from 0, and the adjusted robust coefficient of determination (R2). **A)** Canonical type I interferon response genes showing high correlation to viral load in adults and children. **B)** Interferon-stimulated genes showing a more gradual response to viral load in children compared to adults. **C)** B cell marker genes. In stark contrast to ISGs, the expression of several B cell marker genes (e.g. *CD22, CD79A)* was weakly correlated with viral load, especially in children (**Figure 4C**). These genes exhibited significant heterogeneity between patients, likely reflecting the timing of activation of the B cell response, but the fraction of children who were engaging the response at the time of sampling was clearly greater. Our study has several limitations that should be kept in mind: 1) a larger sample size would have increased the generalizability of our findings; 2) we could not directly control for timing since onset of infection; 3) proteomic measurements may have provided additional insight but were not possible from the available swab specimens, which were collected into a virus inactivating agent; 4) our findings regarding tissue cellular composition require validation by single-cell sequencing; and lastly, 5) the majority of subjects with COVID-19 had mild disease at the time of sampling and did not require hospitalization. Results may have differed if specimens from a greater proportion of severely ill individuals had been available, however, this likely resulted in a more relevant comparison since few children develop severe disease. Taken together, our findings suggest children elicit a more robust pro-inflammatory and adaptive immune response to SARS-CoV-2 infection in the upper airway compared to adults but an overall comparable type I interferon response. Children, unlike adults, also display decreased proportions of ciliated cells, the primary targets of SARS-CoV-2, upon infection with the virus. These differences in the upper airway immune response and tissue composition may both contribute to the protection of children from severe disease in the lower respiratory tract. Further study is warranted to unravel why children are better protected specifically against SARS-CoV-2, or perhaps more broadly against β-coronaviruses34, as compared to other common respiratory viruses. ## Materials and Methods ### Study design and clinical cohort We previously conducted and published an observational cohort study of adult patients with acute respiratory illnesses tested for COVID-19 by RT-PCR at the University of California, San Francisco (UCSF), leveraging leftover RNA extracted from clinical NP swab specimens22. The UCSF Institutional Review Board (IRB) granted a waiver of consent under IRB protocol #17-24056. For the novel analyses presented here, we leveraged the published adult dataset while supplementing it with pediatric samples similarly obtained at UCSF under the same protocol. Additionally, pediatric samples were obtained from patients tested for COVID-19 by RT-PCR from NP swabs at Children’s Hospital Colorado (CHCO). CHCO specimens and data were obtained under IRB protocols #0865, #20-1617 and #20-0972, which also granted a waiver of consent. Demographic and clinical data for all patients were obtained from a combination of electronic and manual abstraction of medical records at the respective institutions. Children up to 19 years of age and adults at least 40 years of age were eligible for inclusion in the present analysis. Enrollment was not restricted based on sex or race. Some samples were ultimately excluded based on sequencing metrics, as described in the following sections. ### Sample processing Excess clinical swab specimens were stored in viral transport media at −80°C in the Clinical Microbiology Laboratories. Specimens were thawed and 200uL aliquots of specimen were added to 200uL of DNA/RNA Shield (Zymo Research, Irvine, CA) in sterile 1.5 mL microtubes with appropriate biohazard precautions. ### Metagenomic RNA sequencing Specimens underwent RNA extraction and metagenomic sequencing, as previously described22. Briefly, RNA was extracted from 200μL of specimen in DNA/RNA shield using bead-based lysis and the Zymo Pathogen Magbead kit (Zymo). We also processed negative control samples (water and HeLa cell RNA) to account for background contamination. All samples were spiked with RNA standards from the External RNA Controls Consortium (ERCC)35. Depletion of cytosolic and mitochondrial rRNA was performed using Fastselect (Qiagen, Germantown, MD). RNA was reverse transcribed to generate cDNA and used to construct sequencing libraries using the NEBNext Ultra II Library Prep Kit (New England Biolabs, Ipswich, MA). Libraries underwent 146 nucleotide paired-end sequencing on an Illumina Novaseq 6000 instrument. ### Metagenomic analysis of respiratory viruses Samples were processed through the IDSeq pipeline36,37, which performs reference based alignment at both the nucleotide and amino acid level against sequences in the National Center for Biotechnology Information (NCBI) nucleotide (NT) and non-redundant (NR) databases, respectively, followed by assembly of the reads matching each taxon detected. We further processed the results for viruses with established pathogenicity in the respiratory tract38. We evaluated whether one of these viruses was present in a patient sample if it met the following three initial criteria: (i) at least 10 counts mapped to NT sequences, (ii) at least 1 count mapped to NR sequences, (iii) average assembly nucleotide alignment length of at least 70 bp. Negative control (water and HeLa cell RNA) samples enabled estimation of the number of background reads expected for each virus, which were normalized by input mass as determined by the ratio of sample reads to spike-in ERCC RNA standards. Viruses were then additionally tested for whether the number of sequencing reads aligned to them in the NT database was significantly greater than background. This was done by modeling the number of background reads as a negative binomial distribution, with mean and dispersion fitted on the negative controls. We estimated the mean parameter of the negative binomial for each taxon (virus) by averaging the read counts across all negative controls after normalizing by ERCCs. We estimated a single dispersion parameter across all taxa using the functions glm.nb() and theta.md() from the R package MASS. We considered a sample to have a pathogenic respiratory virus detected by sequencing if the virus achieved an adjusted p-value < 0.05 after Holm’s correction for all tests performed in the same sample. We used the IDSeq-calculated viral reads-per-million (rpM), based on the NT alignment, as a uniform measure of SARS-CoV-2 abundance across all samples. A value of 0.1 rpM was added to all samples with rpM < 0.1. ### Assignment of samples to comparator groups As in our previous analysis22, we divided the samples in each age cohort into three viral status groups: 1) samples with a positive clinical PCR test for SARS-CoV-2 were assigned to the “SARS-CoV-2” group; 2) samples with a negative PCR test for SARS-CoV-2 and no evidence of another pathogenic respiratory virus in the metagenomic sequencing were assigned to the “No Virus” group; and 3) samples with a negative PCR test for SARS-CoV-2 but another respiratory virus detected by sequencing were assigned to the “Other Virus” group. We retained for analysis only samples in the SARS-CoV-2 groups with at least 10 rpM, roughly corresponding to PCR Ct values below 3022, to focus on cases with likely active viral replication, where a clear transcriptional response to the virus is expected to be found. This approach was based on the well-established correlation between viral load and recovery of actively replicating virus from respiratory specimens, and more specifically the finding that specimens with a PCR Ct < 30 are associated with the ability to culture SARS-CoV-223–25. ### Human gene expression quantification Following demultiplexing, sequencing reads were pseudo-aligned with kallisto39 (v. 0.46.1; including bias correction) to an index consisting of all transcripts associated with human protein coding genes (ENSEMBL v.99), cytosolic and mitochondrial ribosomal RNA sequences, and the sequences of ERCC RNA standards. Samples were retained for analysis if they had at least 400,000 estimated counts associated with transcripts of protein coding genes. Gene-level counts (n=19,939) were generated from the transcript-level abundance estimates using the R package tximport40, with the scaledTPM method. ### Differential expression (DE) analyses Genes were retained for each DE analysis if they had at least 10 counts in at least 20% of the samples included in the analysis. All analyses were performed with the R package limma41, using quantile normalization and the voom method. The design formula for the comparisons within each age cohort was ∼viral status, where viral status was either “SARS-CoV-2” or “No Virus”. The design formula for the direct comparison between children and adults with SARS-CoV-2 was ∼log10(rpM) + age cohort, where age cohort was either “children” or “adults”. ### Gene set enrichment analyses (GSEA) Gene set enrichment analysis was based on Gene Ontology (GO) biological process pathway annotations27, using the non-redundant version available through WebGestalt42. Only pathways with a minimum size of 10 genes and a maximum size of 1,500 genes were retained for analysis. The analysis was performed using the fgseaMultilevel function in the R package fgsea43, which calculates p-values based on an adaptive, multilevel splitting Monte Carlo scheme. The input consisted of all genes in the respective DE analysis, pre-ranked by fold-change. The gene sets shown in **Figure 2** were manually selected to reduce redundancy and highlight diverse biological functions from among those with a Benjamini-Hochberg adjusted p-value < 0.05 in at least one of the analyses. Full results are provided in **Supplemental Data 2** and **Supplemental Data 4**. ### *In silico* estimation of cell type proportions Cell-type proportions were estimated using the CIBERSORT X algorithm30 based on single cell signatures derived from the human lung cell atlas31. Differences in estimated proportions between comparator groups were evaluated for statistical significance using a Mann-Whitney test with Holm’s correction for multiple testing. ### Regression of gene counts against viral load We performed robust regression of the limma-generated quantile normalized gene counts (log2 scale) against log10(rpM) of SARS-CoV-2 for selected genes. The analysis was performed within each age cohort separately using the R package robustbase44, which implements MM-type estimators for linear regression45,46, the KS2014 setting and the model: quantile normalized counts (log2 scale) ∼ log10(rpM). Model predictions were generated using the R package ggeffects and used for display in the individual gene plots. Error bands represent normal distribution 95% confidence intervals around each prediction. Reported p-values for significance of the difference of the regression coefficient from 0 are based on a t-statistic. Reported R2 values represent the adjusted robust coefficient of determination47. ## Supporting information Supplementary Data 1 [[supplements/260285_file03.xlsx]](pending:yes) Supplementary Data 2 [[supplements/260285_file04.xlsx]](pending:yes) Supplementary Data 3 [[supplements/260285_file05.xlsx]](pending:yes) Supplementary Data 4 [[supplements/260285_file06.xlsx]](pending:yes) Supplementary Data 5 [[supplements/260285_file07.xlsx]](pending:yes) ## Data Availability Gene counts and sample metadata have been deposited under NCBI GEO accession GSE179277. The published human lung single-cell datasets used for cell-type proportions analysis can be obtained through Synapse under accessions syn21560510 and syn21560511. ## Data availability Gene counts and sample metadata have been deposited under NCBI GEO accession GSE179277. The published human lung single-cell datasets used for cell-type proportions analysis can be obtained through Synapse under accessions syn21560510 and syn21560511. ## Funding NHLBI K23HL138461-01A1 (CL); Chan Zuckerberg Biohub (AD, AOP, NN, JLD); COVID-Child Health Research Award by the Research Institute at Children’s Hospital Colorado; philanthropic support from Mark and Carrie Casey, Julia and Kevin Hartz, Carl Kawaja and Wendy Holcombe, Eric Keisman and Linda Nevin, Martin and Leesa Romo, Diana Wagner, Jerry Yang and Akiko Yamazaki, and Three Sisters Foundation. ## Supplemental Tables View this table: [Supplemental Table 1A:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/T1) Supplemental Table 1A: Adult vs Pediatric cohort characteristics. View this table: [Supplemental Table 1B:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/T2) Supplemental Table 1B: Adult cohort clinical and demographic characteristics. View this table: [Supplemental Table 1C:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/T3) Supplemental Table 1C: Pediatric cohort clinical and demographic characteristics. ## Supplemental Data **Supplemental Data 1**. Differential expression results between the SARS-CoV-2 and No Virus groups in adults and children separately. **Supplemental Data 2**. GSEA applied to the differential expression results between the SARS-CoV-2 and No Virus groups in adults and children separately. **Supplemental Data 3**. Differential expression results directly comparing children and adults with SARS-CoV-2, controlling for viral load. **Supplemental Data 4**. GSEA applied to the direct comparison between children and adults with SARS-CoV-2. **Supplemental Data 5**. *In silico* deconvolution of cell type proportions. ## Figures ![Figure 3–figure supplement 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/07/19/2021.07.15.21260285/F5.medium.gif) [Figure 3–figure supplement 1:](http://medrxiv.org/content/early/2021/07/19/2021.07.15.21260285/F5) Figure 3–figure supplement 1: Additional cell-types. Additional cell types included in the cell-type proportions analysis. ## Acknowledgements We would like to thank Sam Dominguez, MD, PhD, Kirk Harris, PhD, Aline Maddux, MD, Christina Osborne, MD, and Matthew Leroue, MD, for their input on study design and review of the data. ## Footnotes * ‡ These authors jointly supervised this work * Received July 15, 2021. * Revision received July 15, 2021. * Accepted July 19, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.O’Driscoll, M. et al. Age-specific mortality and immunity patterns of SARS-CoV-2. Nature 590, 140–145 (2021). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 2. 2.Levin, A. T. et al. Assessing the age specificity of infection fatality rates for COVID-19: systematic review, meta-analysis, and public policy implications. Eur J Epidemiol 35, 1123– 1138 (2020). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 3. 3.Woolf, S. H., Chapman, D. A. & Lee, J. H. COVID-19 as the Leading Cause of Death in the United States. JAMA (2020) doi:10.1001/jama.2020.24865. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2020.24865&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33331845&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 4. 4.Shi, T. et al. Global, regional, and national disease burden estimates of acute lower respiratory infections due to respiratory syncytial virus in young children in 2015: a systematic review and modelling study. The Lancet 390, 946–958 (2017). 5. 5.Nair, H. et al. Global burden of respiratory infections due to seasonal influenza in young children: a systematic review and meta-analysis. Lancet 378, 1917–1930 (2011). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(11)61051-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22078723&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000298136200026&link_type=ISI) 6. 6.Principi, N. & Esposito, S. Severe influenza in children: incidence and risk factors. Expert Review of Anti-infective Therapy 14, 961–968 (2016). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27560100&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 7. 7.Matias, G. et al. Estimates of hospitalization attributable to influenza and RSV in the US during 1997–2009, by age and risk status. BMC Public Health 17, 271 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12889-017-4177-z&link_type=DOI) 8. 8.Tin Tin Htar, M., Yerramalla, M. S., Moïsi, J. C. & Swerdlow, D. L. The burden of respiratory syncytial virus in adults: a systematic review and meta-analysis. Epidemiol Infect 148, e48. 9. 9.Clinical Aspects of Pandemic 2009 Influenza A (H1N1) Virus Infection | NEJM. [https://www.nejm.org/doi/10.1056/NEJMra1000449](https://www.nejm.org/doi/10.1056/NEJMra1000449). 10. 10.Ludvigsson, J. F. Systematic review of COVID-19 in children shows milder cases and a better prognosis than adults. Acta Paediatrica 109, 1088–1095 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/apa.15270&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 11. 11.Bhopal, S. S., Bagaria, J., Olabi, B. & Bhopal, R. Children and young people remain at low risk of COVID-19 mortality. The Lancet Child & Adolescent Health 5, e12–e13 (2021). 12. 12.Bailey, L. C. et al. Assessment of 135 794 Pediatric Patients Tested for Severe Acute Respiratory Syndrome Coronavirus 2 Across the United States. JAMA Pediatr 175, 176 (2021). 13. 13.Xu, Y. et al. Characteristics of pediatric SARS-CoV-2 infection and potential evidence for persistent fecal viral shedding. Nat Med 26, 502–505 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-020-0817-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 14. 14.Dong, Y. et al. Epidemiology of COVID-19 Among Children in China. Pediatrics 145, (2020). 15. 15.Williamson, E. J. et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature 584, 430–436 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2521-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 16. 16.Weisberg, S. P. et al. Distinct antibody responses to SARS-CoV-2 in children and adults across the COVID-19 clinical spectrum. Nat Immunol 22, 25–31 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41590-020-00826-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 17. 17.Pierce, C. A. et al. Immune responses to SARS-CoV-2 infection in hospitalized pediatric and adult patients. Science Translational Medicine 12, (2020). 18. 18.Madera, S. et al. Nasopharyngeal SARS-CoV-2 viral loads in young children do not differ significantly from those in older children and adults. Sci Rep 11, 3044 (2021). 19. 19.Chung, E. et al. Comparison of Symptoms and RNA Levels in Children and Adults With SARS-CoV-2 Infection in the Community Setting. JAMA Pediatrics (2021) doi:10.1001/jamapediatrics.2021.2025. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jamapediatrics.2021.2025&link_type=DOI) 20. 20.Pierce, C. A. et al. Natural mucosal barriers and COVID-19 in children. JCI Insight 6, (2021). 21. 21.Koch, C. M. et al. Immune response to SARS-CoV-2 in the nasal mucosa in children and adults. [http://medrxiv.org/lookup/doi/10.1101/2021.01.26.21250269](http://medrxiv.org/lookup/doi/10.1101/2021.01.26.21250269) (2021) xdoi:10.1101/2021.01.26.21250269. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMS4wMS4yNi4yMTI1MDI2OXYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDcvMTkvMjAyMS4wNy4xNS4yMTI2MDI4NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 22. 22.Mick, E. et al. Upper airway gene expression reveals suppressed immune responses to SARS-CoV-2 compared with other respiratory viruses. Nat Commun 11, 5854 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-19587-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33203890&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 23. 23.Singanayagam, A. et al. Duration of infectiousness and correlation with RT-PCR cycle threshold values in cases of COVID-19, England, January to May 2020. Eurosurveillance 25, 2001483 (2020). 24. 24.La Scola, B. et al. Viral RNA load as determined by cell culture as a management tool for discharge of SARS-CoV-2 patients from infectious disease wards. Eur J Clin Microbiol Infect Dis 39, 1059–1061 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/sl0096-020-03913-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 25. 25.Yamada, S. et al. Assessment of SARS-CoV-2 infectivity of upper respiratory specimens from COVID-19 patients by virus isolation using VeroE6/TMPRSS2 cells. BMJ Open Respir Res 8, e000830 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qcmVzcCI7czo1OiJyZXNpZCI7czoxMToiOC8xL2UwMDA4MzAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wNy8xOS8yMDIxLjA3LjE1LjIxMjYwMjg1LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 26. 26.Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102, 15545–15550 (2005). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTAyLzQzLzE1NTQ1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDcvMTkvMjAyMS4wNy4xNS4yMTI2MDI4NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 27. 27.Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25, 25–29 (2000). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/75556&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10802651&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000086884000011&link_type=ISI) 28. 28.Giacomelli, A. et al. Self-reported Olfactory and Taste Disorders in Patients With Severe Acute Respiratory Coronavirus 2 Infection: A Cross-sectional Study. Clinical Infectious Diseases 71, 889–890 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=32215618&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 29. 29.Kumar, L. et al. Loss of smell and taste in COVID-19 infection in adolescents. Int J Pediatr Otorhinolaryngol 142, 110626 (2021). 30. 30.Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods 12, 453–457 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nmeth.3337&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25822800&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 31. 31.Travaglini, K. J. et al. A molecular cell atlas of the human lung from single-cell RNA sequencing. Nature 587, 619–625 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2922-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33208946&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 32. 32.Ravindra, N. G. et al. Single-cell longitudinal analysis of SARS-CoV-2 infection in human airway epithelium identifies target cells, alterations in gene expression, and cell state changes. PLOS Biology 19, e3001143 (2021). 33. 33.Fiege, J. K. et al. Single cell resolution of SARS-CoV-2 tropism, antiviral responses, and susceptibility to therapies in primary human airway epithelium. PLOS Pathogens 17, e1009292 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.ppat.1009292&link_type=DOI) 34. 34.Stockman, L. J. et al. Severe acute respiratory syndrome in children. Pediatr Infect Dis J 26, 68–74 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/01.inf.0000247136.28950.41&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17195709&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000243238000013&link_type=ISI) 35. 35.Pine, P. S. et al. Evaluation of the External RNA Controls Consortium (ERCC) reference material using a modified Latin square design. BMC Biotechnology 16, 54 (2016). 36. 36.Ramesh, A. et al. Metagenomic next-generation sequencing of samples from pediatric febrile illness in Tororo, Uganda. PLoS One 14, e0218318 (2019). 37. 37.Kalantar, K. L. et al. IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring. Gigascience 9, (2020). 38. 38.Langelier, C. et al. Integrating host response and unbiased microbe detection for lower respiratory tract infection diagnosis in critically ill adults. Proc Natl Acad Sci U S A 115, E12353–E12362 (2018). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMzoiMTE1LzUyL0UxMjM1MyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA3LzE5LzIwMjEuMDcuMTUuMjEyNjAyODUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 39. 39.Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nature Biotechnology 34, 525–527 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nbt.3519&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27043002&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 40. 40.Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res 4, 1521 (2015). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 41. 41.Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43, e47 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkv007&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25605792&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 42. 42.Liao, Y., Wang, J., Jaehnig, E. J., Shi, Z. & Zhang, B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Research 47, W199–W205 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkz401&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F07%2F19%2F2021.07.15.21260285.atom) 43. 43.Korotkevich, G. et al. Fast gene set enrichment analysis. bioRxiv 060012 (2021) doi:10.1101/060012. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czo4OiIwNjAwMTJ2MyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA3LzE5LzIwMjEuMDcuMTUuMjEyNjAyODUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 44. 44.Maechler, M., Rousseeuw, P., Croux, C., Todorov, V. & Ruckstuhl, A. robustbase: Basic Robust Statistics. (2021). 45. 45.Koller, M. & Stahel, W. A. Sharpening Wald-type inference in robust regression for small samples. Computational Statistics & Data Analysis 55, 2504–2515 (2011). 46. 46.Yohai, V. J. High Breakdown-Point and High Efficiency Robust Estimates for Regression. The Annals of Statistics 15, 642–656 (1987). 47. 47.Renaud, O. & Victoria-Feser, M.-P. A robust coefficient of determination for regression. Journal of Statistical Planning and Inference 140, 1852–1862 (2010).