Emergence of immune escape at dominant SARS-CoV-2 killer T-cell epitope ======================================================================= * Garry Dolton * Cristina Rius * Md Samiul Hasan * Barbara Szomolay * Enas Behiry * Thomas Whalley * Joel Southgate * The COVID-19 Genomics UK (COG-UK) consortium * Théo Morin * Katie Topley * Li Rong Tan * Anna Fuller * Aaron Wall * Philip J. R. Goulder * Brad Spiller * Lucy C. Jones * Thomas R. Connor * Andrew K. Sewell ## ABSTRACT The adaptive immune system protects against infection via selection of specific antigen receptors on B-cells and T-cells. We studied the prevalent CD8 ‘killer’ T-cell response mounted against SARS-CoV-2 Spike269-277 epitope YLQPRTFLL via the most frequent Human Leukocyte Antigen (HLA) class I worldwide, HLA A*02. The widespread Spike P272L mutation has arisen in five different SARS-CoV-2 lineages to date and was common in the B.1.177 lineage associated with establishing the ‘second wave’ in Europe. The large CD8 T-cell response seen across a cohort of HLA A*02+ convalescent patients, comprising of over 120 different TCRs, failed to respond to the P272L variant suggesting that proline 272 dominates TCR contacts with this epitope. Additionally, sizable populations (0.01%-0.2%) of total CD8 T-cells from individuals vaccinated against SARS-CoV-2 stained with HLA A*02-YLQPRTFLL multimers but failed to bind to the P272L reagent. Viral escape at prevalent T-cell epitopes restricted by high frequency HLA may be particularly problematic when vaccine immunity is focussed on a single protein such as SARS-CoV-2 Spike and provides a strong argument for inclusion of multiple viral proteins in next generation vaccines and highlights the urgent need for monitoring T-cell escape in new SARS-CoV-2 variants. ## INTRODUCTION The mammalian immune system utilises numerous highly developed mechanisms to defend against viral infection. The most sophisticated of these systems, adaptive immunity, is controlled via vast fleets of highly variable antigen-binding molecules called B-cell receptors (antibodies) or T-cell receptors (TCRs). The advent of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) as a novel human coronavirus and the stress on global healthcare systems caused by the associated coronavirus disease 2019 (COVID-19) has focussed the world’s attention on ways to combat this emerging infection. The mechanisms used by coronaviruses to escape from the host adaptive immune system are not well understood but are now firmly in the spotlight. The developing picture shows that neutralising antibodies, CD4 ‘helper’ T-cells and CD8 ‘killer’ T-cells all contribute to the control of SARS-CoV-2 and the protection offered by currently approved vaccines1,2. Although research focused on antibody-mediated immunity has made up the majority of published SARS-CoV-2 immunological work to date, some evidence suggests that antibodies may play a secondary role in ultimately clearing SARS-CoV-2 infection compared to T-cells. The presence of SARS-CoV-2-specific CD4 and CD8 T-cells has been reported to correlate with reduced COVID-19 severity while neutralising antibodies in the same individuals did not2. Two SARS-CoV-2 positive agammaglobulinemia patients who developed COVID-19 symptoms during the first wave of infection in Italy, required neither oxygen nor intensive care before making full recoveries3. The authors conclude that while an antibody-mediated response to SARS-CoV-2 might be important, it is not obligatory for overcoming disease3. This finding is consistent with multiple patients that developed COVID-19 while on B-cell depleting therapy across several studies, who resolved infection without the need for intensive treatment4–6. There are also many reports of healthy individuals successfully controlling SARS-CoV-2 infection without having detectable neutralising, or receptor-binding domain (RBD), antibodies while having prominent SARS-CoV-2-specific T-cell memory2,7–10. Clinical interventions using monoclonal antibodies further suggest that humoral immunity to SARS-CoV-2, although important, is not the hoped-for panacea for individuals that require intensive treatment11,12. The association of SARS-CoV-2-specific CD4 and CD8 T-cells with milder disease suggests that both T-cell subsets play a role in protective immunity2. Indeed, the relative scarcity of naïve T-cells in individuals over 65 years old and the connection between ageing and impaired adaptive immune responses to SARS-CoV-2 has been suggested as a major cause of severe disease2. Analysis of immune cells in bronchoalveolar fluid from patients with COVID-19 showed that moderate disease correlated with highly clonally expanded CD8 T-cells13. Acute phase SARS-CoV-2-specific T-cells displaying a highly activated cytotoxic phenotype were present in antibody-seronegative exposed family members indicating that they may be capable of eliminating infection prior to induction of humoral immunity7 and it has been suggested that strong antibody responses but weak CD8 T-cell responses could contribute to acute COVID-19 pathogenesis and severity1,14. Depletion of CD8 T-cells in convalescent non-human primates reduced the protective efficacy of natural immunity against SARS-CoV-2 rechallenge15 suggesting CD8 T-cells in the upper respiratory tract may play a similar protective role in humans. Given the importance of CD8 T-cells to adaptive immune protection against COVID-19, we set out to examine dominant T-cell responses to SARS-CoV-2 through the most prevalent major histocompatibility complex (MHC) allele in humans, *HLA A*02* 16. HLA A*02 is an MHC class I molecule and presents processed intracellular protein antigens at the cell surface in the form of short peptides 8-10 amino acids in length for inspection by CD8 T-cells. SARS CoV-2 infection induces T-cells that recognise peptides derived from a range of viral proteins with enrichment for those that respond to Spike, nucleocapsid, membrane, ORF1ab and ORF3a7,8,17–19. Unbiased screening of nine *HLA A*0201*+ convalescent patients (CP) showed that the two biggest and most frequent CD8 T-cell responses recognised regions of the virus contained within ORF1ab residues 3,881-3,900 and Spike residues 261-28019. The dominant Spike epitope was narrowed down to residues 269-277 (YLQPRTFLL) and T-cells that responded to this peptide represented >1 in 10,000 total CD8 T-cells in many *HLA A*0201*+ CP with the majority of the nine donors using a TCR made with the *TRAV-12-1* gene, suggesting a potential shared or ‘public’ response19. Another study found that YLQPRTFLL was the most frequently recognised of 13 reported HLA A*02:01-restricted epitopes (responses in 16/17 *HLA A*0201*+ CP studied)10. TCR sequencing of HLA A*02:01-YLQPRTFLL tetramer+ cells revealed prominent CDR3 motifs that were shared across individuals and confirmed the general *TRAV-12-1* dominance9. TCRβ repertoire analyses also indicates that CD8 T-cell responses to the Spike 265-277 region containing the YLQPRTFLL epitope dominate responses to Spike in both CP and individuals vaccinated with ‘single-shot’ Ad26.5.COV2.S vaccine in the US irrespective of HLA type20. We reasoned that if SARS-CoV-2 were to exhibit escape from CD8 T-cells then this would most likely first occur within a dominant T-cell response restricted by the most frequent HLA in the population. We therefore focussed our attention on residues 269-277 of the SARS-CoV-2 Spike protein and found that the most prevalent mutation in this T-cell epitope, YLQ**L**RTFLL (P272L change indicated in bold underlined text), was not recognised by any of the >120 TCRs that responded to the founder epitope (YLQPRTFLL) across a cohort of nine HLA A*02+ CP. We also found sizeable populations of CD8 T-cells that stained with peptide-HLA A*02 multimers bearing the YLQPRTFLL peptide in a cohort of individuals that had been vaccinated against SARS-CoV-2. These cells could not be stained by reagents manufactured with the P272L variant suggesting that this variant escapes from vaccine induced T-cell responses. ## RESULTS ### Emergence of mutations in the YLQPRTFLL CD8 T-cell epitope The previously unprecedented genome-sequencing efforts for SARS-CoV-2 and sequencing of over 400,000 genomes in the United Kingdom to date (>45% of the global effort) has allowed detailed analyses and identification of over a thousand UK transmission lineages, with some variants associating with higher viral loads and increased transmission fitness21,22. We examined the global dataset sequenced as of January 31, 2021 and performed focussed analyses on the dominant Spike epitope at residues 269-277 (**Figures 1, 2** and **Supplementary Figures 1**). Within our dataset, we used ancestral state reconstruction to estimate the number of independent amino acid substitutions that have occurred in this region from sequenced data collected from the beginning of the pandemic to January 31, 2021. This analysis identified at least 24 different amino acid changes in this region, 22 of which were seen six or fewer times. The two variants resulting in the largest number of cases both occurred at position 272 (P272L and P272H). The P272H mutation occurred prior to onward transmission that resulted in 26 sequenced cases while P272L prior to onward transmission that led to an international cluster comprising over a thousand sequenced cases (**Figure 2** and **Supplementary Figure 1**). The P272L mutation occurred on a background of the B.1.177 lineage (**Figure 1**). B.1.177 is characterised by the lineage defining Spike mutation A222V and was shown to have arisen in Spain, and to have been exported to Europe and the wider world through travel over the summer of 202023. B.1.177 was responsible for numerous local outbreaks in the autumn of 2020 in the UK and was a key lineage in establishing the ‘second wave’ of SARS-CoV-2 in the UK and Europe. The genomic data indicates that the first sequenced occurrence of P272L in any lineage was identified in March 2020, occurring in lineages B1 and B.1.1.263 before emerging in the B.1.177 background, with the first sequenced case of B.1.177 with P272L being reported in June 2020. Following its emergence, P272L in B.1.177 has been sequenced from cases in 29 countries, implying extensive spread over the summer and autumn of 2020 (**Figures 1&2** and **Supplementary Figure 1 and 2**). The B.1.177 sub-lineage carrying P272L was likely to have been imported into the UK in June, shortly after it was first observed (**Supplementary Figure 1**) and has spread extensively since then becoming the 4th most frequent Spike variant in B.1.177 (**Figure 1**). Predominance of the B.1.177 P272L variant was higher in the South East of the UK and was present in 678/4,317 (∼16%) of all B.1.177 genomes sequenced from Greater London between March 7 2020 and January 31 2021. Although this lineage was ultimately suppressed by non-pharmaceutical interventions in the UK following the identification of B.1.1.7, up until December 2020 B.1.177 with P272L showed growth in multiple parts of the UK (**Figure 1**). Although global frequency counts are skewed by the emergence of variants such as B.1.1.7 and variable sequencing globally, in absolute terms P272L was in the top 15 Spike mutations observed globally as of January 31, 2021 (**Supplementary Figure 2**) and this variant has arisen independently at least 4 further times in other lineages (**Figure 1c**). ![Supplementary Figure 1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F6.medium.gif) [Supplementary Figure 1](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F6) Supplementary Figure 1 Worldwide SARS-CoV-2 viral dynamics of P272L spike mutation. Total number of P272L variants by nation, binned into periods of 10 epidemiological (EPI) weeks, from EPI week 13 (beginning 22nd March 2020), the first time a sequence with P272L was observed in sequence data; up to and including January 31st 2021 (EPI week 58). ![Supplementary Figure 2](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F7.medium.gif) [Supplementary Figure 2](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F7) Supplementary Figure 2 **a**, Top 15 most frequently observed variants observed in worldwide spike glycoprotein sequence data in all lineages. Those thought to be associated with enhanced viral transmission are shown in green and those listed as potentially escaping from antibody mediated-mediated immunity are shown in light pink (possible) and dark pink (high confidence) ass assigned at [http://sars2.cvr.gla.ac.uk/cog-uk/#shiny-tab-immunology](http://sars2.cvr.gla.ac.uk/cog-uk/#shiny-tab-immunology). This colour-coding of substitutions is used throughout the figure. Variants of unknown function in grey. N501Y also escapes from antibodies **b**, Fraction of days per month where five or more P272L variants were sequenced in B.1.177 background. **c-d**, Mapping of mutants shown in A onto Spike sequence map and prefusion structure (PDB 6VXX; Walls et al. 2020. *Cell*, **181**, 281-291) ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F1) Figure 1. SARS-CoV-2 variation in the immunodominant Wuhan-YLQPRTFLL dominant HLA A*02:01-restricted CD8 T-cell epitope. **a**, Cumulative frequency of all sequences; sequences in the B.1.177 lineage; and sequences possessing the P272L variant, both in B.1.177 and other lineages across the United Kingdom (left) and worldwide (right). Total number of P272L variants by nation, binned into periods of 10 epidemiological (EPI) weeks is shown in Supplementary Figure 1. **b**, Top 15 most frequently observed variants observed in worldwide Spike glycoprotein sequence data in lineage B.1.177. Data for all lineages is shown in Supplementary Figure 3. **c**, Phylogenetic tree showing 1227 taxa with variants at Spike position 272, with colours indicating subtrees representing potential independent mutations, computed by ASR on a larger tree of 200 221 taxa (not shown). **d**, Total number of sequences possessing P272L variant in England and Wales per administrative region up to and including 31st January 2021. ![Supplementary Figure 3](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F8.medium.gif) [Supplementary Figure 3](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F8) Supplementary Figure 3 Wuhan-YLQPRTFLL T-cell lines from SARS-CoV-2 convalescent patients do not activate with P272L-YLQLRTFLL variant peptide. **a**, PBMCs from nine SARS-CoV-2 CP were enriched with Wuhan-YLQPRTFLL tetramers then expanded with allogeneic PBMC feeders and PHA. The lines were used in a 4h T107a assay with 10−6 M Wuhan-YLQPRTFLL or P272L-YLQ**L**RTFLL peptides. **b**, Both Wuhan and P272L peptides bind to HLA A2. HLA A2 APC staining data from a T2 binding assay using Wuhan-YLQPRTFLL and P272L-YLQ**L**RTFLL exogenous peptides. HLA B*35:08 binding HPVGEADYFEY peptide used as an irrelevant control. Error bars depict SD of duplicates. **c**, Refolded Wuhan-YLQPRTFLL and P272L-YLQ**L**RTFLL HLA monomers. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F2) Figure 2. SARS-CoV-2 viral dynamics of P272L spike mutation in Western Europe. Total number of P272L variants by nation, binned into periods of 8 epidemiological (EPI) weeks, from EPI week 23 (beginning Sunday 31st May 2020), the first time a sequence with P272L was observed in sequence data; up to and including January 31st 2021 (EPI week 58). ### The P272L Spike variant is not recognised by YLQPRTFLL-specific CD8 T-cells from convalescent patients In order to study the impact of mutation in the YLQPRTFLL epitope we recruited a cohort of SARS-CoV-2 convalescent local healthcare workers during June 2020. Each had a history of COVID-19 symptoms and a positive nasopharyngeal swab for SARS-CoV-2 by PCR >28 days prior to sample collection. 15/30 donors tested (50%) were *HLA A*02*+ by antibody staining. Nine of the first ten *HLA A*02*+ CP we screened had CD8 T-cells that stained with A*02:01-YLQPRTFLL tetramer (example in **Figure 3a**), confirming the widespread response to this epitope seen in previous studies10,19. Bulk sorting of tetramer+ populations was used to generate a T-cell line from each of the nine CP initially screened. These lines all responded to YLQPRTFLL peptide (13-53% total cells) but completely failed to respond to YLQ**L**RTFLL peptide (**Figure 3b and Supplementary Figure 3a)** despite this sequence showing improved binding to HLA A*02 compared to the parental (Wuhan) sequence (**Supplementary Figure 3b**). The ability of the YLQ**L**RTFLL peptide to bind to HLA A*02 was confirmed during tetramer production (**Supplementary Figure 3c**). TCR sequencing of the A*02:01-YLQPRTFLL tetramer+ T-cells in each line identified 121 different TCRs across the seven patients including the public TCR chains previously identified in addition to many donor-specific TCR sequences (**Figure 3c** and **Supplementary Figure 4**). Remarkably, these data indicate that all the TCRs failed to respond to the P272L variant sequence and suggests that the proline at residue 272 plays a critical role in recognition by all TCRs across this cohort. Failure to recognise the P272L mutant or bind to this sequence was confirmed by direct *ex vivo* staining of PBMC (**Figure 3A**) and by using four different CP-derived T-cell clones (**Figure 4**). We further confirmed that surrogate infected cells expressing full-length Spike protein with the P272L substitution (**Supplementary Figure 5**) were not recognised by any of these T-cell clones (**Figure 4**). We conclude that P272L has arisen on multiple occasions (appearing to show selection in the B.1.177 background) and escapes from all TCRs raised against the parental (Wuhan) sequence that is being used in all current vaccines. ![Supplementary Figure 4](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F9.medium.gif) [Supplementary Figure 4](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F9) Supplementary Figure 4 TCR usage and CDR3s from Wuhan-YLQPRTFLL tetramer sorted T-cells from lines generated from seven SARS-CoV-2 patients. **a**, Wuhan-YLQPRTFLL tetramer sorted T-cell lines for TCR sequencing shown in **Figure 3c. b**, TCR sequences of the Wuhan-YLQPRTFLL specific T-cells. Public TCR chains are highlighted in blue. TCR chains of the T-cell clones used in the study are indicated according to the key. ![Supplementary Figure 5](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F10.medium.gif) [Supplementary Figure 5](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F10) Supplementary Figure 5 Confirmation of HLA A2 and spike expression of transduced A549 cells. HLA A2 (A2) and spike protein expression of the Wuhan and P272L spike transduced A549-A2. HLA A2 staining with BB7.2 Ab clone. Unconjugated isotype and anti-spike antibodies used in conjunction with a PE conjugated secondary antibody on Y axis. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F3) Figure 3. Wuhan-YLQPRTFLL T-cell lines from SARS-CoV-2 positive donors do not activate with P272L-YLQLRTFLL variant peptide. **a**, *Ex vivo* PBMC Wuhan-YLQPRTFLL and P272L-YLQ**L**RTFLL tetramer staining of a COVID patient. HLA A2 SLYNTVATL (SLY) peptide from HIV used as an irrelevant tetramer44. Percentage of CD8 T-cells is shown. **b**, Wuhan-YLQPRTFLL T-cell line enriched from nine COVID patients do not activate towards the P272L-YLQ**L**RTFLL peptide (10−6 M used for both peptides). Percentage of reactive cells is displayed. Full data shown graphically in **Supplementary Figure 3a**. M012 and F021 analysis performed on a different day. **c**, T-cell receptor (TCR) beta chain analysis of Wuhan-YLQPRTFLL tetramer sorted T-cells from seven of the patients in A (flow data in **Supplementary Figure 4a**). Pie charts display the proportion (chart segments) and frequency (numbers in center) of public (blue) and private (grey) TCRs. Variable (V) (arc on the right) and joining (J) (arc on the left) gene rearrangements are shown below the pie charts, with the dominant clonotypes annotated. CDR3s sequences are listed in **Supplementary Figure 4b**. T-cell clones (indicated with asterisk) are featured in **Figure 4**. ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F4) Figure 4: Wuhan-YLQPRTFLL specific T-cell clones from SARS-CoV-2 convalescent patients are unable to recognize P272L variant. YLQPRTFLL peptide reactive CD8 clones were grown from donors F014 (83), F019 (120 and 132) and F036 (36). Alpha and beta TCR chain CDR3s of each clone are shown and the public or private status of each indicated according to the key. Upper panel: peptide sensitivity assay with Wuhan-YLQPRTFLL and P272L-YLQ**L**RTFLL peptides. MIP-1β ELISA and error bars depict SD of duplicates. Middle panel: peptide-HLA tetramer staining. HLA A2 SLYNTVATL (SLY) peptide from HIV used as an irrelevant tetramer29. Lower panel: A549 cells expressing HLA A2 (A549-A2) and full length P272L Spike were not recognized, whereas endogenously expressed Wuhan spike was recognized. MIP-1β ELISA and error bars depict SD of duplicates. ### The P272L Spike variant is not recognised by YLQPRTFLL-specific CD8 T-cells from vaccinees We next examined responses to YLQPRTFLL in individuals who had been vaccinated against SARS-CoV-2. In total, PBMC from four HLA A*02+ healthy donors (HD) who had received a single dose of the AstraZeneca ChAdOx1 nCoV-19 vaccine24 and a further three HD who received the Pfizer-BioNTech BNT162b2 vaccine25 (see Materials and Methods for donor and vaccination details) were stained with peptide-HLA tetramers. All vaccinees had HLA A*02-YLQPRTFLL tetramer staining T-cells that accounted for between 0.01 and 0.2% of total CD8 T-cells (**Figure 5**). In all cases, these T-cells failed to stain with the HLA A*02-YLQ**L**RTFLL (P272L) reagent in parallel assays, indicating that the TCR on these T-cells did not strongly engage this sequence in accordance with the CP cohort data (**Figure 5**). We conclude that the YLQPRTFLL-specific T-cells induced by two current COVID vaccines fail to engage to the P272L variant sequence. ![Figure 5:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/06/28/2021.06.21.21259010/F5.medium.gif) [Figure 5:](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/F5) Figure 5: Wuhan-YLQPRTFLL specific T-cell clonotypes from vaccinated donors fail to stain with P272L variant tetramers. Tetramer staining of PBMC from 7 vaccinated donors (six single dose and donor 0439 double dose) with HLA A*0201-YLQPRTFLL and P272L tetramers. Donors received the AstraZenica (AZ) ChAdOx1 nCov-19 vaccine or the BNT162b2 (BNT) vaccine as indicated. See materials and Methods for details of each donor. The age and gender indicated under donor name. 0439 BNT: BD FACS Canto with CD8-APC-Vio770. All other donors: SONY MA900 with CD8 APC. ## DISCUSSION Emerging evidence indicates that HLA-I-restricted CD8 ‘killer’ T-cells contribute to the control of SARS-CoV-2 and the immunity to disease offered by the currently approved vaccines1,2. Individual HLA-I alleles can be associated with increased likelihood of control or progression of disease with viruses such as HIV26. The immune pressure of prevalent host CD8 T-cell responses against HIV results in selection of altered viral sequences and viral immune escape27–29. We were interested in whether there was evidence that SARS-CoV-2 might also escape from CD8 T-cells. Genome sequencing has shown extensive non-synonymous mutations occurred within the SARS-CoV-2 genome during 2020. Some of these mutations alter viral fitness30 while others have been shown to diminish the binding of antibodies31. To date, there has been limited study of T-cell escape and a broad brush-approach of examining T-cell recognition of key transmission variants failed to reveal any evidence that escape was occurring32. A recent report demonstrated that mutations in the RBD that enhance angiotensin converting enzyme (ACE)2 binding and viral infectivity concomitantly reduce T-cell recognition through HLA*24:02 33. Another study showed that some SARS-CoV-2 mutations in HLA-I-restricted epitopes resulted in lower binding to the HLA leading to reduced recognition by CD8 T-cells but there was little evidence of these mutations being disseminated34. It has been suggested that the L270F (Y**F**QPRTFLL) variant might be an escape mutant34, but this variant was only seen six times throughout our study period, in six different locations and five different PANGO lineages. From our data, this would suggest that while this mutation does arise, there is no evidence of spread of this variant following sporadic emergence across multiple different SARS-CoV-2 lineages. We reasoned that if escape were to occur then it would be most likely to be seen in a prevalent response through a frequent HLA. HLA A*02 is frequently carried by all human populations except those of recent African ancestry where it is present at lower levels. HLA A*02 is believed to be the most frequent HLA-I in the human population worldwide occurring in ∼40% of individuals16. *HLA A*02* is hypothesized to have become so frequent in the population due to its success at presenting well-recognised epitopes from historically dangerous pathogens, a supposition consistent with this HLA having entered the population via interbreeding with Neanderthals and Denisovans after early humans migrated from Africa35. A previous study used overlapping peptides from the entire SARS-CoV-2 proteome to identify that the most common HLA A*02-restricted responses in CP were to epitopes contained within peptides spanning residues 3,881-3900 of ORF1ab and 261-280 of Spike19. The Spike epitope was narrowed down to residues 269-277; sequence YLQPRTFLL. A further study confirmed the prevalence of CD8 T-cells that respond to YLQPRTFLL in CP that are absent in healthy donor blood taken before the spread of SARS-CoV-2 10. We confirmed the prevalence of responses to this epitope in our cohort where responses were observed in 9/10 HLA A*02+ CP tested and all 7/7 SARS-CoV-2 vaccinees including those who had received only the first dose of a double dose schedule. We reasoned that if escape from CD8 T-cells were to occur, then it would most likely be seen in a dominant epitope restricted by the most frequent HLA-I in the population so focussed our attention on nonsynonymous mutations in the sequence encoding YLQPRTFLL. While at least 24 different amino acid changes have occurred in this region, only two were seen in more than six sequenced cases in the period from the beginning of the pandemic to January 31st, 2021. The two variants resulting in the largest number of cases both occurred at position 272 with the P272L mutation occurring in over 8,500 sequenced cases as of June 1st, 2021. The P272L variant arose in 5 different lineages during the study period but was most prevalent in the B.1.177 lineage that played a key role in establishing the ‘second wave’ of SARS-CoV-2 in the UK and Europe during the Autumn of 2020. P272L was the 4th most common variant seen in the B.1.177 lineage and its descendants and in the top 15 Spike mutations observed globally to January 31, 2021. Nine of ten of our HLA A*02+ CP cohort tested made a response to the YLQPRTFLL epitope; a response comprising >120 different TCRs in total. Remarkably, no response was seen to the P272L variant. This finding was confirmed using four CP-derived T-cell clones that included TCR α or β chains that have previously been described in other CP cohorts10,19. Responses to the YLQPRTFLL epitope in a cohort of HLA A*02+ healthy donors who had been vaccinated against SARS-CoV-2 but had never had symptoms or a positive test since January 2020 ranged from 0.01-0.2% of CD8 T-cells by peptide-HLA staining. However, all of the T-cells raised against the YLQPRTFLL epitope across our vaccinee cohort expressed TCRs that engaged the P272L very poorly and did not stain with P272L peptide-HLA tetramers. It may seem surprising that YLQPRTFLL-specific TCRs, >120 in total across our CP cohort and likely many more in the vaccinees, *all* fail to engage the P272L variant when a widely polyclonal response to T-cell antigens should prevent widespread escape of this nature. The widespread failure to engage the P272L variant likely reflects the nature of the amino acid substitution involved. Substitution of a proline in the middle of an HLA-I-restricted T-cell epitope is likely to be a particularly drastic change in terms of TCR recognition as residues 4-6 in a 9-mer peptide often stand proud of the HLA-I peptide binding groove to provide an obvious focal point for host TCRs36. This can be especially true for the ringed *imino* acid proline. We have previously demonstrated that the proline at position 5 in the HLA A*02:01-restricted 10-mer preproinsulin-derived epitope ALWGPDPAAA associated with type 1 diabetes is critical for recognition by patient-derived TCRs37,38. Subsequent structural analyses showed how this proline stands proud like the “cherry on a cake” to provide ∼70% of the binding energy between the ALWGPDPAAA preproinsulin epitope and a cognate patient-derived TCR39,40. We predict that the proline at position 4 within the predominant HLA A*02-restricted SARS-CoV-2 epitope might also represent the most obvious docking point for host TCRs in the 9-mer YLQPRTFLL epitope although this awaits structural confirmation. If, as we expect, the proline at position 4 represents the major TCR docking platform this would explain why the loss of this one residue results in such widespread escape from YLQPRTFLL-specific T-cells across all donors in our study. SARS-CoV-2 variants have been rapidly emerging throughout the current pandemic with mutants that enhance transmissibility, evade host immunity or increase disease severity being of particular concern. Current systems for identifying variants of public health concern involve risk assessments that identify mutations that are present, and then looks for mutations that have a known biological effect in order to assess their significance. While extensive amounts of work have been undertaken to predict and characterise the likely effects of mutations in Spike on antibody recognition, the same is not true for T-cells. Current risk assessments that only consider antibody affinity and factors associated with transmissibility (e.g. mutations that improve cellular infectivity) are clearly incomplete. The range of potential effects of mutations that affect recognition by T-cells is significant, and urgently requires further study. It is clear that there was a significant outbreak that spanned Europe of a lineage carrying a mutation that we have shown has a detectable impact on T-cell recognition, which went unrecognised at the time. Ultimately, it is unclear whether SARS-CoV-2 incorporates enough genetic plasticity to allow it to escape from humoral immune responses. However, the area targeted on the RBD by neutralising antibodies is large enough, and the range of epitopes targeted in polyclonal human sera broad enough, to ensure that no single mutation should allow complete escape from neutralisation in the majority of individuals41. Likewise, the wide array of HLA across the population16 and the broad range of epitopes responded to in COVID-19 patients18 combine to make it unlikely that SARS-CoV-2 will completely escape from T-cell surveillance in the near future. During the early stages of the pandemic, it seems likely that alterations in SARS-CoV-2 Spike that enhance binding to ACE2 or reduce the capability of vaccine-induced, or natural, antibodies to neutralise the virus will take precedence due to their potential relevance in all transmission events. This ‘shrouding effect’ may well have occurred in the UK SARS-CoV-2 epidemic, although this is difficult to fully assess for two reasons. First, the B.1.1.7 lineage has been shown to have a significantly higher transmissibility compared to B.1.177 (and other circulating lineages occurring in the Autumn and Winter of 2020)42, and so while P272L may have conferred some advantage, this is likely to have been insufficient to prevent the lineage being replaced by B.1.1.7. Also, it is important to note that the trajectory of P272L B.1.177 isolates was generally upwards going into the Winter of 2020, however the introduction of significant non-pharmaceutical interventions in response to B.1.1.7 resulted in a strong suppression of all SARS-CoV-2 lineages in the UK. It remains to be seen whether the P272L mutant will arise and show selection in other lineages. However, since this mutation has occurred in several variants already, we predict that it will appear in future variants. Our data suggests that the 269-277 epitope of Spike is one region that should be monitored and should feature in risk assessments by public health agencies going forward. In summary, we demonstrate that SARS-CoV-2 can readily alter its Spike protein via a single amino acid substitution so that it is not recognised by CD8 T-cells targeting the most prevalent epitope in Spike restricted by the most common HLA-I across the population. While it is not possible to directly attribute the emergence and propagation of the Spike P272L SARS-CoV-2 variant in parts of the UK where *HLA A*02* is frequently expressed to CD8 T-cell-mediated selection pressure, specific focussing of immune protection on a single protein (e.g. SARS-CoV-2 Spike favoured by all currently approved vaccines43) is likely to enhance any tendency for escape at predominant T-cell epitopes like YLQPRTFLL. Our demonstration that mutations that evade immunodominant T-cell responses through population-frequent HLA can readily arise and disseminate, strongly suggests that it will be prudent to monitor such occurrences and to increase the breadth of next generation SARS-CoV-2 vaccines to incorporate other viral proteins. ## METHODS ### SARS-CoV-2 lineage dynamics To characterise the distribution of mutations in the Spike epitope between positions 269-277, we performed a set of analyses making use of high quality publicly available sequence data made available by GISAID and COG-UK as of 31 January 202144. To generate an estimate of the number of times mutations have occurred within the epitope, we performed ACCTRAN45 ancestral state reconstruction using gotree (available at [https://github.com/evolbioinfo/gotree](https://github.com/evolbioinfo/gotree)), on a tree with 200 221 taxa, generated by COG-UK (COVID-19 Genomics UK (COG-UK), 2020), for each Spike protein site. Subtree groups were identified by nodes with a consensus to variant transition. The phylogenetic tree for visualising variants at position 272 comprises 1227 taxa, and was generated from an alignment of consensus genome sequences using mafft version 7.475, with ambiguous/problematic positions masked following46 using v4 of the problematic sites vcf (available here: [https://github.com/W-L/ProblematicSites\_SARS-CoV2/blob/master/archived\_vcf/problematic\_sites\_sarsCov2.2020-12-22-11:15.vcf](https://github.com/W-L/ProblematicSites\_SARS-CoV2/blob/master/archived_vcf/problematic_sites_sarsCov2.2020-12-22-11:15.vcf)). The tree was built using IQ-TREE47, with a HKY+G model48, 1000 ultrafast bootstrap replicates49, and rooted using Wuhan isolate MN908947.3 as an outgroup. Tree drawing was performed with ggtree50. Information on case location (country, from GISAID and local authority from the COG-UK public data) and variant frequency by PANGO lineage was collated from GISAID and COG-UK and these were then visualised using R version 4.0.5. ### Convalescent COVID patients All participant samples were pseudo-anonymised and the clinical Chief Investigator, Dr Lucy C Jones, was the only individual with access to the confidential record linking sample numbers to identifying information. Pseudo-anonymised sample numbers were not known to participants or to anyone outside of the research group. Our CP study was given a favourable opinion by the Health Research Authority (HRA) Research Ethics Committee London – (Brighton & Sussex), IRAS (Integrated Research Application System for HRA) 269506 and was also reviewed by Healthcare Research Wales and Cwm Taf Morgannwg University Health board, with research permissions granted. Eligible participants were recruited during March-June 2020 and were adult (over 18) convalescent healthcare workers at Cwm Taf Morgannwg University Health Board. Inclusion criteria included a positive nasopharyngeal swab for SARS-CoV-2 by PCR, more than 28 days prior to blood sample collection. In addition to having had a positive SARS-CoV-2 swab test, all participants fulfilled the criteria for a definite diagnosis of COVID-19 at least 28 days prior to recruitment, defined by at least one of the following: new onset of pyrexia, a continuous cough, anosmia, or ageusia. ‘M’ and ‘F’ of the patient code denotes male or female, with 3 males and 7 females used for this study. Venous blood samples (50mls) were collected from 37 informed and consented participants. This was stored in EDTA for up to 3 hours and transported in UN3373 containers to the laboratory. Peripheral Blood Mononuclear Cells (PBMCs) were collected and frozen on the same day. All human blood was procured and handled in accordance with the guidelines of Cardiff University to conform to the United Kingdom Human Tissue Act 2004. HLA A*02 typing was performed by flow cytometry using anti-HLA A2 antibody (BB7.2, BioLegend, San Diego, CA, US). Of the nine *HLA A*0201*+ patients seven had mild disease and two had moderate disease in accordance with the US National Institutes of Health classification. None of the participants required treatment with invasive ventilation support. The 3 male and 6 female participants studied had ages ranging from 30 to 60 years and a median age of 50 years. Peripheral blood was collected between 6 and 13 weeks after the positive PCR swab (median 12 weeks). ### SARS-CoV-2 vaccinees We recruited six participants who had received one dose of a SARS-CoV-2 vaccine as part of the National UK vaccination programme, under ethical approval (Central University Research Ethics Committee (CUREC), University of Oxford, Approval Reference: R71346/RE001). Inclusion criteria for participants included: no clinical history of symptoms of COVID-19 since January 2020, no history of a positive COVID swab or positive COVID antibody test. Exclusion criteria of participants included: immunological conditions which directly affect immune responses, current or recent medications which directly affect immune responses, recent history of a transfusion of blood products and pregnancy. Peripheral blood (50 mls) was collected between 28 to 60 days after receiving one dose of a vaccine for SARS-CoV-2 as indicated in **Table 1** and processed withing 4 hours of sampling. Participants received either the Astrazeneca (ChAdOx1 nCoV-19) vaccine or the Pfizer (BNT162b2 2) vaccine. Healthy donor 0439 was a colleague who received the BNT162b2 vaccine25 and volunteered a sample on day 14 following the second vaccine dose having heard that we were studying immune escape. Donor 0439 was the only donor to have undergone a full vaccination course as indicated in **Table 1**. The three male and four female donors had ages ranging from 41 to 75 years and a median age of 57 years. View this table: [Table 1.](http://medrxiv.org/content/early/2021/06/28/2021.06.21.21259010/T1) Table 1. Participant characteristics. Gender, type of SARS-CoV-2 vaccine administered and the time of phlebotomy post-vaccine in days. ### PBMCs and cell lines PBMC isolation from CPs and a healthy vaccinated donor was performed by standard density centrifugation using Histopaque-1077 (Merck Group, Darmstadt, Germany). Cell lines were cultured on the basis of ATCC guidelines: immortalized embryonic kidney cell HEK293T (CRL-1573™); lung carcinoma A549 (CCL-185 ™) and T2 (CRL-1992™). Cells were cultured in RPMI 1640 (A549 and T2) or DMEM (HEK293T) medium supplemented with 10% FBS, 2 mM L-glutamine and 100 U/mL penicillin and 100 μg/mL streptomycin (R10 or D10 respectively; all from Merck Group). Adherent cell lines were cultured as adherent monolayers, passaged when 50–80% confluent using D-PBS (Merck Group) with 2 mM EDTA to detach cells. T2s were cultured as suspension cells and split 1:10 twice weekly. Cells were regularly tested for mycoplasma contamination (MycoAlert™, Lonza). ### Generation of T-cell lines and clones T-cells were cultured in R10 supplemented with 25 ng/ml interleukin IL-15 (Miltenyi Biotech, Bergisch Gladbach, Germany), 20 or 200 IU/mL IL-2 (Proleukin®; Prometheus, San Diego, CA), 1X non-essential amino acids solution, 1 mM sodium pyruvate and 10 mM HEPES buffer (all from Merck Group unless stated otherwise). T-cells from patients recognising Wuhan-YLQPRTFLL were isolated from whole PBMCs using PE conjugated Wuhan-YLQPRTFLL peptide-HLA tetramers and magnetic anti-PE beads according to the manufacturer’s guidelines (Miltenyi Biotec) and as previously described51,52. For donors M001, F012 and F021 a bulk T-cell library was firstly created using magnetically purified CD8 T-cells (Miltenyi Biotec) and CD3/CD28 Dynabeads according to the manufacturer’s instructions (ThermoFisher Scientific, Waltham, MA, US). After 2 weeks of expansion with CD3/CD28 beads, Wuhan-YLQPRTFLL PE tetramers and anti-PE beads were used as above to enrich antigen specific T-cells. Tetramer enriched cells were allowed to culture overnight and then expanded with irradiated (3100 cGy) allogeneic PBMCs from three donors purchased from the Welsh Blood Service, and 1 μg/mL of L-phytohemagglutinin (Pan Biotech, GmbH, Am Gewerbepark 6, 94501 Aidenbach, Germany). Cloning was performed as previously described53. Briefly, tetramer enriched T-cell lines were cloned by plating 0.5, 1 or 3 cells per well in 96U-well plates with 5 ×104 irradiated PBMCs and PHA as above. ### Flow cytometry and cell sorting For surface staining, typically 5 ×104 cells per condition were used. T-cells were stained with LIVE/DEAD™ fixable violet dead stain (ThermoFisher Scientific) and CD3 (BW264/56) and CD8α (BW135/80) antibodies (all from Miltenyi Biotec). Cells were gated on FSC-A versus SSC-A, single cells (FSC-A versus FSC-H), then viable cells (marker of choice versus VIVID). Samples were acquired on a FACSCantoII machine (BD Biosciences, Franklin Lakes, NJ, USA) and analyzed in the FlowJo software (Tree Star Inc., Ashland, Oregon, US). Cell sorting was performed on SONY MA900 multi-application cell sorter (SONY, San Jose, CA, USA). ### Lentivirus production and transduction Lentiviral particles were generated by transient co-transfection of HEK293T cells with a 3rd generation plasmid system. Briefly, codon-optimised Wuhan Spike, P272L Spike (site directed mutagenesis of Wuhan Spike, details below) and *HLA A*0201* genes were synthesised (Genewiz, Essex, UK) and cloned into a 3rd generation lentiviral plasmids. Spike genes were designed with a rCD2 co-marker: Xba1-Kozak-Spike-Xho1-P2A-rCD2-Sal1-Stop, and cloned in to pSF (Oxgene, Oxford Genetics Ltd, Littlemore, Oxford, UK). The pSF plasmid was modified by removal of a Xho1 site present in the plasmid backbone and insertion of stop codons after the Sal1 site. *HLA A2* was cloned in to pELNS (Kind gift from James Riley): Xba1-Kozak-HLA A2-Stop-Xho1. For transfection, pSF or pELNS (1.52 μg), envelope plasmid (pMD2.G; 0.72 μg) and packaging plasmids (pMDLg/pRRE; 1.83μg and pRSV-REV; 1.83μg) were mixed in 300 μL of Opti-MEM, reduced serum medium (ThermoFisher Scientific) followed by mixing with 1 μg/μl Polyethylenimine (PEI; Merck Group) at a 3:1 PEI: plasmid ratio. Plasmid/PEI mixtures were incubated at room temperature for 15 min, added dropwise to the HEK293T cells (80% confluence in one well of a 6-well plate) and incubated at 37°C in a 5% CO2 humidified atmosphere. The supernatants containing lentiviral vectors were harvested, 0.4 μm filtered, and used immediately for transduction or stored at −80°C and only defrosted once before transduction. A549 cells were transduced by spinfection with viral supernatant at 400g for 2 h, incubated at 37°C overnight and media was replaced the following morning. HLA A2 transduction was confirmed by staining with BB7.2 Ab. Spike transduced cells were firstly checked for with conjugated anti-rCD2 antibody (OX-34, BioLegend), then in follow-up experiments with 1 μg/test of unconjugated anti-SARS Spike glycoprotein antibody (1A9, Abcam, Cambridge, MA, US). An unconjugated isotype matched antibody (P.3.6.2.8.1, eBiosciences, ThermoFisher Scientific) was used as a control. A goat anti-mouse secondary antibody (multiple adsorbed PE conjugated Ig polyclonal; BD Biosciences, Oxford, UK) was used to detect Spike antibody staining. ### Site directed mutagenesis of the Wuhan Spike gene Oligonucleotides were designed to substitute the Proline to a Leucine at position 272. Non-overlapping sequences were designed so that the 5’ end of the forward oligo (5′ TGAGAACCTTCCTGCTGAAGTACAACG 3’) started with the nucleotides to be mutated and included at least 20 complementary nucleotides on the 3’ side of the mutation. The reverse oligo (5′ GCTGCAGGTAGCCCACATAGTAAG 3′) was designed in the other direction so that the 5’ end annealed back-to-back with the forward oligo. Oligonucleotides were purchased via the Eurofins Genomics Website (Luxembourg, EU). PCR was performed using Phusion HF PCR Master Mix (ThermoFisher Scientific). Cycling conditions were performed according to manufacturer instructions, except elongation which was done at 1 min/kb. PCR products sizes were verified by running samples on 1% agarose gel. Correctly amplified products were treated by Kinase-Ligase-Dpn1 (KLD) enzyme mix (New England Biolabs, Ipswich, MA, USA) for 30 min at room temperature. Samples were then transformed into XL10-Gold Ultracompetent cells (Agilent Technologies Santa Clara, CA, US) by heat shock and cells were plated on LB-Agar plate with Carbenicillin antibiotic (50 μL/mL; ThermoFisher Scientific). Plasmid DNA was extracted from antibiotic resistant colonies using a HiPure Miniprep kit (ThermoFisher Scientific). DNA samples were sent to Eurofins Genomics sequencing platform using the following primers: * Ef1a For: 5’ TCAAGCCTCAGACAGTGGTTC 3’ * Spike seq primer 1(p800nt): 5′ CTCTGCTCTGGAACCCCTGGT 3′ * Spike seq primer 2(p1500nt): 5′ GAGTGTCTGTGATCACCCCT 3′ * Spike seq primer 3(p2500nt): 5′ CGGCTTCAATTTCAGCCAGATT 3′ * Spike seq primer 4(p2800nt): 5′ CAACAGCGCCATCGGCAAGA 3′ * rCD2 Rev: 5’ AACTTGCACCGCATATGCAT 3’ ### T-cell activation assays Prior to assays, T-cells were rested in R5 (as for R10 with 5% FBS) for 24 h before setting-up any activation assay to help reduce spontaneous activation. Typically, 3 ×104 T-cells and 6 ×104 target cells were used per well in R5 in a 96U well and incubated overnight, with supernatants collected for an ELISA (MIP-1β), which was performed according to the manufacturer’s instructions (R&D Systems) using half area ELISA plates. For TNFα-processing inhibitor (TAPI)-0 (Merck Group) assays, T-cells and target cells were co-incubated for 4 h with 30 μM TAPI-0 (Sigma, UK), antibody directed against TNF (clone cA2, Miltenyi Biotec) and CD107a antibody (clone H4A3, BD Biosciences) to detect activation-induced degranulation of cytotoxic T-cells. Following co-incubation, cells were washed and stained with Fixable Live/Dead Violet Dye VIVID and antibodies against appropriate cell surface markers as above. ### Peptide-MHC tetramer production and staining Peptide-HLA-A*02:01 tetramers were produced and used to stain cells as previously described54. Briefly tetramers were assembled from monomers via the step-wise addition of PE-conjugated streptavidin (ThermoFisher Scientific) at a peptide-MHC:streptavidin molar ratio of 4:1. Protease Inhibitor (Merck Group) and PBS were added to give a final peptide-MHC tetramer concentration of 0.1 mg/ml (with regard to the peptide-MHC component). Following assembly, 0.5 μg of tetramer (with respect to the peptide-MHC component) was used to stain 5 ×104 of a T cell clone and 4 ×106 PBMCs. T-cell clones were stained from culture, COVID-19 patient PBMCs from cryopreserved samples and vaccinee PBMCs from freshly drawn blood. An optimised tetramer staining protocol involving pre-incubation with 50 nM protein kinase inhibitor, Dasatinib (Axon Medchem, Reston, VA, USA) and unconjugated anti-PE antibody (PE001, BioLegend) was used as described previously55–57. Following tetramer staining, T-cells were stained with LIVE/DEAD™ fixable violet dead stain (ThermoFisher Scientific), anti-CD3 and anti-CD8 antibody and analysed using flow cytometry as above. PBMCs were also stained with anti-CD14 (M5E2, BioLegend) and anti-CD19 (HIB19, BioLegend) pacific blue conjugated antibodies to create a ‘dump’ channel for dead cells. ### HLA A2 peptide binding assay T2 cells were plated in a 96U-well plate (2 ×104 per well) in serum-free media. Wuhan-YLQPRTFLL and P272L variant-YLQ**L**RTFLL Peptides (Peptide Protein Research, Fareham, UK) were added at 100 μM concentrations; alongside a negative control peptide HPVGEADYFEY from EBV that binds HLA B*35:01. The percentage volume of DMSO was matched in each condition. Cells were then incubated overnight at room temperature. The following day, cells were stained with anti-HLA A2 antibody (clone BB7.2, BioLegend) and incubated at 37°C for 1 h. Cells were then washed in PBS and stained with LIVE/DEAD™ fixable violet dead stain (ThermoFisher Scientific) and analysed using flow cytometry. ### TCR sequencing TCR sequencing was performed as previously described58. RNA extraction was carried out using the RNEasy Micro kit (Qiagen, Hilden, Germany). cDNA was synthesized using the 5′/3′ SMARTer kit (Takara Bio, Paris, France). The SMARTer approach used a Murine Moloney Leukaemia Virus (MMLV) reverse transcriptase, a 3′ oligo-dT primer and a 5′ oligonucleotide to generate cDNA templates flanked by a known, universal anchor sequence at the 5′. PCR was then set up using a single primer pair consisting of TRAC or TRBC-specific reverse primer and an anchor-specific forward primer (Takara Bio, Paris, France). All samples were used for the following PCR reaction: 2.5 μL template cDNA, 0. 5 μL High Fidelity Phusion Taq polymerase, 10 μL 5X Phusion buffer, 0.5 μL DMSO, 1 μL dNTP Mix (10 mM each) (all from ThermoFisher Scientific, UK), 1 μL of TRAC or TRBC-specific primer (10 μM), 5 μL of 10X anchor-specific UPM primer (Takara Bio, Paris, France), and nuclease-free water for a final reaction volume of 50 μL (cycling conditions: 5 min at 94°C, 30 cycles of 30 s at 94°C, 30 s at 63°C for alpha chains or 30 s at 66°C for beta chains, 120 s at 72°C). Subsequently, 5 μL of the first PCR products were taken out to set up a nested PCR as above, using 1 μL of a nested primer pair (10 μM). All nested primers were flanked with Illumina indexes (cycling conditions: 5 min at 94°C, 30 cycles of 30 s at 94°C, 30 s at 62°C, 120 s at 72°C, and a final 10 min at 72°C). The final PCR products were loaded on a 1% agarose gel and purified with the Monarch® gel extraction kit (New England Biolabs). Purified products were sequenced on an Illumina MiSeq instrument using the MiSeq v2 reagent kit (Illumina, Cambridge, UK). Sequence analysis was performed using MiXCR software (v3.0.7)59. Public TCR clonotypes were identified using the VDJdb database60. ### Data display Unless stated otherwise, all data were displayed using GraphPad Prism software. TCR V-J usage plots were generated using VDJ tools61. ## Supporting information COG Consortium [[supplements/259010_file03.pdf]](pending:yes) GISAID sequences used [[supplements/259010_file04.pdf]](pending:yes) CP study approval [[supplements/259010_file05.pdf]](pending:yes) ## Data Availability All data generated or analysed during this study are included in this published article (and its supplementary information files). Raw data are available from the corresponding authors on reasonable request. ## AUTHOR CONTRIBUTIONS GD, LCJ and AKS conceived the study. LCJ acquired the donor samples and clinical data. GD, CR, MSH, BSz, EB, TW, JS, TM, KT, LRT, AF, AW, PG, BSp, TC and AKS undertook experiments and/or analysed the data. GD, CR, TC and AKS wrote the manuscript. ## COMPETING INTERESTS The authors declare no competing interests in regard to this work ## ACKNOWLEDGEMENTS This work was generously funded by Continuum Life Sciences (CLS) to aid the global COVID-19 effort. CLS provided salary for GD and CR. AKS is a Wellcome Investigator (220295/Z/20/Z) and we acknowledge salary support from Wellcome. EB received support from a Wellcome Trust Institutional Strategic Support Fund Fellowship. MSH was part funded by Enara Bio. We are grateful of funding from the Welsh Government through their Senior Research Leaders Scheme to AKS. LCJ is a Clinical Investigator at Cwm Taf Morgannwg University Health Board (CTMUHB). CTMUHB sponsored and supported the research and provided salary support for LCJ. TRC acknowledges support from the MRC, which funded the computational resources used by the project (grant reference MR/L015080/1), as well as specific funding from Welsh Government, which provided funds for the sequencing and analysis of Welsh samples used in this study, via Genomics Partnership Wales. UK sequencing has been supported by the COVID-19 Genomics UK Consortium (COG-UK), as well as using genomics data that has partly been funded/generated as part of COG-UK. COG-UK is supported by funding from the Medical Research Council (MRC) part of UK Research & Innovation (UKRI), the National Institute of Health Research (NIHR) and Genome Research Limited, operating as the Wellcome Sanger Institute. JAS is funded by the MRC (MR/T030062/1) and TW is funded by the Wellcome Trust (215800/Z/19/Z). The authors gratefully acknowledge the contributors to GISAID whose sequence data enabled an examination of the global spread of P272L (full list of sequences used and acknowledgements in supplementary information) ## Footnotes * 4 [https://www.cogconsortium.uk/](https://www.cogconsortium.uk/) * Received June 21, 2021. * Revision received June 21, 2021. * Accepted June 28, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## REFERENCES 1. Sette, A. & Crotty, S. Adaptive immunity to SARS-CoV-2 and COVID-19. Cell 184, 861–880, doi:10.1016/j.cell.2021.01.007 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2021.01.007&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33497610&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 2. Rydyznski Moderbacher, C. et al. Antigen-Specific Adaptive Immunity to SARS-CoV-2 in Acute COVID-19 and Associations with Age and Disease Severity. Cell 183, 996–1012 e1019, doi:10.1016/j.cell.2020.09.038 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.09.038&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33010815&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 3. Soresina, A. et al. Two X-linked agammaglobulinemia patients develop pneumonia as COVID-19 manifestation but recover. Pediatr Allergy Immunol 31, 565–569, doi:10.1111/pai.13263 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/pai.13263&link_type=DOI) 4. Montero-Escribano, P. et al. Anti-CD20 and COVID-19 in multiple sclerosis and related disorders: A case series of 60 patients from Madrid, Spain. Mult Scler Relat Disord 42, 102185, doi:10.1016/j.msard.2020.102185 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.msard.2020.102185&link_type=DOI) 5. Novi, G. et al. COVID-19 in a MS patient treated with ocrelizumab: does immunosuppression have a protective role? Mult Scler Relat Disord 42, 102120, doi:10.1016/j.msard.2020.102120 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.msard.2020.102120&link_type=DOI) 6. Safavi, F., Nourbakhsh, B. & Azimi, A. R. B-cell depleting therapies may affect susceptibility to acute respiratory illness among patients with multiple sclerosis during the early COVID-19 epidemic in Iran. Mult Scler Relat Disord 43, 102195, doi:10.1016/j.msard.2020.102195 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.msard.2020.102195&link_type=DOI) 7. Sekine, T. et al. Robust T Cell Immunity in Convalescent Individuals with Asymptomatic or Mild COVID-19. Cell 183, 158–168 e114, doi:10.1016/j.cell.2020.08.017 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.08.017&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32979941&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 8. Nelde, A. et al. SARS-CoV-2-derived peptides define heterologous and COVID-19-induced T cell recognition. Nat Immunol 22, 74–85, doi:10.1038/s41590-020-00808-x (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41590-020-00808-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32999467&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 9. Schulien, I. et al. Characterization of pre-existing and induced SARS-CoV-2-specific CD8(+) T cells. Nat Med 27, 78–85, doi:10.1038/s41591-020-01143-2 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-020-01143-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33184509&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 10. Shomuradova, A. S. et al. SARS-CoV-2 Epitopes Are Recognized by a Public and Diverse Repertoire of Human T Cell Receptors. Immunity 53, 1245–1257 e1245, doi:10.1016/j.immuni.2020.11.004 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.immuni.2020.11.004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33326767&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 11. Weinreich, D. M. et al. REGN-COV2, a Neutralizing Antibody Cocktail, in Outpatients with Covid-19. N Engl J Med 384, 238–251, doi:10.1056/NEJMoa2035002 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2035002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33332778&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 12. Group, A.-T. L.-C. S. et al. A Neutralizing Monoclonal Antibody for Hospitalized Patients with Covid-19. N Engl J Med 384, 905–914, doi:10.1056/NEJMoa2033130 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2033130&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33356051&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 13. Liao, M. et al. Single-cell landscape of bronchoalveolar immune cells in patients with COVID-19. Nat Med 26, 842–844, doi:10.1038/s41591-020-0901-9 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-020-0901-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32398875&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 14. Zhou, R. et al. Acute SARS-CoV-2 Infection Impairs Dendritic Cell and T Cell Responses. Immunity 53, 864–877 e865, doi:10.1016/j.immuni.2020.07.026 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.immuni.2020.07.026&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32791036&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 15. McMahan, K. et al. Correlates of protection against SARS-CoV-2 in rhesus macaques. Nature 590, 630–634, doi:10.1038/s41586-020-03041-6 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-03041-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33276369&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 16. Gonzalez-Galarza, F. F. et al. Allele frequency net database (AFND) 2020 update: gold-standard data classification, open access genotype data and new query tools. Nucleic Acids Res 48, D783–D788, doi:10.1093/nar/gkz1029 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkz1029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 17. Le Bert, N. et al. SARS-CoV-2-specific T cell immunity in cases of COVID-19 and SARS, and uninfected controls. Nature 584, 457–462, doi:10.1038/s41586-020-2550-z (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2550-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32668444&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 18. Grifoni, A. et al. Targets of T Cell Responses to SARS-CoV-2 Coronavirus in Humans with COVID-19 Disease and Unexposed Individuals. Cell 181, 1489–1501 e1415, doi:10.1016/j.cell.2020.05.015 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.05.015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32473127&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 19. Ferretti, A. P. et al. Unbiased Screens Show CD8(+) T Cells of COVID-19 Patients Recognize Shared Epitopes in SARS-CoV-2 that Largely Reside outside the Spike Protein. Immunity 53, 1095–1107 e1093, doi:10.1016/j.immuni.2020.10.006 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.immuni.2020.10.006&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33128877&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 20. Alter, G. [https://www.nature.com/articles/s41586-021-03681-2](https://www.nature.com/articles/s41586-021-03681-2). (2021). 21. du Plessis, L. et al. Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science 371, 708–712, doi:10.1126/science.abf2946 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNzEvNjUzMC83MDgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wNi8yOC8yMDIxLjA2LjIxLjIxMjU5MDEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 22. Volz, E. et al. Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity. Cell 184, 64–75 e11, doi:10.1016/j.cell.2020.11.020 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2020.11.020&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33275900&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 23. Hodcroft, E. B. et al. Emergence and spread of a SARS-CoV-2 variant through Europe in the summer of 2020. medRxiv, doi:10.1101/2020.10.25.20219063 (2020). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4xMC4yNS4yMDIxOTA2M3YzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDYvMjgvMjAyMS4wNi4yMS4yMTI1OTAxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 24. Ramasamy, M. N. et al. Safety and immunogenicity of ChAdOx1 nCoV-19 vaccine administered in a prime-boost regimen in young and old adults (COV002): a single-blind, randomised, controlled, phase 2/3 trial. Lancet 396, 1979–1993, doi:10.1016/S0140-6736(20)32466-1 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)32466-1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33220855&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 25. Polack, F. P. et al. Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine. N Engl J Med 383, 2603–2615, doi:10.1056/NEJMoa2034577 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2034577&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33301246&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 26. Collins, D. R., Gaiha, G. D. & Walker, B. D. CD8(+) T cells in HIV control, cure and prevention. Nat Rev Immunol 20, 471–482, doi:10.1038/s41577-020-0274-9 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41577-020-0274-9&link_type=DOI) 27. Leslie, A. et al. Transmission and accumulation of CTL escape variants drive negative associations between HIV polymorphisms and HLA. J Exp Med 201, 891–902, doi:10.1084/jem.20041455 (2005). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamVtIjtzOjU6InJlc2lkIjtzOjk6IjIwMS82Lzg5MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA2LzI4LzIwMjEuMDYuMjEuMjEyNTkwMTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 28. Price, D. A. et al. Positive selection of HIV-1 cytotoxic T lymphocyte escape variants during primary infection. Proc Natl Acad Sci U S A 94, 1890–1895 (1997). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czo5OiI5NC81LzE4OTAiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wNi8yOC8yMDIxLjA2LjIxLjIxMjU5MDEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 29. Price, D. A. et al. The influence of antigenic variation on cytotoxic T lymphocyte responses in HIV-1 infection. J Mol Med 76, 699–708 (1998). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s001090050270&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9766848&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 30. Plante, J. A. et al. Spike mutation D614G alters SARS-CoV-2 fitness. Nature 592, 116–121, doi:10.1038/s41586-020-2895-3 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2895-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33106671&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 31. Harvey, W. T. et al. SARS-CoV-2 variants, spike mutations and immune escape. Nature reviews. Microbiology, doi:10.1038/s41579-021-00573-0 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41579-021-00573-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34075212&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 32. Tarke, A. et al. Negligible impact of SARS-CoV-2 variants on CD4 (+) and CD8 (+) T cell reactivity in COVID-19 exposed donors and vaccinees. bioRxiv, doi:10.1101/2021.02.27.433180 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYmlvcnhpdiI7czo1OiJyZXNpZCI7czoxOToiMjAyMS4wMi4yNy40MzMxODB2MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA2LzI4LzIwMjEuMDYuMjEuMjEyNTkwMTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 33. Motozono, C. et al. SARS-CoV-2 spike L452R variant evades cellular immunity and increases infectivity. [https://doi.org/10.1016/j.chom.2021.06.006](https://doi.org/10.1016/j.chom.2021.06.006) (2021). 34. Agerer, B. et al. SARS-CoV-2 mutations in MHC-I-restricted epitopes evade CD8(+) T cell responses. Sci Immunol 6, doi:10.1126/sciimmunol.abg6461 (2021). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTA6ImltbXVub2xvZ3kiO3M6NToicmVzaWQiO3M6MTM6IjYvNTcvZWFiZzY0NjEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wNi8yOC8yMDIxLjA2LjIxLjIxMjU5MDEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 35. Abi-Rached, L. et al. The shaping of modern human immune systems by multiregional admixture with archaic humans. Science 334, 89–94, doi:10.1126/science.1209202 (2011). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjExOiIzMzQvNjA1Mi84OSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA2LzI4LzIwMjEuMDYuMjEuMjEyNTkwMTAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 36. Bridgeman, J. S., Sewell, A. K., Miles, J. J., Price, D. A. & Cole, D. K. Structural and biophysical determinants of alphabeta T-cell antigen recognition. Immunology 135, 9–18, doi:10.1111/j.1365-2567.2011.03515.x (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1365-2567.2011.03515.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22044041&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 37. Skowera, A. et al. CTLs are targeted to kill beta cells in patients with type 1 diabetes through recognition of a glucose-regulated preproinsulin epitope. J Clin Invest 118, 3390–3402, doi:10.1172/JCI35449 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1172/JCI35449&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18802479&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000259828600020&link_type=ISI) 38. Wooldridge, L. et al. A single autoimmune T cell receptor recognizes more than a million different peptides. J Biol Chem 287, 1168–1177, doi:10.1074/jbc.M111.289488 (2012). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiamJjIjtzOjU6InJlc2lkIjtzOjEwOiIyODcvMi8xMTY4IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDYvMjgvMjAyMS4wNi4yMS4yMTI1OTAxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 39. Bulek, A. M. et al. Structural basis for the killing of human beta cells by CD8(+) T cells in type 1 diabetes. Nat Immunol 13, 283–289, doi:10.1038/ni.2206 (2012). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ni.2206&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22245737&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 40. Cole, D. K. et al. Computational design and crystal structure of an enhanced affinity mutant human CD8 alphaalpha coreceptor. Proteins 67, 65–74 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/prot.21176&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17243170&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 41. Rees-Spear, C. et al. The effect of spike mutations on SARS-CoV-2 neutralization. Cell reports 34, 108890, doi:10.1016/j.celrep.2021.108890 (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.celrep.2021.108890&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33713594&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 42. Volz, E. et al. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature, doi:10.1038/s41586-021-03470-x (2021). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-021-03470-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33767447&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 43. Krammer, F. SARS-CoV-2 vaccines in development. Nature 586, 516–527, doi:10.1038/s41586-020-2798-3 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2798-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32967006&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 44. Cole, D. K. et al. Dual Molecular Mechanisms Govern Escape at Immunodominant HLA A2-Restricted HIV Epitope. Frontiers in immunology 8, 1503, doi:10.3389/fimmu.2017.01503 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2017.01503&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29209312&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 45. consortiumcontact@cogconsortium.uk, C.-G. U. An integrated national scale SARS-CoV-2 genomic surveillance network. Lancet Microbe 1, e99–e100, doi:10.1016/S2666-5247(20)30054-9 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2666-5247(20)30054-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32835336&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 46. Swofford, D. L. & Maddison, W. P. Reconstructing ancestral character states under Wagner parsimony. Mathematical Biosciences 87, 199–229 (1987). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/0025-5564(87)90074-5&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1987M322500005&link_type=ISI) 47. Masking strategies for SARS-CoV-2 alignments. [https://virological.org/t/masking-strategies-for-sars-cov-2-alignments/480](https://virological.org/t/masking-strategies-for-sars-cov-2-alignments/480) (2021). 48. Minh, B. Q. et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol 37, 1530–1534, doi:10.1093/molbev/msaa015 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msaa015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32011700&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 49. Hasegawa, M., Kishino, H. & Yano, T. Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J mol Evol 22, 160–174 [https://doi.org/110.1007/BF02101694](https://doi.org/110.1007/BF02101694) (1985). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02101694&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=3934395&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1985AWB1600007&link_type=ISI) 50. Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: Improving the Ultrafast Bootstrap Approximation. Mol Biol Evol 35, 518–522, doi:10.1093/molbev/msx281 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/molbev/msx281&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29077904&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 51. Yu, G. C., Smith, D. K., Zhui, H. C., Guan, Y. & lam, T. T. Y. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods in Ecology and Evolution 8, doi:10.1111/2041-210X.12628 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/2041-210X.12628&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8015439&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 52. Dolton, G. et al. More tricks with tetramers: a practical guide to staining T cells with peptide-MHC multimers. Immunology 146, 11–22, doi:10.1111/imm.12499 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/imm.12499&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26076649&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 53. Wooldridge, L. et al. Tricks with tetramers: how to get the most from multimeric peptide-MHC. Immunology 126, 147–164, doi:10.1111/j.1365-2567.2008.02848.x (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1365-2567.2008.02848.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19125886&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000262227200001&link_type=ISI) 54. Theaker, S. M. et al. T-cell libraries allow simple parallel generation of multiple peptide-specific human T-cell clones. J Immunol Methods 430, 43–50, doi:10.1016/j.jim.2016.01.014 (2016). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jim.2016.01.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26826277&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 55. Rius, C. et al. Peptide-MHC Class I Tetramers Can Fail To Detect Relevant Functional T Cell Clonotypes and Underestimate Antigen-Reactive T Cell Populations. J Immunol 200, 2263–2279, doi:10.4049/jimmunol.1700242 (2018). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiamltbXVub2wiO3M6NToicmVzaWQiO3M6MTA6IjIwMC83LzIyNjMiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wNi8yOC8yMDIxLjA2LjIxLjIxMjU5MDEwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 56. Dolton, G. et al. Optimised peptide-MHC multimer protocols for detection and isolation of autoimmune T-cells. Frontiers in immunology doi: 10.3389/fimmu.2018.01378 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2018.01378&link_type=DOI) 57. Lissina, A. et al. Protein kinase inhibitors substantially improve the physical detection of T-cells with peptide-MHC tetramers. J Immunol Methods 340, 11–24 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jim.2008.09.014&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18929568&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000262869800002&link_type=ISI) 58. Tungatt, K. et al. Antibody stabilization of peptide-MHC multimers reveals functional T cells bearing extremely low-affinity TCRs. J Immunol 194, 463–474, doi:10.4049/jimmunol.1401785 (2015). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiamltbXVub2wiO3M6NToicmVzaWQiO3M6OToiMTk0LzEvNDYzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDYvMjgvMjAyMS4wNi4yMS4yMTI1OTAxMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 59. Galloway, S. A. E. et al. Peptide Super-Agonist Enhances T-Cell Responses to Melanoma. Frontiers in immunology 10, 319, doi:10.3389/fimmu.2019.00319 (2019). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fimmu.2019.00319&link_type=DOI) 60. Bolotin, D. A. et al. MiXCR: software for comprehensive adaptive immunity profiling. Nature methods 12, 380–381, doi:10.1038/nmeth.3364 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nmeth.3364&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25924071&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom) 61. Shugay, M. et al. VDJdb: a curated database of T-cell receptor sequences with known antigen specificity. Nucleic Acids Res 46, D419–D427, doi:10.1093/nar/gkx760 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/nar/gkx760&link_type=DOI) 62. Shugay, M. et al. VDJtools: Unifying Post-analysis of T Cell Receptor Repertoires. PLoS Comput Biol 11, e1004503, doi:10.1371/journal.pcbi.1004503 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1004503&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26606115&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F06%2F28%2F2021.06.21.21259010.atom)