Abstract
Patients with Lynch syndrome (LS) are prone to cancer due to heterozygous germline pathogenic variants in genes encoding DNA mismatch repair proteins MLH1, MSH2, MSH6 and PMS2. LS cancer cells exhibit deficient DNA mismatch repair and microsatellite instability due somatic inactivation of the second copy of the affected gene. To study microsatellite characteristics in non-neoplastic cells in LS we determined CAG repeat size in the huntingtin gene (HTT) microsatellite in lymphocyte DNA from LS patients with germline pathogenic variants in MLH1 (n = 11), MSH2 (n = 9), MSH6 (n = 7) and non-LS controls (n=19). Mean repeat size in LS was 19,55 CAG (MLH1), 19,39 CAG (MSH2), 18.07 CAG (MSH6), respectively compared to 18,42 CAG in controls. Standard deviation for CAG repeat size in LS was 4,183 CAG (MLH1), 5,089 CAG (MSH2), 3,075 CAG (MSH6), respectively, compared to 3,342 CAG in controls. Peak CAG repeat size in LS was 32 CAG (MLH1), 32 CAG (MSH2), 24 CAG (MSH6), respectively compared to 27 CAG in controls. Collectively, our data indicate that HTT CAG repeat size tends to be larger and more variable in individuals with LS caused by pathogenic variants in MLH1 and MSH2.
Introduction
Lynch syndrome (LS) is a multiorgan cancer predisposition syndrome caused by germline heterozygous pathogenic variants (PV) in the mismatch repair (MMR) genes MLH1, MSH2, MSH6 or PMS2 1. LS cancers exhibit deficient MMR (dMMR) and microsatellite instability (MSI) due to somatic inactivation of the remaining allele of the affected MMR gene 1. In the vast majority of patients, the underlying germline PV is inherited, and as a consequence there is often an accumulation of LS cancers (predominantly colon cancer and uterus cancer) in the affected family branch. Average age at onset is lower for LS cancers than for sporadic cancers. In addition, anticipation, i.e. lower age at disease onset in subsequent generations has been reported in LS 2,3 although the phenomenon has been questioned in the absence of mechanistic explanation or explained by a birth-cohort bias 4,5. Yet, as the MMR system is ubiquitous to maintain nuclear genome stability 6 dysfunctional MMR alleles could conceivably contribute to an increased mutational load including in germ cells and hence acquired genetic aberrations could be transmitted to the next generation. Indeed, the state of haploidy in gametes implies that half of the germ cells in patients with LS have no functional copy of the affected MMR gene and could therefore be subject to dMMR. However, a recent whole genome sequencing (WGS) effort revealed no evidence for altered mutational load in non-neoplastic tissue in LS patients 7. Since WGS technologies still lack sufficient resolution in regions with short tandem repeats (STR), i.e. in regions with microsatellites, we herein have used a PCR-based clinical grade high-resolution DNA fragment analysis to determine the range of CAG repeats in the huntingtin gene (HTT) microsatellite in patients with LS and matched controls.
Materials and Methods
Genomic DNA extracted from blood samples from patients investigated for non-polyposis hereditary colorectal cancer (CRC) in the Southern health care region in Sweden during 1997-2012 and shown to have either a dMMR colorectal cancer (CRC) and a PV diagnostic for LS or a MMR proficient CRC (referred to as controls in the present study) were retrieved from the Skåne university hospital biobank and anonymized. LS patients with PV in PMS2 were not included in this study for reasons of integrity as they were too few cases in the clinical registry. HTT CAG repeat size estimation was performed using PCR amplification and capillary electrophoresis fragment analysis with a validated accuracy of ± 1 CAG repeat for alleles with < 45 repetitions and ± 3 CAG repetitions for alleles with 45 or more repeats as described 8. HTT was chosen as it contains a well-characterized STR known to contain pathogenic CAG repeat expansions causative for Huntington disease (HD) and since the HTT CAG pathogenic repeat is thought to further expand somatically in an MMR-dependent manner 9.
Statistical analyses
HTT CAG repeat values were converted to integers according to clinical genetic laboratory diagnostic routines for Huntington disease 8. The CAG repeat size estimation error +/-1 was excluded from statistical calculations. Due to limited number of patients and incomplete normal distribution of histogram data, Mann-Whitney U and Kruskal-Wallis tests were used. CAG repeat size was analyzed either as unpaired values (independent alleles) or as paired (i.e. as the sum of CAG repeats for both HTT alleles in each patient). A CAG repeat size of of 18,42 triplets was set at baseline in accordance with values from patient controls in the present study and in line with published data from 7379 unselected individuals in Sweden (Sundblom et al., 2020). SPSS Statistics for Windows (SPSS Inc., Chicago, Ill., USA) software was used. P-values < 0.05 were considered significant.
Ethical approvals
Approvals and decisions were received from The Regional Ethical Review Board in Lund (application no. 2013/468) and from the Swedish Ethical Review Agency (application no. 2019-02312 and application no. 2021-06254-02).
Results
CAG repeat size was determined for both HTT alleles in LS patients with PV in MLH1 (n = 11), MSH2 (n = 9), MSH6 (n = 7) and in control patients (n=19) (Table 1, Figure 1). Mean CAG repeat size for unpaired HTT alleles in the LS subgroups was 19,55 CAG (MLH1), 19,39 CAG (MSH2), 18,07 CAG (MSH6), respectively compared to 18,42 CAG in controls (Figure 2). Standard deviation (SD) for CAG repeat size for unpaired HTT alleles in the LS subgroups was 4,183 CAG (MLH1), 5,089 CAG (MSH2), 3,075 CAG (MSH6), respectively, compared to 3,342 CAG in controls (Figure 2). SD for the sum of CAG repeat size for paired HTT alleles in LS subgroups was 6,640 CAG (MLH1), 6,667 CAG (MSH2), 3,761 CAG (MSH6), respectively, compared to 4,375 CAG in controls (Figure 3). Peak CAG repeat size in LS subgroups was 32 CAG (MLH1), 32 CAG (MSH2), 24 CAG (MSH6), respectively compared to 27 CAG in controls (Table 1; Figure 1). Mean sum of CAG repeat size for paired HTT alleles in LS subgroups was 39,09 CAG (MLH1), 38,78 CAG (MSH2), 36,14 CAG (MSH6), respectively compared to 36,84 CAG in controls (Figure 3). Differences observed between the LS subgroups or between LS subgroups and controls were not statistically significant (data not shown).
Conclusion
In this study we hypothesized that heterozygosity for PV in MMR genes could have an impact on microsatellite size in non-neoplastic cells. For this purpose we studied CAG repeat size in the HTT microsatellite in genetic subgroups of patients with LS. We found that CAG repeat size in LS patients with PV in MLH1 and MSH2 tended to be larger and more variable compared to patients with a PV in MSH6 and non-LS controls. The differences observed between the groups were however not statistical significant. The study of a larger group of patients with LS or inter-generational studies of LS family members could possibly clarify whether or not PV in MMR genes affect microsatellite length in germline DNA.
Data Availability
All data produced in the present study are available upon reasonable request to the authors.
Acknowledgements
We wish to thank the Department of Clinical Genetics and Pathology, Office for Medical Service for support regarding personnel, equipment and materials in this study.