ABSTRACT
Importance Incidence, prevalence, and survival are important measures to inform the management and provision of head and neck cancer care.
Objective To calculate the incidence, prevalence, and survival rates for head and neck cancers and subsites in the UK from 2000 to 2021.
Design, Setting, and Participants This population-based cohort study uses routinely collected primary care data from the UK. Patients aged 18 years or older with at least 1 year of history registered in Clinical Practice Research Datalink (CPRD) GOLD or Aurum were included. Data were analyzed from January 2023 to March 2024.
Main Outcomes and Measures Head and neck cancer incidence rates (IR), period prevalence (PP), and one-, five-, and ten-year survival after diagnosis between 2000 and 2021, stratified by age and calendar years.
Results There were 12,455 HNC patients (69.2% male) with a median age of 64 years. Crude incidence increased from 9.08 (7.88 to 10.42) per 100,000 person-years in 2000 to 15.59 (14.07 to 17.23) in 2021 in CPRD GOLD, with similar rates in Aurum. Age standardization attenuated incidence rises for HNC subsites apart from oropharynx and tongue. Prevalence increased for both databases, from 0.04% in 2000 to 0.12% in 2019. HNC five-year survival increased from 53.8% (95% CI, 51.4% to 56.3%) in 2000-2004 to 58.7% (56.5% to 60.9%) in 2015-2019.
Conclusions and Relevance Increases in HNC over recent decades are likely due to ageing, whereas increases in specific subsites such as oropharyngeal cancer are likely due to other behavioural risk factors. The small improvements in survival highlight that more research is needed to improve earlier diagnosis, which will lead to better patient outcomes.
Question What is the disease burden of head and neck cancers (HNC) in the UK from 2000 to 2021?
Findings For all HNC combined, incidence and prevalence increased, with five-year survival improving slightly over time. Results varied across subsites.
Meaning Our findings show that increases in HNC are likely due to ageing, with increases in certain subsites due to other behavioural risk factors. The small increase in survival highlights that more research is needed to improve earlier diagnosis, leading to better patient outcomes.
INTRODUCTION
Head and neck cancers (HNC) are a heterogeneous group of cancers affecting the upper aerodigestive tract, including the oral cavity, pharynx, larynx, paranasal sinuses, nasal cavity, and salivary glands. Combined, these cancers are the seventh most common cancer worldwide, accounting for more than 660,000 new cases and 325,000 deaths each year 1.
The etiology and epidemiology of HNC subsites can vary, with distinct characteristics in terms of histology subtypes, symptoms, treatment approaches, and prognosis, making it necessary to consider them as separate entities 2. The overall incidence of HNC has been shown to be increasing over time, with marginal improvements in survival in the United Kingdom (UK) and in other countries6,7,11. However, specific risk factors such as tobacco and alcohol consumption and human papillomavirus (HPV) infection have impacted the disease burden of the different HNC subtypes 9 10.
Understanding trends in incidence, prevalence, and overall survival of HNC and subsites is important to inform decisions regarding screening, prevention, treatment, and disease management. Despite changes in HPV infections and vaccinations and a decline in tobacco and alcohol usage in recent years, an up-to-date comprehensive assessment of the trends of HNC and subsites is lacking. The aim of this study is to describe HNC and subsite trends in terms of incidence, prevalence, and survival from 2000 to 2021 using primary care data from the UK.
METHODS
Study design, setting, and data sources
We carried out a population-based cohort study using routinely collected primary care data from the UK. People with a diagnosis of HNC and a denominator population were identified from the CPRD GOLD database. We repeated the study using CPRD Aurum for comparison. Both databases contain pseudonymised patient-level information on demographics, lifestyle data, clinical diagnoses, prescriptions, and preventive care and are broadly representative15. Both databases were mapped to the Observational Medical Outcomes Partnership Common Data Model16,17.
Study participants
Eligible individuals were 18 years or older with one year of prior history in the database from 1st January 2000. These individuals were followed up to whichever came first: HNC diagnosis, exit from database, date of death, or study end (December 31, 2021, for GOLD and, due to data availability, December 31, 2019, for Aurum). For survival analysis, individuals were followed from cancer diagnosis to either death, exit from database, or study end.
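The follow-up rule described above (stop at whichever comes first) can be illustrated with a minimal Python sketch. This is not the study's analytical code, which was written in R; the function names and the example dates are hypothetical, and a missing date is taken to mean that the event did not occur.

```python
from datetime import date

# Study end for GOLD as stated in the text; Aurum would use date(2019, 12, 31).
GOLD_STUDY_END = date(2021, 12, 31)


def follow_up_end(study_end, hnc_diagnosis=None, database_exit=None, death=None):
    """Return the earliest of the candidate end dates (None = event never occurred)."""
    candidates = [d for d in (hnc_diagnosis, database_exit, death, study_end)
                  if d is not None]
    return min(candidates)


def person_years(start, end):
    """Time at risk in years between two dates (assumes 365.25 days per year)."""
    return max((end - start).days, 0) / 365.25


# Hypothetical person: enters on 1 January 2000, diagnosed in 2010, leaves the database in 2015.
end = follow_up_end(GOLD_STUDY_END,
                    hnc_diagnosis=date(2010, 6, 15),
                    database_exit=date(2015, 3, 1))
time_at_risk = person_years(date(2000, 1, 1), end)
```

In this example the person contributes time at risk only up to the diagnosis date, because that is the first of the competing end points.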
Outcome definitions
We used diagnostic codes to identify HNC and subsites. Diagnostic codes indicative of either nonmalignant cancer or metastasis were excluded (apart from prevalence analyses) as well as melanoma and lymphoma codes. Subsites of HNC were categorised as cancers of the oral cavity, tongue and lingual tonsils, nasal cavity and sinuses, salivary glands, hypopharynx, nasopharynx, oropharynx, and larynx. Clinical codelists are provided in supplement S1 with all analytical code in GitHub to enable reproducibility58. For survival analyses, mortality was defined as all-cause mortality from date of death.
Statistical methods
The characteristics of HNC patients were summarised, with median (IQR) for continuous variables and counts with percentages for categorical variables.
For incidence, number of events, observed time at risk, and incidence rates (IR) per 100,000 person-years were summarised along with 95% confidence intervals (CI). Annualised IR were calculated with the number of incident cases as the numerator and the number of person-years in the general population within that year as the denominator, from 2000 to 2021. Age-standardized IR were calculated using the 2013 European Standard Population19.
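As a rough illustration of these calculations, the Python sketch below computes a crude IR per 100,000 person-years with an approximate 95% CI, and a directly age-standardized rate. It is a sketch under stated assumptions rather than the method used in the study: the CI uses a log-normal approximation (the text does not state which interval was used), and the age bands, counts, and weights are invented toy values, not the 2013 European Standard Population.

```python
import math


def incidence_rate(events, person_years, per=100_000, z=1.96):
    """Crude rate per `per` person-years with an approximate 95% CI.

    Assumption: log-normal approximation, rate * exp(+/- z / sqrt(events)).
    """
    if events <= 0 or person_years <= 0:
        raise ValueError("events and person_years must be positive")
    rate = events / person_years * per
    factor = math.exp(z / math.sqrt(events))
    return rate, rate / factor, rate * factor


def directly_standardized_rate(events, person_years, weights, per=100_000):
    """Weighted average of age-specific rates, weighted by a standard population."""
    if not (len(events) == len(person_years) == len(weights)):
        raise ValueError("all inputs need one entry per age band")
    weighted = sum(w * d / py for d, py, w in zip(events, person_years, weights))
    return weighted / sum(weights) * per


# Toy data for three hypothetical age bands (illustrative only).
events = [10, 40, 50]
pyears = [100_000, 100_000, 50_000]
weights = [0.5, 0.3, 0.2]  # placeholder standard-population shares

crude, lower, upper = incidence_rate(sum(events), sum(pyears))
standardized = directly_standardized_rate(events, pyears, weights)
```

With these toy inputs the crude rate is 40 per 100,000 person-years while the standardized rate is 37, because the standard population gives less weight to the oldest, highest-rate band than the observed population does. This is the mechanism by which age standardization can attenuate a crude increase in an ageing population.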
Period prevalence was calculated on 1st January for the years 2000 to 2021 with HNC patients as the numerator. The denominator was the number of patients in the respective years. The number of events and prevalence (%) were summarised along with 95% CI.
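A prevalence estimate of this kind is a simple proportion, sketched below in Python. The interval method is an assumption: the text does not say how the 95% CI was derived, so a Wilson score interval is used here for illustration, and the counts in the example are invented.

```python
import math


def prevalence_with_wilson_ci(cases, population, z=1.96):
    """Prevalence proportion with a Wilson score 95% CI (assumed interval method)."""
    if population <= 0 or not 0 <= cases <= population:
        raise ValueError("need 0 <= cases <= population and population > 0")
    p = cases / population
    denom = 1 + z ** 2 / population
    centre = (p + z ** 2 / (2 * population)) / denom
    half_width = z * math.sqrt(p * (1 - p) / population
                               + z ** 2 / (4 * population ** 2)) / denom
    return p, max(0.0, centre - half_width), min(1.0, centre + half_width)


# Hypothetical example: 130 prevalent cases among 100,000 registered patients.
p, lower, upper = prevalence_with_wilson_ci(130, 100_000)
as_percent = tuple(round(100 * x, 2) for x in (p, lower, upper))
```

The Wilson interval is a common choice for small proportions such as these because, unlike the simple normal approximation, it cannot fall below zero.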
For survival analysis, we used the Kaplan-Meier method to estimate the overall survival probability with 95% CI. We estimated median survival and survival one, five, and ten years after diagnosis. Patients whose death and cancer diagnosis occurred on the same date were removed.
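The Kaplan-Meier estimate can be sketched in a few lines of plain Python. This is an illustrative implementation with hypothetical function names and toy data, not the survival R package used in the study; confidence intervals are omitted for brevity.

```python
def kaplan_meier(times, events):
    """Return [(time, survival probability)] at each distinct death time.

    times: follow-up in years from diagnosis; events: True = died, False = censored.
    """
    # Mirror the exclusion described above: drop deaths recorded on the diagnosis date.
    data = sorted((t, int(e)) for t, e in zip(times, events) if not (e and t == 0))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = removed = 0
        # Group everyone whose follow-up ends at time t (deaths and censorings).
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= removed
    return curve


def survival_at(curve, t):
    """Survival probability at time t (step function, 1.0 before the first death)."""
    result = 1.0
    for time, surv in curve:
        if time > t:
            break
        result = surv
    return result


def median_survival(curve):
    """First time at which survival drops to 0.5 or below; None if never reached."""
    return next((time for time, surv in curve if surv <= 0.5), None)


# Toy data: seven hypothetical patients, the first of whom died on the diagnosis date.
times = [0, 1, 2, 2, 3, 4, 5]
died = [True, True, True, False, True, False, True]
curve = kaplan_meier(times, died)
median = median_survival(curve)
```

Censored patients (those who exit the database or reach study end alive) stay in the risk set until their last follow-up time, which is what distinguishes this estimate from a simple proportion of deaths.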
All results were stratified by sex and by age group. For survival analysis, we also stratified by calendar time of cancer diagnosis. To avoid possible patient re-identification, we do not report results with fewer than five cases. To replicate the results from GOLD, the same analysis was performed using Aurum, except for the stratification by calendar time of cancer diagnosis, which was conducted in GOLD only.
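The small-count rule can be expressed as a simple masking step, sketched below in Python. The function name and strata are hypothetical, and the text does not state whether zero counts are also masked; this sketch masks every count below the threshold.

```python
MIN_CELL_COUNT = 5  # results with fewer than five cases are not reported


def suppress_small_counts(counts, minimum=MIN_CELL_COUNT):
    """Replace counts below `minimum` with None so they are not reported."""
    return {stratum: (n if n >= minimum else None) for stratum, n in counts.items()}


# Hypothetical stratified counts (illustrative only).
reported = suppress_small_counts({"18-29": 4, "30-39": 5, "60-69": 120})
```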
Analyses were performed in R version 4.2.3 using the IncidencePrevalence 20 and survival 41 R packages. Data were analyzed from January 2023 to March 2024. Analyses were descriptive and no tests for statistical significance were performed.
RESULTS
Patient characteristics
Overall, there were 11,388,117 eligible patients with at least one year of prior history from 2000 to 2021 in GOLD. Attrition tables are in Supplement S2. A summary of baseline characteristics of HNC patients is shown in Table 1. All study results can be found in an interactive web application42.
There were 12,455 HNC patients (69.2% male), with a median age of 64 (56 to 73) years. The largest group of patients was aged 60-69 years, accounting for 30% of diagnosed patients, with similar results in Aurum (Supplement S3).
Characteristics by HNC subsite (Supplement S4) showed that cancers of the larynx (22%), tongue (21%), and oropharynx (18.6%) were most common, with nasal cavity and sinus cancers being the rarest (2.1%). Patients were more likely to be male (69%), with laryngeal cancer having the highest proportion of males (81%). Across all subsites, the highest proportion of diagnoses was in those aged 60-69 years, apart from nasal cavity and sinuses and salivary gland cancers, where the highest proportion of patients were 70-79 years old, and nasopharyngeal cancer, where the highest proportion of patients were 50-59 years of age. Patients with cancers of the hypopharynx, larynx, and oral cavity had higher percentages of comorbidities. For most subsites (larynx, oral cavity, tongue, oropharynx, hypopharynx), smokers made up the highest proportion of patients (37.3% to 53.7%).
Incidence rates stratified by calendar year, age, and sex
In GOLD, crude overall IR of HNC from 2000 to 2021 was 14.2 (14.0 to 14.5) per 100,000 person-years. Males had higher overall IR (20.0 [19.6 to 20.4]) compared to females (8.6 [8.4 to 8.9]), with similar values in Aurum. The HNC subsite with the highest overall IR was laryngeal cancer (3.2 per 100,000 person-years) whereas nasal cavity and sinuses (0.3 per 100,000 for both databases) had the lowest. Across all subsites males had higher overall IR compared to females (supplement S5).
Crude annualised IR for HNC increased across the study period (Figure 1). For HNC subsites, IR for cancers of the nasopharynx, oropharynx, salivary glands, and tongue increased over the study period. IR for hypopharyngeal cancer increased from 2000 to 2009 before stabilising in GOLD or reducing in Aurum. For cancers of the larynx and nasal cavity and sinuses, IR remained stable in GOLD, with a decrease for laryngeal cancer in Aurum. For cancer of the oral cavity, IR gradually increased from 2000 to 2010-11, dropped in 2012, and then increased again. Stratification by sex revealed similar trends over time, with higher IR in males for all subsites apart from nasal cavity and sinuses cancers (Supplement S6). Age-standardized IR showed that trends attenuated or stabilised apart from cancers of the tongue and oropharynx, particularly in males (Supplement S7). Compared with cancer registry data across the UK43,44,45, IR for HNC combined were lower but showed comparable trends (Supplement S8).
Figure 1. Crude annualised incidence rates for HNC and subsites stratified by database (2004 marked the introduction of the Quality and Outcomes Framework in the UK).
Overall IR increased with age, peaking at 70-79 years old: 33.2 (31.9 to 34.4) per 100,000 person-years in CPRD GOLD, with similar values in Aurum (28.5 [27.7 to 29.3]). For cancers of the oropharynx and tongue, IR was highest in those aged 60-69 years, whereas for cancers of the larynx, hypopharynx, and nasopharynx, overall IR was highest in those aged 70-79 years. For cancers of the oral cavity, those aged 80-89 years had the highest overall IR, whereas for the salivary gland, IR was highest in those 90 years and older. For cancers of the nasal cavity and sinus, IR were highest in those aged 70-79 years in GOLD and in those aged 80-89 years in Aurum (Supplement S9).
IR were relatively stable over time for those aged 30-39 and 90+, whereas IR for those aged 40-59 and 80-89 increased to 2006 before stabilising. IR for those aged 60-79 gradually increased over time. In GOLD, IR decreased in 2020 for those aged 40-69 and those 90 years and older (Figure 3). Stratifying by age and sex showed similar trends, with higher IR and larger increases in IR over time for males (Supplement S10).
Annualised IR stratified by age group for each HNC subsite showed some similar trends to Figure 3 (Supplement S11). For cancers of the hypopharynx and oral cavity, IR remained relatively stable in those aged 50-59 years. For laryngeal cancer, IR decreased over time in those aged 50-69 years, whereas IR were stable over time in those aged 40-49 and 70-89 years. For cancers of the oropharynx, salivary gland, and tongue, IR increased over time for all age groups (Supplement S12.1-S12.6).
Crude period prevalence stratified by calendar year, age, and sex
PP for HNC in 2021 for GOLD was 0.13% (0.12% to 0.13%). PP in 2021 was 2.3-fold higher for males (0.18% [0.17% to 0.18%]) compared to females (0.08% [0.07% to 0.08%]). PP increased 3.6-fold from 2000 to 2021 in GOLD, with fold increases of 3.3 and 3.7 for females and males respectively. PP in 2021 for HNC subsites was highest for cancers of the tongue, larynx, and oropharynx and lowest for cancers of the nasal cavity and sinuses. Males had the highest PP in 2021 (0.04%) for cancers of the larynx, tongue, and oropharynx, whereas females had the highest PP in 2021 (0.02%) for cancers of the tongue, oral cavity, and oropharynx, with similar trends in Aurum.
Annualised crude PP increased over time for HNC and subsites apart from hypopharynx, nasopharynx, and oral cavity, where PP decreased from 2019 (Figure 3). Similar trends were seen for both databases, with some exceptions. In Aurum, PP for hypopharyngeal cancer decreased from 2012, PP for oral cavity cancer was stable from 2010 onwards, and PP for laryngeal cancer was stable from 2010 before declining from 2015. Oropharyngeal cancer had the largest fold increase in PP across the study period for both databases. Sex stratification showed similar trends (supplement S13). Age stratification showed that PP for HNC increased over time for those over 60 years of age. For those aged 40-59, PP increased before stabilising from 2015. Stratification by sex and age showed that males had higher prevalence from 40 years of age (supplement S14). Annualised PP stratified by age group for each HNC subsite showed some similar trends (Supplement S15), with exceptions for hypopharyngeal and laryngeal cancer (Supplement S16.1-S16.7).
Survival with age, sex, and calendar year stratification
In GOLD there were 12,381 patients with 5,662 deaths (45.7% of patients) over the study period. Median survival for GOLD was 7.0 (6.6 to 7.5) years. Survival at one, five, and ten years after diagnosis was 81%, 56%, and 41% in GOLD, with similar results in Aurum (Supplement S17).
Females had a longer median survival (8.2 years) compared to males (6.5 years), with better five-year survival (59.4%) compared to males (55.1%). Cancer of the hypopharynx had the lowest median survival, at 3.4 years in GOLD and 2.6 years in Aurum (supplement S18). In GOLD, cancer of the oral cavity had the longest median survival (8.7 years), whereas in Aurum, cancer of the salivary glands had the longest median survival (9.6 years).
Across the different subsites, median survival was similar for both databases apart from cancers of the nasopharynx (6.1 years in GOLD; 9.1 years in Aurum) and salivary glands, where median survival differed between databases by more than one year. Stratifying by sex, females showed slightly longer median survival for most subsites.
Overall, short- and long-term survival was similar for most subsites, with one-, five-, and ten-year survival of around 77-83%, 50-60%, and 40-50% respectively, apart from cancers of the hypopharynx and the nasal cavity and sinuses, for which survival was lower (Table 2). There were no sex differences in short- or long-term survival apart from cancer of the salivary gland, where females had better short- and long-term survival.
Stratification by calendar year in GOLD (supplement S19) showed that short-term survival has not improved; for oral cancer, it decreased from 89.8% (86.5-93.3) in 2000-2004 to 82.6% (78.9-86.4) in 2015-2019. Long-term survival for HNC improved from 53.8% (51.4-56.3) in 2000-2004 to 58.7% (56.5-60.9) in 2015-2019. Five-year survival for nasopharyngeal cancer improved from 32.8% (22.0-49.0) to 63.1% (52.4-76.0), although with wide confidence intervals.
DISCUSSION
This study provides a comprehensive analysis of secular trends in HNC incidence, prevalence, and survival in the UK. Our analysis reveals notable increases in incidence and prevalence of HNC, likely due to an ageing population, particularly among males and individuals aged 70-79 years. Rises are primarily attributed to cancers of the tongue, salivary gland, and oropharynx. Laryngeal cancer appears to be stabilizing or, in some cases, decreasing. Patients diagnosed with hypopharyngeal or nasal cavity and sinus cancers had the poorest survival compared with other HNC subsites. There has been a slight improvement in long-term survival rates across all HNC.
Our findings align with global trends, which have reported increases in HNC incidence and prevalence over recent decades7, 12, 22, particularly for cancers of the tongue, oropharynx, and salivary glands 7. While our IR for HNC broadly align with National Cancer Statistics across the UK 23,43,44,45, variations across countries, particularly for specific subsites, are likely due to different classification of subsites56 as well as differing risk factors among populations 8, 24. However, evidence suggests demographic profiles have remained unchanged over time despite rising oropharyngeal cancer incidence and stable or declining laryngeal cancer rates57.
Reasons for increases in HNC cases are multifaceted and vary by subsite due to different etiologies and specific risk factors8. Smoking and alcohol consumption are well-known behavioral risk factors associated with HNC, particularly among males and especially for oral cavity and larynx cancers, and could have contributed to the rising numbers of HNC13,25,26,27. However, other studies have shown IR for these subsites falling in countries with a higher sociodemographic index, potentially related to decreasing smoking and alcohol consumption 28. Therefore, increases in HNC cannot be attributed solely to smoking and alcohol consumption, given an overall decline in these behaviors among younger people in the UK30,31. Birth cohort effects likely play a role in the development of certain HNC due to varying exposure to risk factors over time, on top of an increasingly ageing population 7, 39.
Increasing HNC could be partly due to increases in HPV infections 31, which have led to a notable increase in HPV-associated HNC, with rates for most non-HPV-related HNC remaining stable or declining32. However, despite a potential link between HPV and HNC subsites54, one study found no change in HPV-attributable HNC cases between 2002 and 2011 despite oropharyngeal cancer doubling, suggesting that HPV plays a role but that the increase may not be entirely explained by changes in HPV infections 35.
Survival estimates for HNC align with findings from the UK8 and Europe46, with cancers of the hypopharynx and nasal cavity and sinuses having the lowest survival rates47. Poorer survival rates for these cancers can be attributed to late presentation, early lymph nodal metastasis, and high cancer recurrence48, with most patients asymptomatic until diagnosis relative to other subsites49.
Our study revealed no consistent improvements in short-term survival and only a slight improvement in long-term survival in the UK. This aligns with research indicating that survival rates for oral and pharyngeal cancer in the UK have not improved in recent decades50. As our study period covers up to 2021, small improvements in long-term survival could be attributed to a larger reduction in smoking and alcohol consumption before the 2000s as well as better management using multi-disciplinary teams and advances in palliative care51.
We used two large primary care databases with over 20 years of follow-up covering the whole of the UK. CPRD GOLD covers practices from England, Wales, Scotland, and Northern Ireland, while Aurum covers practices in England15. The use of both databases meant it was possible to compare results and increase generalizability of research findings. However, for some HNC subsites, there were some differences between databases, likely due to geographical location52. Another strength of our study is the inclusion of a complete study population, in contrast with cancer registries, which can potentially introduce bias16.
Our study had several limitations. We used primary care data without linkage to cancer registries, which may have introduced misclassification or delays in recording. However, our findings align with national cancer incidence trends, lending support to the external validity of our results. This consistency suggests that our data reflect broader population trends, despite the limitations of primary care-only records54. Our use of primary care records precluded us from studying tumour histology, genetic mutations, staging, or cancer therapies, which can all impact survival. The anatomical proximity between the different sites of HNC can also lead to cancers being assigned to the wrong subsite. Finally, the HNC classification using SNOMED CT used in our study does not always align with the anatomical classification used by otolaryngologists 53. For example, the tongue is divided anatomically into the base of the tongue and the anterior two-thirds, which are classified as oropharynx and oral cavity, respectively. However, unspecific SNOMED CT codes such as “Malignant neoplasm of tongue” meant we were unable to classify patients into either oral cavity or oropharyngeal HNC subsites, meaning these subsites may be underreported.
CONCLUSION
This study highlights that the increasing incidence and prevalence of HNC in the UK are largely driven by an aging population, while specific subsites are influenced by additional factors such as HPV infections. The lack of significant improvement in short-term survival and only modest gains in five-year survival emphasize the critical need for earlier diagnosis and tailored strategies to improve outcomes. Continued surveillance and research are essential to address the unique epidemiology of HNC subsites and guide prevention, diagnosis, and treatment efforts.
CONTRIBUTIONS
Conceptualization: DN, DPA, EB
Data harmonisation and data quality assessment: AD, WYM
Formal analysis: DN
Funding acquisition: DPA
Supervision: DPA, EB, FXA-J
Interpretation of results: All authors
Writing - original draft: AMD, DN
Writing - review & editing: All authors
FUNDING
This activity under the European Health Data & Evidence Network (EHDEN) and OPTIMA has received funding from the Innovative Medicines Initiative 2 (IMI2) Joint Undertaking under grant agreement No 806968 and No. 101034347 respectively. IMI2 receives support from the European Union’s Horizon 2020 research and innovation programme and European Federation of Pharmaceutical Industries and Associations (EFPIA). The sponsors of the study did not have any involvement in the writing of the manuscript or the decision to submit it for publication. Additionally, there was partial support from the Oxford NIHR Biomedical Research Centre. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.
CONFLICTS OF INTEREST
Professor Daniel Prieto-Alhambra's research group has received research grants from the European Medicines Agency, from the Innovative Medicines Initiative, from Amgen, Chiesi, and from UCB Biopharma; and consultancy or speaker fees from Astellas, Amgen, Astra Zeneca, and UCB Biopharma. All other authors declare no conflicts of interest.
DATA AVAILABILITY STATEMENT
This study is based in part on data from the Clinical Practice Research Datalink (CPRD) obtained under licence from the UK Medicines and Healthcare products Regulatory Agency. The data is provided by patients and collected by the NHS as part of their care and support. The interpretation and conclusions contained in this study are those of the author/s alone. Patient-level data used in this study was obtained through an approved application to the CPRD (application number 22_001843) and is only available following an approval process to safeguard the confidentiality of patient data. Details on how to apply for data access can be found at https://cprd.com/data-access.
Footnotes
amiquel.hj23.ics{at}gencat.cat, cheryl.tan{at}ndorms.ox.ac.uk, edward.burn{at}ndorms.ox.ac.uk, antonella.delmestri{at}ndorms.ox.ac.uk, tduarte{at}idiapjgol.org, asieh.golozar{at}odysseusinc.com, waiyi.man{at}ndorms.ox.ac.uk, faviles{at}clinic.cat, danielle.newby{at}ndorms.ox.ac.uk
ABBREVIATIONS
- IR: incidence rates
- PP: period prevalence
- HNC: head and neck cancers
- KM: Kaplan-Meier
- HPV: human papillomavirus