Risk factors for lung cancer in never-smokers: Multi-cohort study ================================================================= * G. David Batty * Frederick K Ho * Steven Bell ## Abstract **Background** If lung cancer in never-smokers was a single disease entity, it would be the sixth most commonly occurring malignancy. Despite the population impact, its risk factors are poorly understood owing to a dearth of larger-scale, well-characterised studies. **Methods** We pooled individual-participant data from 18 prospective cohort studies comprising 91,588 never smokers (55,452 women) aged 16-102 years at study induction. Participants were linked to national death registries. **Results** A maximum of 17 years follow-up (mean 9.7) gave rise to 85 lung cancer deaths. Of the 19 potential determinants captured at baseline, only being older age (hazard ratio; 95% confidence interval per 10 year increase: 2.45; 2.11, 2.85), male (2.25; 1.46, 3.48), and having a high fruit and vegetable intake (2.29; 1.25, 4.17) were associated with elevated rates of lung cancer in this never-smoking group. No other substantial relationships were detected. **Conclusions** Despite the number and breadth of potential risk factors featured in this multi-cohort study, there was no clear suggestion of new determinants of lung cancer in never-smokers. **Impact** Our findings point to the need to explore the influence of risk factors additional to those included herein, particular in the field of genetics. Our unlikely finding for fruit and vegetable consumption warrants further testing. ## Introduction Lung cancer is a leading cause of death worldwide and it is very well documented that cigarette smoking is its most powerful risk factor. Even with only 10% of lung cancer cases arising in people with no history of smoking,1 the absolute number of disease events is high, such that, if lung cancer in neversmokers was a single disease entity, it would be the sixth most commonly occurring malignancy.2 While this brings into sharp focus the need to identify the causes of lung cancer in never smokers, the process is methodologically challenging because large studies are required and electronic health records, the most obvious source of data, often lack information on smoking habit. The current evidence base implicates a history of non-cancer respiratory disease and exposure to second-hand smoke, asbestos, and radon as being among the few modifiable risk factors in the occurrence of lung cancer in people who never smoked.1 We therefore examined the role of an array of other determinants by pooling individualparticipant data from a series of well-characterised, identical, field-based cohort studies. ## Methods Described in detail elsewhere,3 we used individual-participant (raw) data from 18 cohort studies with identical methodology: the Health Survey for England (15 studies) and the Scottish Health Surveys (3 studies). Ethical approval for data collection was granted by the London Research Ethics Council and local research ethics committees, and study members consented to participate. A total of 222,829 people participated in a home-based biomedical survey between 1994 and 2008 when aged 16-102 years; 193,873 consented to being linked to a national mortality registry (figure 1). Of these, 94,456 people reported that they had never smoked. The further omission of study members with a plasma or salivary cotinine >1.0 ng/mL to identify so called smoking deceivers;4 those reporting use of nicotine replacement therapy; and study members with a history of cancer, resulted in 91,588 (55,452 women) remaining individuals and this was our analytical sample.  [Figure 1.](http://medrxiv.org/content/early/2025/03/14/2025.03.11.25323738/F1) Figure 1. Flow of study members into the analytical sample: **Follow-up of study members in the Health Survey for England and the Scottish Health Surveys** HSE, Health Survey for England; SHS, Scottish Health Survey; NRT, nicotine replacement therapy ### Baseline data collection At study baseline, health behaviours (dietary intake, including alcohol use, leisure-time physical activity), psychosocial characteristics (cohabitation status, educational attainment, psychological distress), physical health (self-rated general, respiratory medications, non-cancer respiratory illness), and demographic data (ethnicity, sex, age) were all self-reported using standard methods.3 During medical examination, waist and hip circumference, height and weight, C-reactive protein, and plasma or blood cotinine were also measured using standard protocols. To account for differences in age, sex, and height, forced expiratory volume in one second and forced vital capacity were standardised using the Global Lung Function Initiative 2022 equations5 and z-scores derived. ### Ascertainment of lung cancer mortality Participants were linked to routinely-collected UK-wide mortality records from which cause of death was extracted. Ascertainment of lung cancer was based on any mention of the malignancy on the death certificate.6 A shared frailty Cox proportional hazards model accounted for unmeasured heterogeneity across studies by incorporating study-level random effects. Participants were censored according to the date of death from this malignancy or the end of follow-up (14 February 2011 in the Health Survey for England, 31 December 2009 in the Scottish Health Surveys), whichever came first. Analyses were conducted using Stata version 15. ## Results A maximum of 17 years of mortality surveillance (mean 9.7) of 91,588 study members gave rise to 85 lung cancer deaths. Being older (hazard ratio; 95% confidence interval per 10 year increase: 2.45; 2.11, 2.85) and male (versus female: 2.25; 1.46, 3.48) was associated with an elevated risk of lung cancer (table 1). While there was also a suggestion of associations with some of the others study member characteristics – higher body mass index and alcohol consumption – these did not reach conventional levels of statistical significance. The only exception was higher intake of vegetable and fruit where there was a more than doubling in the rate of lung cancer mortality (2.29; 1.25, 4.17). View this table: [Table 1.](http://medrxiv.org/content/early/2025/03/14/2025.03.11.25323738/T1) Table 1. Association between baseline characteristics and lung cancer mortality in life-long never smokers: Follow-up of study members in the Health Survey for England and the Scottish Health Surveys ## Discussion Our main finding was that, with the exception of risk factor associations for higher age and being male -- both well-replicated in this context1 -- there was little evidence of a clear association for the remaining 17 potential determinants of never-smoking lung cancer. Our finding that higher fruit and vegetable consumption appeared to confer elevated risk has been reported in other studies7 but is not a universal observation.8 Given the paucity of known risk factors for lung cancer in never smokers, we included as many potential determinants as possible from our dataset, however, rarely, some of these (e.g., psychological distress), were not hypothesis-driven. In conclusion, despite the number and breadth of potential environmental risk factors described, there was no clear emergence of new determinants for never-smoking lung cancer. It may be that future research should consider the role of genetic indices. ## Data Availability [https://data-archive.ac.uk/](https://data-archive.ac.uk/) [https://data-archive.ac.uk/](https://data-archive.ac.uk/) ## Footnotes * (Frederick.Ho{at}glasgow.ac.uk) * (scb81{at}medschl.cam.ac.uk) * Funding: GDB is supported by the US National Institute on Aging (1R56AG052519-01; 1R01AG052519-01A1) and SB by Cancer Research UK (A27657). There was no direct financial or material support for the work reported in the manuscript. * Access to data: Data from the Health Surveys for England and The Scottish Health Surveys are available to researchers upon application ([https://data-archive.ac.uk/](https://data-archive.ac.uk/)). * Conflicts of Interest. The authors declare no potential conflicts of interest. * Received March 11, 2025. * Revision received March 11, 2025. * Accepted March 14, 2025. * © 2025, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), CC BY-NC 4.0, as described at [http://creativecommons.org/licenses/by-nc/4.0/](http://creativecommons.org/licenses/by-nc/4.0/) ## References 1. 1.Samet JM, Avila-Tang E, Boffetta P, Hannan LM, Olivo-Marston S, Thun MJ, Rudin CM. Lung cancer in never smokers: clinical epidemiology and environmental risk factors. Clin Cancer Res 2009; 15(18): 5626–45. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTA6ImNsaW5jYW5yZXMiO3M6NToicmVzaWQiO3M6MTA6IjE1LzE4LzU2MjYiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNS8wMy8xNC8yMDI1LjAzLjExLjI1MzIzNzM4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 2. 2.Rudin CM, Avila-Tang E, Samet JM. Lung cancer in never smokers: a call to action. Clin Cancer Res 2009; 15(18): 5622–5. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTA6ImNsaW5jYW5yZXMiO3M6NToicmVzaWQiO3M6MTA6IjE1LzE4LzU2MjIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyNS8wMy8xNC8yMDI1LjAzLjExLjI1MzIzNzM4LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 3. 3.Batty GD, Gale CR, Kivimaki M, Deary IJ, Bell S. Comparison of risk factor associations in UK Biobank against representative, general population based studies with conventional response rates: prospective cohort study and individual participant meta-analysis. BMJ 2020; 368: m131. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE3OiIzNjgvZmViMTJfMTIvbTEzMSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI1LzAzLzE0LzIwMjUuMDMuMTEuMjUzMjM3MzguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 4. 4.Batty GD, Shipley MJ, Kvaavik E, Russ T, Hamer M, Stamatakis E, Kivimaki M. Biomarker assessment of tobacco smoking exposure and risk of dementia death: pooling of individual participant data from 14 cohort studies. J Epidemiol Community Health 2018; 72(6): 513–5. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiamVjaCI7czo1OiJyZXNpZCI7czo4OiI3Mi82LzUxMyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI1LzAzLzE0LzIwMjUuMDMuMTEuMjUzMjM3MzguYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 5. 5.Bowerman C, Bhakta NR, Brazzale D, Cooper BR, Cooper J, Gochicoa-Rangel L, Haynes J, Kaminsky DA, Lan LTT, Masekela R, McCormack MC, Steenbruggen I, Stanojevic S. A Raceneutral Approach to the Interpretation of Lung Function Measurements. Am J Respir Crit Care Med 2023; 207(6): 768–74. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1164/rccm.202205-0963OC&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36383197&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F14%2F2025.03.11.25323738.atom) 6. 6.Batty GD, Gale CR, Kivimaki M, Bell S. Assessment of Relative Utility of Underlying vs Contributory Causes of Death. JAMA Open Network 2019. 7. 7.Liu Y, Sobue T, Otani T, Tsugane S. Vegetables, fruit consumption and risk of lung cancer among middle-aged Japanese men and women: JPHC study. Cancer Causes Control 2004; 15(4): 349–57. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/B:CACO.0000027507.22124.20&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15141136&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2025%2F03%2F14%2F2025.03.11.25323738.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000221360300003&link_type=ISI) 8. 8.Ozasa K, Watanabe Y, Ito Y, Suzuki K, Tamakoshi A, Seki N, Nishino Y, Kondo T, Wakai K, Ando M, Ohno Y. Dietary habits and risk of lung cancer death in a large-scale cohort study (JACC Study) in Japan by sex and smoking habit. Jpn J Cancer Res 2001; 92(12): 1259–69. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1349-7006.2001.tb02148.x&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000172895400001&link_type=ISI)