Abstract
Background Per- and polyfluoroalkyl substances (PFAS) are persistent organic pollutants with emerging environmental and regulatory concerns.
Objectives This study aimed to estimate the burden of PFAS exposures through ground water on the incidence of chronic health conditions among Medicare beneficiaries aged 65 years and older.
Methods We estimated PFAS groundwater concentrations for every ZIP code tabulated area (ZCTA) in California counties where 25 percent or more of the population’s drinking water was derived from groundwater. We calculated the annual incidence of non-cancer chronic health conditions among 1,696,247 Medicare beneficiaries aged 65 and older by residential ZCTA over the seven-year study period (2011-2017). A Poisson regression model was used to estimate associations between PFAS groundwater concentration and chronic condition incidence with an offset for the number of beneficiary-years at risk and adjusting for bias due to non-random sampling of wells, use of groundwater for drinking water, demographic characteristics, and lung cancer incidence as a control for smoking.
Results Results suggest an association between a 10 ng/L increment in PFAS contaminated groundwater and chronic health conditions including hypertension (+1.15%, 95% confidence interval (CI) 1.01, 1.30), chronic kidney disease (+0.83%, 95% CI 0.68, 0.99) and cataracts (+1.50%, 95% CI 1.35, 1.66).
Discussion This small increment in the incidence rate would produce an additional 1,700 new cases of hypertension each year in the study population.
Introduction
The rising concern over per- and poly-fluoroalkyl substances (PFAS) in drinking water in the United States has prompted the federal government to identify scientific challenges that must be addressed to understand and address human health impacts (National Science and Technology Council 2023). Potential adverse health effects from PFAS include impacts on fetal development, testicular and kidney cancer, liver tissue damage, suppressed immune function, and cholesterol levels changes (US Environmental Protection Agency 2021). While there is scientific consensus that there are significant health effects associated with perfluorooctanoic acid (PFOA) and perfluorooctane sulfonic acid (PFOS), the full scope of health effects in the population related to the larger set of PFAS measured in drinking water remains unclear.
Recent systematic evidence maps conducted by EPA researchers found limited epidemiological studies on cardiovascular endpoints related to PFAS exposure (Carlson, Angrish et al. 2022, Radke, Wright et al. 2022). Most epidemiological studies focused on acute clinical outcomes, such as heart attack-related hospitalizations, with few examining chronic conditions like hypertension, hyperlipidemia, or type-2 diabetes. These reviews identified few epidemiologic studies of cardiovascular endpoints in cross-sectional studies in the general population and even fewer in cohort studies. Most of these epidemiological studies were of acute clinical endpoints, such as deaths and hospitalizations for myocardial infarctions, and few have examined the effect of exposure on development of chronic health conditions such as the incidence of new diagnoses of major cardiovascular conditions, such as hypertension, hyperlipidemia, or type-2 diabetes.
Relative to the widespread use of these chemicals and known health effects, the evidence for population level health burden is still sparse, primarily due to the scarcity of exposure data. Although fluorosurfactants (principally PFOS and PFOA) have been detected in drinking water supplies throughout the US (Hu, Andrews et al. 2016, Guelfo and Adamson 2018), the assessment of the public health burden has been challenging; the exposures have not been systematically measured until recent history and PFAS is not generally measured in water used for consumption such as tap water. Simultaneously, PFAS exposures are both ubiquitous and differential with respect to many known determinants of chronic health conditions, such as income, race, sex, and occupation and there is an extensive lag time between exposure and the development of disease.
This study aims to assess the burden of chronic health conditions associated with PFAS exposure in groundwater well sources of drinking water. More specifically, the study estimates the risk associated with the annual incidence of common chronic health conditions among Medicare beneficiaries and identifies outcomes of highest concern for burden assessment at the population level. Propensity scores are employed to account for non-random sampling of wells and non-random use of groundwater for drinking. Additionally, several chronic health conditions are assessed as negative controls to identify potential unmeasured confounding related to smoking, frailty, and social isolation. The analysis quantifies outcomes with strongest association in response to PFAS exposure, the impact of each potential source of bias and offers recommendations for future analytical approaches.
Methods
To estimate the burden of chronic health conditions associated with PFAS exposure we identified health outcomes from Centers for Medicare and Medicaid Services’ Chronic Conditions Warehouse, and exposure levels in groundwater well sources of drinking water. Additionally, areal and well level information about sociodemographic population characteristics, landcover types, and the presence of known and potential PFAS sources were used to control for confounding.
Health Outcome Data
We obtained individual-level information on chronic health conditions among Medicare beneficiaries from the Centers for Medicare and Medicaid Services’ Chronic Conditions Warehouse. For each Medicare beneficiary, these annual files provide indicators for 27 chronic health conditions as well as their annually updated ZIP-code of residence (US Department of Health and Human Services 2023). Since Medicare is exceptional for those under 65 years, we restricted our analysis to Medicare beneficiaries aged 65 years and older for whom coverage is nearly universal.
The 27 chronic health conditions were separated into nine common conditions (cataract, chronic kidney disease, hypertension, congestive heart failure, hyperlipidemia, anemia, diabetes, rheumatoid/osteo arthritis, ischemic heart disease), eight less common conditions (chronic obstructive pulmonary disease (COPD), stroke, atrial fibrillations, glaucoma, hyperplasia, acquired hypothyroidism, osteoporosis, Alzheimer’s, and dementia), and three negative controls (lung cancer, depression, and hip fracture). We excluded from the analysis seven outcomes that averaged less than one incident case per year in a ZIP code: prostate cancer, breast cancer, endometrial cancer, colorectal cancer, asthma, Alzheimer’s, and acute myocardial infarction. We used indicators of each chronic health condition to calculate the annual incidence of newly diagnosed conditions over the seven-year period (2011-2017) for each ZIP code of residence.
For each ZIP code of residence, each unique beneficiary with an incident chronic health condition at any time during the seven-year study period contributed one count to the incidence of the corresponding condition. Person-years at risk were calculated in a similar manner; for each ZIP code of residence, each unique beneficiary contributed a beneficiary-year at risk for every year that they did not have the condition as well as the year in which they were first diagnosed.
Medicare beneficiary residential ZIP codes (postal delivery routes assigned by the US Postal Service) were mapped to both ZIP code population weighted centroids (US Department of Housing and Urban Development 2023) and ZIP Code Tabulated Areas (ZCTAs) (US Bureau of the Census 2023). In this process, multiple ZIP codes may have been mapped to the same ZCTA (2,444 ZIP codes were mapped to 1,756 ZCTAs).
PFAS Groundwater Concentrations
PFAS concentrations in groundwater were obtained from California’s Groundwater Ambient Monitoring and Assessment (GAMA) program, a collaboration between the U.S. Geological Survey and California’s State Water Resources Control Board, which began testing domestic and public supply wells for PFAS in 2018 (US Geological Survey 2023). Well location data and PFAS sampling results were downloaded from the GAMA website for the period 2018 until 2021, however most samples were analyzed in recent periods (California Water Boards 2023).
Individual wells may have been tested several times for multiple PFAS species and all results were reported in nanograms per liter (ng/L). Values reported as less than the detection limit were interpreted as non-detects and converted to zero before calculating exposure; however, minimum detection limits varied by PFAS testing method. Maximum PFAS concentrations at individual wells were aggregated to the ZCTA-level as an inverse squared distance weighted concentration averaged across all wells within 20-km of the ZCTA population centroid, with wells closest to the centroid contributing the most weight.
Sociodemographic Data
Population characteristics, including age, race, and income, were downloaded from the US Census Bureau’s 5-year American Community Survey (ACS-5) for all ZCTAs and census tracts in California for the year 2015 (US Bureau of the Census 2023). Additional sociodemographic data were retrieved from California’s Office of Environmental Health Hazard Assessment’s CalEnviroScreen 4.0 (CES) dataset which is an interactive mapping tool and database of environmental, socioeconomic, and public health data (California Office of Environmental Health Hazard Assessment 2023). The CES data were available by census tract, and for the purpose of the analysis they were averaged for all census tracts within each ZCTA as well as linked to the wells found within the census tract.
PFAS Source
Locations of industries such as airports, fire training facilities, chemical manufacturing sites, and waste management sites registered through USEPA’s Enforcement and Compliance History Online (ECHO) database and included in EPA’s PFAS Analytics Tools were treated as potential PFAS sources in this study (‘PFAS Sources’, 23 variables, Table S1) (US Environmental Protection Agency 2023). Although, except for most military installations, this dataset does not include any actual data or confirmation about each facility’s PFAS use or discharge, these industry points are considered to have some relevance to PFAS use or discharge. Potential PFAS sources were counted within well site buffers (1- and 5-km) and ZCTAs by industry sector, and distances to the nearest PFAS industry source were measured from well sites. The number of PFAS industry sources were included as predictors of the exposure and were not considered to be on the pathway between PFAS concentration and health outcomes.
Landcover Data
Land use and landcover data were obtained from the Multi-Resolution Land Characteristics Consortium’s 2016 National Land Cover Dataset (NLCD) (Dewitz and Survey 2021, Wickham, Stehman et al. 2021). The NLCD uses Landsat imagery to classify landcover into 16 categories at a 30m resolution for the entire US. The NLCD Land Cover Change Index raster dataset, which measures changes in landcover types between 2001-2019, was also included in this analysis. The NLCD also provides a raster of urban imperviousness that assigns each 30m pixel a value of percent developed impervious cover, for example, pavement and rooftops. Landcover data were summarized within California’s ZCTAs as well as within 1- and 5-km buffers around each well site. Local soil characteristics, including percent organic matter, percent clay, and pH, were summarized at each well site with data from the POLARIS database (Chaney, Wood et al. 2016, Moro Rosso, de Borja Reis et al. 2021). Elevation (meters above sea level) was estimated at each well site using the “elevatr” package in R (Hollister, Shah et al. 2022).
Statistical approach
We used Poisson regression (“glm” package in R) (R Core Team 2022) to model the incidence of chronic health conditions as a function of PFAS concentrations (ng/L) in the groundwater. We first restricted study area to regions where at least 25 percent of drinking water was sourced from groundwater wells. This restriction was not based on PFAS groundwater concentrations nor incidence of chronic health conditions. However, to control for potential selection bias and for confounding by common causes introduced by choice of drinking water source we developed a propensity score (‘GW PS’). Second, we consider the possibility that wells were not tested at random and may have been differentially sampled with respect to unknown factors. To adjust for selection bias due to preferential sampling of wells, we also developed a well testing propensity score (‘Testing PS’) defining the probability of a well being tested based on measured covariates. Both propensity scores were included in the model as natural splines with three degrees of freedom.
Additionally, three population-level variables that are known determinants of health were added to the regression (‘Regional Determinants of Health’; percent of workforce employed in agriculture, percent Hispanic ethnicity, and percent with a disability). The prevalence of smoking (‘Smoking’) was unknown, both for the general population and for Medicare beneficiaries. Smoking is known to be the overwhelming cause of lung cancer and any contribution of PFAS must be orders of magnitude less, so we used lung cancer incidence as a surrogate to account for the unmeasured variation in the prevalence of smoking across the ZCTAs in the study area.
Finally, we adjusted for two sets of covariates that may predict the exposure: covariates related to the known and potential sources of PFAS (‘PFAS Sources’, 23 variables, Table S2) and covariates that were indicative of PFAS concentrations (‘PFAS Covariates’, 10 variables, Table S2).
We also considered depression and hip fracture, as negative controls to assess residual confounding by unknown or omitted sources of confounding. These outcomes have no known biological mechanisms that link the incidence of disease to PFAS exposure. However, we expect there may be other factors associated with negative controls and PFAS that are also likely to confound the association with other outcomes of interest. Therefore, the common causes of exposure to PFAS and outcome are hypothesized to be like the common causes of exposure to PFAS and incidence of chronic conditions. These factors may be directly or indirectly measured through variables describing the residential location, type of employment, proximity to sources, demographic characteristics, etc. Since social isolation is the major risk factor for depression, we believe that there exists no biological relationship between PFAS and depression. Any residual association between PFAS and depression would be evidence of residual confounding. We did not find literature reporting a direct link between hip fracture with PFAS exposure. The major risk factor for hip fracture is fall and age, but factors such as arthritis and osteoporosis may contribute to the severity of outcomes associated with hip fracture. Those outcomes are analyzed separately.
Propensity score for PFAS testing
Groundwater wells were not randomly selected for PFAS testing which can potentially lead to selection bias in the analysis. To account for potential selection bias, we estimate the probability of each well being selected for PFAS testing (testing propensity score). We take advantage of the large and comprehensive characterization of residential and potential PFAS sources within 1-km and 5-km radii of each well and use extreme gradient boosting, XGBoost algorithm (Chen and Guestrin 2016), to predict the probability of a well being tested for PFAS. More specifically, the following variables were summarized at the well (buffer) level and used in the XGBoost algorithm: the Californian hydrologic region (https://gis.data.ca.gov/datasets/2a572a181e094020bdaeb5203162de15/explore) that the well is in, land use and landcover variables from the NLCD, location of potential industrial PFAS sources (EPA PFAS Analytics Tool), population characteristics (CES and US Census, ACS), soil characteristics (POLARIS), and elevation.
The XGBoost model was fit in two stages using the test-set validation approach. The model in the first stage was fit on a random subset containing 90% of wells, with the optimal number of gradient boosting rounds determined via early stopping with ten rounds, using the other 10% of wells as a validation set. That is, the optimal round is the last round after which performance on the validation set doesn’t improve for ten rounds. The model in the second stage was then fit on all wells, with the optimal number of rounds found in the first stage. The other parameters of the model are set to the defaults as in the “xgboost” package in R (https://xgboost.readthedocs.io/en/stable/parameter.html). This model was then used to predict propensity scores for all the wells. We aggregated propensities to the ZCTA-level as an inverse distance weighted concentration averaged across all PFAS-tested wells within 20-km of the ZCTA population centroid, as with PFAS concentrations.
Propensity score for drinking water from ground sources
We consider groundwater consumption in the epidemiological model for PFAS in groundwater. While we estimated 22% of Californians rely on groundwater as their primary source of drinking water, this proportion varies considerably by county. In the study area, the overall proportion was 54%. Data on public water systems, including population served and whether sourced from groundwater or surface water, were downloaded from California’s Public Water System Information Dataset (PWSID) (California State Water Resources Control Board. 2019). These water-source estimates were combined with estimates of population served by private drinking water wells (Johnson, Belitz et al. 2019) to calculate percent of county population drinking from groundwater. We restricted the study region to ZCTAs with population centroids within the 40 counties where 25% or more of the population rely on groundwater for their drinking water.
To account for potential selection bias from restricting our analysis to the specified ZCTAs, we calculate the second propensity score for each ZCTA (probability of selection in the study region). We use extreme gradient boosting (Chen and Guestrin 2016) to calculate the propensity scores, as earlier. However, for the two-stage test-set validation approach, the model was trained on 80% of the ZCTAs and validated on the other 20%; the validation set has a larger proportion to adjust for the smaller number of ZCTAs. We predicted these propensity scores conditional on ZCTA-level variables: land use and landcover variables (NLCD), location of potential industrial PFAS sources (EPA PFAS Analytics Tool), and population characteristics (CES and US Census, ACS).
Statistical models
For each outcome of interest, we fit seven models (M1-M7) for weighted PFAS concentrations as a treatment variable of interest. M1 (Base Model) used the log count of the total Medicare beneficiaries (population-at-risk) as an offset. The subsequent models were cumulative adjustments to the basic model: M2 (+ Testing PS) adds the PFAS-well testing propensity score to control for potential selection bias; M3 (+ GW PS) adds the propensity score for the proportion of drinking water from groundwater; M4 (+ Regional Determinants of Health) adds three major regional health determinants for the total population, including percent working in agriculture industry, percent Hispanic and percent with disability; M5 (+ Smoking) adds lung cancer incidence as a surrogate for smoking prevalence. Finally, model M6 (+ PFAS Sources) adds known industrial PFAS sources (Supplemental Table S1) and M7 (+ PFAS Covariates) adds additional covariates (not including sources) that were related to high levels of PFAS (Supplemental Table S2).
Sensitivity analysis was conducted with respect to the exposure metric. In addition to the weighted metric used in the main analysis, we also considered unweighted, and propensity weighted metric. The propensity weighted metric was constructed as , where wi is the distance weight of the well to the population centroid of ZCTA, Si is the indicator of a sampled location, and pi is the propensity score of the sampling and the summation is over sampled locations in DS. Rescaling the original propensity score , where ) provides a property of . As a result, the constructed propensity weighted metric approximates the weighted average of PFAS had all the wells in the ZCTA DS been sampled . This metric estimates the total exposure for this ZCTA and adjusts for preferential sampling by weighting.
Results are reported as the percent difference in the annual incidence of a chronic condition for a 10.0 ng/L increment in estimated PFAS groundwater concentrations, which corresponds to the 80th percentile of the PFAS distribution in the study area. To recapitulate, PFAS groundwater concentrations are estimated for each ZCTA as the inverse distance-squared weighted mean of the well-specific maximum PFAS-species concentration for tested wells within 20-km of the ZTCA population centroid.
Results
The study cohort included 1.7 million Medicare beneficiaries aged 65 years and older living within 523 ZCTAs (Figure 1). The greatest reduction in the study cohort was the restriction of the study to beneficiaries residing in California counties with over 25% of their drinking water derived from groundwater, which excluded many of the major urban centers (Figure 1 inset map). Compared to all of California, the population of the study area had greater proportions of Hispanic ancestry, with disabilities and employed in agriculture (Table 1). The study area had a slightly higher incidence of the eight most prevalent chronic health conditions, but a similar incidence of lung cancer, which was used as a proxy for smoking prevalence (Table 1).
Among the more than 20,000 wells extracted from the GAMA database, only 2,773 (14%) were tested for PFAS. Statewide, the highest PFAS groundwater concentrations were observed at wells near the most urbanized areas with low groundwater usage for drinking which were not located within the study area (for example, Los Angeles, San Francisco, and San Diego) suggesting that regions less reliant on ground sources of drinking water have more contamination and are tested more often. Within the study area, 1,647 wells were tested for PFAS and 589 (36%) had detectable levels of PFAS, with higher PFAS concentrations observed at wells near or within urban areas (for example, Fresno, Redding, and San Luis Obispo). Although the highest PFAS concentrations were outside the study area, the overall distributions of PFAS concentrations within the study area were remarkably similar to those of California overall (Table 1).
The exposure metric is calculated using the maximum sample at each well regardless of PFAS species. As such, only a handful of the tested PFAS species were represented as the exposure metrics, since 98% of wells’ maximum PFAS measurements were from just five species – PFOS, PFHXSA, PFOA, PFHA, and PFBSA (Table S4). These five species are either perfluoroalkyl sulfonic or perfluoroalkyl carboxylic acids with chain lengths ranging from 4 to 8 carbons. Four additional PFAS species (PFBTA, PFPES, PFUNDCA, NMEFOSAA) were also detected in study area wells, but were never the maximum measured species for any well.
Based on our analysis of testing propensity, the average probability of a groundwater well being tested for PFAS was 89.1% for wells with detectable levels of PFAS and 83.2% for wells without detectable levels of PFAS, indicating that contaminated wells were more likely to be tested than non- contaminated wells. Variables most influential in prediction of well testing were increased land development (“developed, medium-intensity”, an NLCD category) within 5-km of the well, presence of 1,2,3-trichloropropane (an industrial solvent), decreased levels of chromium 6 and uranium, and shorter distance to national defense industry sites. The presence of PFAS sources explained only 3% of variability (R2) in PFAS concentration across wells.
The incidences of chronic health conditions were associated with a 10 ng/L increment in PFAS groundwater concentrations with the strongest associations observed for hypertension 1.15% (1.01, 1.30), cataract 1.50% (1.35, 1.66), chronic kidney disease 0.83% (0.68, 0.99), and COPD 1.21% (1.01, 1.42) for the final covariate model (Table 2, Table S4). Adjustments for PFAS testing propensity and for varying use of groundwater for drinking water produced little change in estimates, but the three regional health covariates and smoking (that is, the ZCTA-specific incidence of lung cancer) were strong confounders as shown by the change in estimates. Ischemic heart disease as a chronic health condition represents a cohort of heart attack survivors and these results suggest that PFAS may have a negative impact on heart attack survival. Adjustments for PFAS sources (23 degrees of freedom) and for other covariates (10 degrees of freedom) showed little evidence of additional confounding although addition of PFAS sources generally increased the effect estimates.
Associations for other non-cancer chronic health conditions generally had small but positive estimated risk, except for asthma and ischemic heart disease (Figure 2). The associations for some outcomes were sensitive to one largest PFAS value observed in Mariposa County. After removing this maximum value, associations between PFAS and anemia, asthma, and diabetes were null. For all other outcomes estimates of risk were consistent and similar in magnitude (Figure S3).
Many of these chronic health conditions share common etiologies and these results may reflect shared pathways of action. We performed a mediation analysis to assess whether PFAS associations were mediated by the strong association between PFAS and hypertension, a known risk factor for other chronic health conditions. Hypertension is also the most prevalent condition and has the largest rate of co-occurrence with other conditions. With the addition of hypertension to the model, the PFAS direct effect was reduced to 0.56 (95% CI 0.40, 0.71) for chronic kidney disease, 0.76 (95% CI 0.61, 0.92) for cataract, and 0.94 (95% CI 0.73, 1.15) for chronic obstructive pulmonary disease. The proportion of the total effect mediated by hypertension was greatest for cataracts (49%), chronic kidney disease (33%), and chronic obstructive pulmonary disease (22%). These results suggest that hypertension mediated a considerable portion, but not all, of the association between PFAS and these other chronic health conditions.
Discussion
Exposure to PFAS is a national public health concern with a growing need for population level risk assessment. In this study, we characterized the association between PFAS in the groundwater wells and the incidence cases of chronic disease in a large population of Medicare beneficiaries aged 65 years and older residing in California between 2011 and 2017. The study area was a region where approximately 54% of population receives drinking water from ground water and the exposure was determined as a distance-weighted mean of the maximum PFAS concentrations at each groundwater well in the postal code of Medicare beneficiaries. We observed the strongest association for the incidence of hypertension, chronic kidney disease, COPD, and cataract; however, the incidence of other chronic diseases (congestive heart failure, stroke, atrial fibrillations, diabetes, glaucoma, osteoarthritis, anemia, acquired hypothyroidism, benign prostatic hyperplasia, and Alzheimer’s/dementia) were also positively associated with PFAS exposure.
Incidence of hypertension was increased by one percent (1.15, 95% CI 1.01, 1.30) for a 10 ng/L increment of PFAS. To place this small rate increment in context, the average annual incidence of hypertension in the study area was 10.1 percent or about 170,000 new cases of hypertension annually. A one percent increase in this rate would add 1,700 additional cases of hypertension each year among the approximately 1.7 million Medicare beneficiaries in the study area.
An association between PFAS and hypertension has been previously reported. Cardiovascular disease has been linked to PFAS by experimental and epidemiologic studies (Meneguzzi, Fava et al. 2021) (Meneguzzi, Fava et al. 2021, Wen, Wang et al. 2022). Hypertension and dyslipidemia have been linked to serum PFAS among Norwegian adolescents (Averina, Brox et al. 2021), US women (Ding, Karvonen-Gutierrez et al. 2022) and adolescents (Ma, Xu et al. 2019), and to PFAS drinking water contamination among North Carolina adults (Ward-Caviness, Moyer et al. 2022). Preeclampsia, acute onset hypertension in pregnant women, and other reproductive related outcomes were found to be associated with PFAS in two large systematic reviews (Gao, Ni et al. 2021, Rickard, Rizvi et al. 2022). This study is the first to provide an estimate of the burden of PFAS in groundwater on clinically reported hypertension among a general population aged 65 years and older. An adverse association between PFOS and preeclampsia (acute hypertension in pregnancy) was first reported for women enrolled in a study of a major industrial release of PFAS in the Mid-Ohio River Valley in the United States (Stein, Savitz et al. 2009, Avanasi, Shin et al. 2016). Adverse associations between PFAS and preeclampsia were also found in Sweden (Rylander, Lindh et al. 2020), Denmark (Birukov, Andersen et al. 2021), Canada (Borghese, Walker et al. 2020), United States (Preston, Hivert et al. 2022) and China (Tian, Zhou et al. 2023) but results from a Norwegian cohort were equivocal (Starling, Engel et al. 2014).
The results reported here are consistent with a previous study in North Carolina based on electronic health records (Ward-Caviness, Moyer et al. 2022). That study also reported similar associations between presence of any PFAS detected in the drinking water system with hypertension (OR 1.32 (1.15, 1.52) ) and chronic kidney disease (OR 1.14 (0.87, 1.51)). In that study, the associations between PFAS and heart failure and diabetes were problematic due to smaller sample size, and cataract and COPD were not considered. Even though both studies were population based, they used different exposure metrics: a binary indicator of any PFAS detected in a water system versus more extensive PFAS testing of groundwater well concentrations.
Two outcomes had negative association with PFAS, asthma and ischemic heart disease. Both outcomes have previously been associated with PFAS but not necessarily in this age group. Here because we identify the outcomes as the incident cases that have occurred for the first time after age 65, we suspect that we may have conditioned on the population that is less likely to be newly diagnosed with these two outcomes, like the healthy survivor bias. More specifically, new incidence of asthma is rare in older adults while ischemic heart disease is diagnosed only in the surviving cases (∼75%).
The mediation analysis suggested that hypertension was an important mediator of the PFAS association with other chronic health conditions, especially for cataracts and chronic kidney disease but less so for chronic obstructive pulmonary disease. Even after consideration of hypertension, PFAS exposure still showed direct effects on these other conditions suggesting that the PFAS association is not entirely mediated by hypertension. The relationship of hypertension with cataracts and chronic kidney disease has previously been established (Yu, Lyu et al. 2014, Hamrahian and Falkner 2017). The causal relationship between hypertension and chronic obstructive pulmonary disease is more complex as the loss of lung structure may cause hypertension (Kim, Park et al. 2017). Nevertheless, the results of this study indicate that PFAS has a direct association with these chronic health conditions beyond that mediated by hypertension. Ward-Caviness et al. (Ward-Caviness, Moyer et al. 2022) similarly indicated that the health effects of the exposure to PFAS are not specific and may be associated with multiple chronic conditions.
Strengths and weaknesses
This study has several strengths and weaknesses. For those over the age 65, Medicare coverage is essentially universal. Thus, data collected on new incidence of chronic disease is representative of the population in the same age group in the study region. A unique strength of Medicare beneficiary summaries is that the chronic diseases are tracked over all beneficiaries and the incidence and prevalence rates can be fully enumerated. Nevertheless, to our knowledge there have not been other studies of PFAS effects using beneficiary files.
In this analysis we payed close attention to various sources of potential bias including preferential sampling of PFAS and confounding. We carefully considered selection bias from the non-random sampling of wells for PFAS testing, but the adjustment by a testing propensity score showed minimal impact on the PFAS effect estimates. The restriction of the study area to California counties where more than 25% of drinking water comes from groundwater was also adjusted by a propensity score for groundwater use and several demographic characteristics associated with the selection. The adjustment had minimal impact on the PFAS effect estimates (Table 2), but the reported findings ought not to be extrapolated to areas that were excluded from the analysis. For example, the region around Los Angeles has a number of wells with detected PFAS levels however, the region is estimated to have low reliance on the use of ground water sources for drinking. Additionally, some of the outcomes that had the largest risk estimates, such as cataract, have previously been noted to have both genetic and age dependent risk factors. Although conceivable that an association exists between average age in population and average PFAS concentration and that the outcome of interest is a surrogate for age, the risk estimates did not change after adjusting for age.
PFAS sources had a small suppression effect (MacKinnon, Krull et al. 2000), which when added to the model increased the effect estimates across most outcomes. The small explanatory power of PFAS sources suggest that presence of sources cannot act as an Instrumental variable and therefore cannot be a substitute for the direct exposure measurements. However, the addition of these variables can increase the magnitude and improve the overall model.
This study is a cross sectional analysis of chronic condition incidence and PFAS exposure in ground water used for drinking water, and as such is subject to several sources of potential confounding. The major confounders in the analysis were the three regional determinants of health (Hispanic ethnicity, disabilities, and agricultural employment) and the incidence of lung cancer as a surrogate for tobacco smoking (Table 2). Adjustment for the three regional variables increased the PFAS associations, while the adjustment for lung cancer reduced these associations. Among other limitations, smoking status of beneficiaries was not available and lung cancer incidence is used to characterize population level rates of smoking. Tobacco smoking does have a strong independent association with hypertension and partial adjustment for smoking using lung cancer incidence shows a large change in estimate, reflecting considerable confounding. The adjustment by the propensity scores for groundwater use showed less evidence of confounding. The widths of the confidence intervals for each chronic condition do not change much across the epidemiologic models (Table 2), providing no evidence of overfitting. This lack of variance inflation persists even for meaningful changes in the point estimates.
This study is an ecological study of a shared environmental exposure within the ZIP code of residence. Population level assessments are challenging due the lack of a comprehensive sampling strategy for PFAS. In this study, we used measurements for PFAS in groundwater from GAMA data repository of various sampling campaigns. While possible, measurement errors are unlikely to have varied across ZCTAs differentially with the incidence of chronic health conditions. Similarly, the clinical diagnosis of chronic health conditions supplied by the Medicare data is unlikely to have been influenced by PFAS groundwater concentrations. Finally, temporal misalignment between sampling of PFAS, exposure, and measured outcomes in these early stages of population level investigation is a knowledge gap and should not be overlooked.
Future research will benefit from enhanced measurements of PFAS in drinking water sources that is ongoing in many States. In present work we use the best available data on groundwater concentrations, but measures of drinking water exposure are likely to provide direct linkage. We found that adjustment for area-level lung cancer incidence was an important confounder, which implies that smoking is likely to be an important factor for consideration in future studies. The second most influential confounders included areal-level determinants of population health, which in the study area were % hispanic population, % disability, and % employed in agriculture. Adjustment with propensity scores for the non- random sampling of wells for PFAS testing and for differences in groundwater use for drinking water produced modest changes in the estimates but are important to consider in studies of ground water exposure. The impact of groundwater use was minimized by our restriction to California counties with at least 25% of drinking water derived from groundwater. Finally, the range of chronic conditions associated with PFAS indicated non-specific modes of action. Separation of PFAS associations into direct and indirect effects would be useful in resolving these pathways, as shown by our analysis using hypertension as a mediator.
Conclusion
The findings of associations between PFAS in the groundwater and the incidence of hypertension, chronic obstructive pulmonary disease, chronic kidney disease, and cataracts among Medicare beneficiaries aged 65 years and older are consistent with prior clinical and epidemiologic research and are unlikely to be due to confounding, selection bias, or measurement errors. While a small percentage increase in annual incidence of clinical hypertension may seem like a small proportion, the impact of hypertension on subsequent morbidity and mortality suggests important public health consequences.
Data Availability
All exposure data and analytical code will be posted after the manuscript is accepted. Health data is considered sensitive and cannot be posted publicly.
Supplemental Figures and Tables
Acknowledgments
Although this work has been reviewed for publication by the U.S. Environmental Protection Agency, it does not necessarily reflect the views and policies of the Agency. Reference herein to any specific commercial products, process, or service by trade name, trademark, manufacturer, or otherwise, does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government.
Footnotes
Declaration of competing financial interests: This work was funded with internal funds from the U.S. Environmental Protection Agency (EPA). The authors declare they have no actual or potential competing financial interests.
Data sharing: All data used in this manuscript are publicly available from the appropriate data sources.
Abbreviation
- ZIP-code
- Zone Improvement Plan Code
- ZCTA
- Zip Code Tabulation Area
- EPA
- Environmental Protection Agency
- GAMA
- Groundwater Ambient Monitoring and Assessment
- ACS-5
- American Community Survey
- CES
- CalEnviroScreen
- ECHO
- Enforcement and Compliance History Online
- NLCD
- National Land Cover Dataset
- POLARIS
- Probabilistic Remapping of Soil Survey Geographic Database
- PWSID
- Public Water System Information Dataset
- COPD
- Chronic Obstructive Pulmonary Disease
- GW
- Groundwater
- PS
- Propensity Score
- glm
- Generalized Linear Model
- XGBoost
- Extreme Gradient Boosting
- IQR
- Inner quartile range
- ng/L
- nanograms per liter
- km
- kilometer
- m
- meter
- pH
- Potential of Hydrogen
- DAG
- Directed Acyclic Graph
- PFAS
- Per- and polyfluoroalkyl sulfonic acids
- PFOA
- perfluorooctnoic acid
- PFOS
- perfluoroocane sulfonic acid
- PFHXSA
- perfluorooctanoic sulfonate
- PFHA
- perfluorohexanoic acid
- PFBSA
- perfluorobutanesulfonic
- PFBTA
- Perfluorobutanoic acid
- PFPES
- Perfluoropentanesulfonoic acid
- PFUNDCA
- Perfluoroundecanoic acid
- NMEFOSAA
- N-Methyl perfluorooctane sulfonamidoacetic acid
References
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
- 22.
- 23.
- 24.
- 25.
- 26.
- 27.
- 28.
- 29.
- 30.
- 31.
- 32.
- 33.
- 34.
- 35.
- 36.
- 37.
- 38.
- 39.
- 40.
- 41.
- 42.
- 43.