Machine learning assisted discovery of synergistic interactions between environmental pesticides, phthalates, phenols, and trace elements in child neurodevelopment =================================================================================================================================================================== * Vishal Midya * Cecilia Sara Alcala * Elza Rechtman * Irva Hertz-Picciotto * Chris Gennings * Maria Rosa * Damaskini Valvi ## Abstract A growing body of literature suggests that higher developmental exposure to individual or mixtures of environmental chemicals (ECs) is associated with autism spectrum disorder (ASD). However, the effect of interactions among these ECs is challenging to study. We introduced a composition of the classical exposure-mixture Weighted Quantile Sum (WQS) regression, and a machine-learning method called signed iterative random forest (SiRF) to discover synergistic interactions between ECs that are (1) associated with higher odds of ASD diagnosis, (2) mimic toxicological interactions, and (3) are present only in a subset of the sample whose chemical concentrations are higher than certain thresholds. In the case-control Childhood Autism Risks from Genetics and Environment study, we evaluated multi-ordered synergistic interactions among 62 ECs measured in the urine samples of 479 children in association with increased odds for ASD diagnosis (yes vs. no). WQS-SiRF discovered two synergistic two-ordered interactions between (1) trace-element cadmium(Cd) and alkyl-phosphate pesticide - diethyl-phosphate(DEP); and (2) 2,4,6-trichlorophenol(TCP-246) and DEP metabolites. Both interactions were suggestively associated with increased odds of ASD diagnosis in a subset of children with urinary concentrations of Cd, DEP, and TCP-246 above the 75th percentile. This study demonstrates a novel method that combines the inferential power of WQS and the predictive accuracy of machine-learning algorithms to discover interpretable EC interactions associated with ASD. **Synopsis** The effect of interactions among environmental chemicals on autism spectrum disorder (ASD) diagnosis is challenging to study. We used a combination of Weighted Quantile Sum regression and machine-learning tools to study multi-ordered synergistic interactions between environmental chemicals associated with higher odds of ASD diagnosis. ![Figure1](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/02/04/2023.02.02.23285222/F1.medium.gif) [Figure1](http://medrxiv.org/content/early/2023/02/04/2023.02.02.23285222/F1) ## Introduction Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by deficits in social communication and interaction and repetitive and stereotyped interests and behaviors1. ASD prevalence has increased drastically recently and is a public health concern worldwide. According to the Centers for Disease Control program Autism and Developmental Disabilities Monitoring (ADDM) Network, approximately 1 in 44 children have been diagnosed with ASD2, 3. In the past decade, a growing number of epidemiological studies have associated early-life environmental exposures with ASD4. These environmental exposures include air pollution5-9, nutrition, and environmental chemicals like volatile organic compounds, solvents, and endocrine-disrupting chemicals (EDCs). Among multiple EDCs, the field of metals exposure and ASD research is continuously evolving10, 11, suggesting a compelling link between higher inorganic metalloid arsenic and ASD in children12. The association between trace elements and other endocrine-disrupting chemicals, such as BPA, and parabens, with ASD has been understudied13; however, studies have shown that they are potential risk factors for child behavioral outcomes14-16. Although the exact cause of ASD remains unclear, research on its etiology has suggested an interplay of multiple genetic and early environmental contributions that differ between individuals4, 17, 18. Genetic and environmental factors may impact typical brain development, including neuron formation and migration, synapse formation, or neural connectivity, ultimately leading to ASD4. Environmental chemical exposures biologically interact through pathophysiologies, including the direct disruption of cells and structures of the nervous system, endocrine hormone- or immune system-mediated impacts, and epigenetic changes, to name a few4. However, there is a lack of environmental epidemiology studies assessing potential chemical-chemical interactions in ASD. Among the very few studies, Curtin et al. examined whether the dynamic interaction of zinc-copper cycles, which regulate metal metabolism, are disrupted in ASD19. Findings showed that the interaction between cyclical co-occurrence between zinc and copper is disrupted in ASD19, 20. The concept of “interaction” has been construed in many ways through different scientific fields21. For example, in current epidemiological studies, interactions are usually reported through association estimates of their effect sizes or inclusion probabilities22-33. Though estimating associations is essential, most methods do not provide any mechanistic or biological insight, possibly because the reported interactions are of particular functional forms (for example, multiplication of exposures) rather than representing their collective activities beyond certain concentration thresholds34. Further, after applying certain dimension reductions, most interactions are reported between sets of reduced exposures, limiting interpretability. In addition, such interactions provide a population-level estimate, with each sample providing some contribution to the overall estimate. On the other hand, the toxicological representation of interactions is easier to comprehend. Through the collective activities of the chemicals, (1) one can identify the mechanism of synergistic or antagonistic behavior that might arise beyond the concentration thresholds (and not just the regression coefficient of multiplicative associations), and (2) the use of concentration thresholds reflect the toxicological underpinning of classical threshold based chemical dose-response studies35-37. Moreover, as the number of chemical exposures increases, searching for multi-ordered interactions gets computationally very intensive. Most current methods, therefore, “hard code” or pre-specify interaction terms in models, but such strategies are limited due to restrictions on sample size and are usually underpowered38, 39. In comparison, Kernel Machine Regression or Bayesian factorization-inspired methods discover interactions with certain functional forms that do not represent any collective activity or concentration thresholds30, 32, 40. The lack of similarity with toxicological threshold-based dose-response studies makes it difficult to find any biologically relevant interpretation of the recovered interactions. Note that such interactions can only be present in a subset of the population since not every sample will have chemical concentrations beyond certain thresholds. As a possible alternative to address this problem of interpretability, tree-based machine learning (ML) models provide a natural solution to represent collective activities of chemical exposures as threshold-based interactions. Nevertheless, a significant challenge was that most of these tree-based models were black-box, creating tension between prediction quality and meaningful biological insight. Moreover, a predictive machine-learning model might not be the optimal model for inference 41. However, in recent epidemiological studies, interpretable tree-based machine-learning tools were used to discover simultaneously co-occurring chemicals, similar to classical Weighted Quantile Sum (WQS) Regression models 42-46. Separately in computational biology, using a novel ML algorithm called random intersection trees47, Basu et al.34, 48 introduced the “signed iterative random forest” (SiRF) algorithm to discover interactions through collective activities. Moreover, SiRF can efficiently search for the few stable and highly occurring interactions instead of going through each possible interaction term. Since exposure to environmental chemicals occurs simultaneously, we intend to use a combination of the WQS regression and the ML method Signed Iterative Random Forest to search for interactions that mimic toxicological interactions. Using data from the Childhood Autism Risks from Genetics and Environment (CHARGE) study, we aimed to identify multi-ordered synergistic interactions between environmental chemicals at specific exposure thresholds associated with higher odds of ASD. We further examined whether the directionality of the interactions remained unaltered even after adjusting for the potential effects of the overall chemical mixtures. ## Methods ### Study Design and Population Details about the CHARGE study have been reported in Bennett et al.13 Briefly, the Childhood Autism Risks from Genetics and Environment (CHARGE) is a case-control study that recruited between 2006 and 2017 three groups of children: (1) children with ASD (2) children with developmental delay (DD) but not ASD, and (3) children with typical development (TD)49. Children from the first two groups were mainly identified from the California Department of Developmental Services. The department coordinates services for individuals with developmental disabilities and is inclusive of all residents of California regardless of their place of birth, religion, or financial resources 13. The third group (controls) was sampled from California birth files utilizing frequency matching of ASD cases comprised of the following characteristics, age, sex, and broad geographic regions up to 10 counties. Children from all three groups were: a) aged 24-60 months at recruitment; b) living with a biological parent who speaks English or Spanish; c) born in California; and d) residing in the study catchment area. CHARGE study included all children with at least 16 mL of urine collected at their assessment and available for chemical analysis. In addition, detailed demographic characteristics of the parents and children were collected during the study visit. However, in this present study, we only included children with either ASD (from group 1) or typical development (from group 3), totaling a sample size of 479. ### Exposure Assessment We collected a single urine sample from each participating child during their visit. All samples were frozen immediately at -20□C and remained frozen until analysis. The samples were shipped on dry ice to the New York State Department of Health’s (NYSDOH) Wadsworth Center’s Human Health Exposure Analysis Resource (HHEAR) Targeted Analysis Laboratory for analysis. Enzymatic deconjugation and liquid-liquid extraction were used to assess the specific phenolic compounds previously described 50, 51. A comprehensive description of the exposure assessment of the targeted phenolic compounds can be found in Bennett et al.13, 50-52. Urinary phthalate metabolites (PhMs) were analyzed using enzymatic deconjugation, solid-phase extraction (SPE), and an isotope dilution method of quantification53. Further information on the analysis of the PhMs is explained elsewhere13, 53, 54. We used the SPE method and the HPLC-MS/MS to analyze the urine samples for six dialkyl phosphate metabolites (DAPs) described in Bennett et al.13, 55. Trace elements were analyzed from urine specimens using the biomonitoring methods based on the ICP-MS at the Laboratory of Inorganic and Nuclear Chemistry at the Wadsworth Center13, 56. Using the following formula, we corrected for specific gravity (SG) urinary concentrations Pc= Px[(SGp-1)/(SG-1)]57. Pc was the SG corrected metabolite concentration (ng/mL), and SG was the specific gravity of the urine sample. The median specific gravity of the CHARGE participants was 1.0223 ng/mL (SGp). In the event that the specific gravity correction factors were greater than 2, they were assigned a value of 2. For values below 0.5, they were assigned 0.513. ### Developmental Assessment During the study visit, an assessment of ASD diagnosis was conducted (to confirm the diagnosis of ASD indicated during the CHARGE enrollment process) using two gold standard psychometric instruments: the Autism Diagnostic Interview-Revised (ADI-R)58-60 and the Autism Diagnostic Observation Schedules (ADOS)61. The ADI-R is a semi-structured interview administered to the primary caregiver to diagnose autism and to differentiate autism from other developmental disorders60. The ADOS is a semi-structured, standardized assessment where the researcher observes the social interaction, communication, play, and imaginative use of materials by children suspected of having ASD13, 61. We utilized the DSM-5 and followed standardized procedures from the ADOS and ADI-R to assign the final diagnosis of ASD62. Children from all three groups were administered the Mullen Scales of Early Learning (MSEL) and the Vineland Adaptive Behaviors Scores (VABS)13. To confirm that a child did not have ASD, we used the Social Communications Questionnaire to screen for ASD in children in both the developmental delay and general population groups63. If a child was positive, we administered the ADI-R and ADOS to determine if they had ASD. All other children enrolled because of a community diagnosis of ASD or DD, but were not confirmed for either of these two diagnoses, were grouped together as Other Early Concerns (OEC)13. Children were classified as TD and enrolled as general population controls who did not meet the criteria for either ASD or DD. All classification groups are mutually exclusive. All clinicians participating in the study spoke English and/ or Spanish. Additionally, they achieved research reliability on all of the instruments they administered13. ### Statistical Analysis We used the Weighted quantile sum (WQS)26 regression to model the adverse mixture effect of chemicals while simultaneously (1) accommodating the correlation structure of the chemicals and (2) controlling for covariates. Similar to Bennet et al., we conducted this analysis by focusing on the positive association (i.e., adverse directionality) between chemical exposures and ASD status. To reduce spurious co-occurrences of chemicals, interactions were searched on top of the chemical-mixture effect. A conceptual schematic of different kinds of interactions was presented in Figure 1. Briefly, these interactions mimic classical toxicological interactions where interaction occurs only if the concentration of certain chemicals is above some thresholds. Conceptually, a usual multiplicative interaction between two chemicals (say, A and B) can be mapped to four toxicological interactions, (1) the concentration of A is high, and the concentration of B is high, (2) the concentration of A high, and the concentration of B is low, (3) the concentration of A is low, and the concentration of B is high, and (4) the concentration of A is low, and the concentration of B is low (see Figure 1A). Note that each of the four components is easier to interpret and could directly imply plausible biological interpretation. Moreover, a positive association with multiplicative interaction does not necessarily imply synergy since the higher value of multiplicative interaction does not imply that the concentrations of individual chemicals are also high. However, such a problem of interpretability does not arise for toxicologically mimicked interactions (Figures 1B and 1C). Lastly, multiplicative interactions provide a population-level interaction estimate – where all individuals contribute, whereas the mimicked toxicological interactions are only present in a subset of the population. In the following analysis, we searched for synergistic interactions in the adverse direction, i.e., chemical exposures higher than certain concentration thresholds, mimicking a toxicological interaction. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/02/04/2023.02.02.23285222/F2.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2023/02/04/2023.02.02.23285222/F2) Figure 1: Conceptual illustration of multiplicative and toxicologically mimicked interactions. The model was controlled for the child’s sex, year of birth, race/ethnicity, age at enrollment, maternal age at the time of childbirth, maternal metabolic conditions during pregnancy (any hypertensive disorder, including obesity or any diabetes), and parental homeowner status (as a proxy of socioeconomic status). These covariates were chosen apriori based on the previous analysis in Bennet et al. 13 To make the analysis robust, we implemented the random subset and repeated holdout 64, 65 variants of WQS. Assuming the main chemical-mixture effect and the synergistic interactions are additive, we extracted the Pearson residuals from this model and treated the residual as the new outcome (the Pearson residual possesses asymptotic normality)37, 66. Therefore, we searched for synergistic interactions on the residuals after adjusting the first-order main mixture effect and the covariates. We searched for interactions through signed-iterated Random Forest (SiRF), where the Pearson residuals from WQS were the outcome, and the 62 chemicals were the exposures. The SiRF utilizes a combination of state-of-the-art machine-learning tools, iterative Random Forests (iRFs), and recently developed Random Intersection Trees (RITs) to search for interactions within a certain proportion of samples34, 47, 48, 67. Instead of searching through all possible combinations, SiRF searches for combinations of exposures prevalent on the decision paths of the generated iRFs. Briefly, we explain how SiRF searches for high-order chemical exposure interactions. First, the model begins with fitting the RF model and reweighting the important exposures. Using the reweighted exposures, multiple RF models are fitted iteratively to reduce the dimensionality of the exposure space without removing marginally unimportant exposures. Second, decision rules are extracted from the iterated RF and fed to a generalization of the RIT to efficiently discover high-order interactions from the decision paths. Last, a bagging step is introduced in the algorithm to assess the “stability” of the recovered interactions through a large number of bootstrapped iterations. Here stability implies the number of times an interaction is detected throughout the iterations; therefore, the higher the recovery rate, the better. Since SiRF searches through particular decision branches, it can incorporate meaningful directionality (in the current study, synergism) while recovering the interactions. The combination of WQS-SiRF can robustly search for interactions without the need to depend on p-values. In the SiRF part, the model was trained on a subset of data, and then bagging was introduced on the remaining held-out testing data. Therefore, to obtain robust results against the sensitivity of data partitioning, we chose three different data partitions, (1) 70% for training and 30% for testing, (2) 75% for training and 25% for testing, and (1) 80% for training and 20% for testing. Finally, we chose only those interactions with (1) more than 50% stability score and were (2) common to all three data partitioning results. Since the discovered interactions were based on thresholds, they were only present in certain portions of the samples. However, SiRF does not directly estimate the thresholds by itself. Therefore, we created interaction indicators based on their joint concentrations to denote the presence or absence of interactions. For example, if the specific gravity-adjusted concentrations of the chemicals were more than the 75th percentile, then the interaction indicator would be non-zero; else, it would be zero. We created another set of indicators based on the 67th percentile threshold for sensitivity analysis. For WQS analysis, (1) we converted all chemical exposures to deciles, and (2) we included all chemicals irrespective of their percentage detected above LOD. Note that the conversion in deciles for chemical exposures and the growing many decision trees through bootstraps protect against outlying and influential observations. As sensitivity analyses, (1) we repeated the WQS-SiRF algorithm with data partitioned in 75% for training and 25% for testing without chemicals whose % of detection above LOD was less than 60%, (2) we gradually increased the number of bootstraps, from 250, 500, to 1000, (3) we used the whole dataset to test the model trained on the 75% data, and (4) repeated SiRF to obtain interactions observed in the primary analysis after randomly permuting the ASD status. For descriptive analysis, we calculated the Pearson correlation matrices of log-transformed and specific gravity-corrected 62 chemicals exposures for ASD and TD children. Missing data in covariates was minimal (< 5%) and were imputed using the R package “mice” 68. A two-tailed p-value less than alpha at 0.05 is considered statistically significant. All data were analyzed in R version 4.1.2. The detailed mathematical exposition of the algorithm can be found in48. In addition, the tuning parameters in WQS-SiRF and random seeds for training and testing data are provided in the supplemental materials. ## Results The list of all 62 chemicals was presented in Supplemental Table S1, and their LODs (and % detected above LOD) were presented in Supplemental Table S2. Supplemental Table S3 presents the log-transformed and specific-gravity-corrected urinary concentrations of all 62 chemicals for ASD and TD children. Among 62 chemicals, 42 had more than 60% detection rate above LOD (Supplemental Table S2). The specific gravity-adjusted concentration levels and the correlation matrices of the chemicals were presented in Figure 2. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/02/04/2023.02.02.23285222/F3.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2023/02/04/2023.02.02.23285222/F3) Figure 2: Specific gravity-adjusted and log-transformed (base 2) mean concentration and correlation plot of urinary chemicals There were moderate to strong (0.3 to 0.7) within-group correlations among pesticides and phenols. The distributions of the child’s sex and race/ethnicity were not significantly different between ASD and TD children (Table 1). Further, there was no significant difference in parental homeowner status. However, children with ASD were more likely to be older at their age of assessment, and their mothers were more likely to have any hypertensive disorder or diabetes for any BMI category. The chemical concentrations of Methyl Paraben, Diethyl-phosphate, and Propylparaben (the top three chemicals based on weights from WQS) were significantly higher in children with ASD. View this table: [Table 1:](http://medrxiv.org/content/early/2023/02/04/2023.02.02.23285222/T1) Table 1: Characteristics of mothers and children included in the analysis from the CHARGE cohort. ### WQS- SiRF result In the WQS model (with binary outcome ASD vs. TD and without any interaction term), the mixture index was significantly associated with higher odds of ASD (OR[95% CI]: 1.58[1.32, 1.88]). There were 20 chemicals with higher than chance contribution (weight > 1/62) to the overall mixture effect. The top five chemicals were Methyl Paraben, Diethyl-phosphate, Propylparaben, trace-metal Uranium, and Bisphenol F. The estimated weights (and the corresponding 95% CIs) were presented in Figure 3. ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/02/04/2023.02.02.23285222/F4.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2023/02/04/2023.02.02.23285222/F4) Figure 3: Results from WQS chemical mixture regression. (A) the overall OR between the WQS chemical mixture with ASD vs. TD status and (B) the corresponding weights contributed to the overall OR (note that the weights sum up to 1). WQS-SiRF searched for interactions of multiple orders (>= 2) but eventually found two synergistic two-ordered interactions with more than 75% stability. The interactions were 1) urinary trace element Cadmium (Cd) and alkyl-phosphate pesticide – Diethyl-phosphate (DEP), denoted by Cd/DEP; and 2) environmental Phenol 2,4,6-Trichlorophenol (TCP-246) and DEP, denoted by TCP-246/DEP. However, both interactions were only observed in a subset of the sample whose urinary chemical concentrations of Cd, DEP, and TCP-246 were higher than certain thresholds. Therefore, based on a 75th percentile threshold cutoff, we created two separate interaction indicators to test these discovered interactions for association analysis. For example, if both the specific gravity-adjusted concentrations of Cd and DEP were more than the 75th percentile, then the interaction indicator Cd/DEP would be non-zero; else, it would be zero. In the sample, the estimated prevalence of these interactions was 5% and 8.4% for Cd/DEP and TCP-246/DEP, respectively. The results of SiRF from all three different data partitions were presented in Supplemental Table S4. In two separate adjusted models (after controlling for the main WQS chemical mixture and covariates), each interaction indicator was associated with increased odds of ASD, 2.60[0.90, 7.50] and 1.14[0.55, 2.38] for Cd/DEP and TCP-246+/DEP respectively. Find all the ORs and corresponding CIs in the forest plot in Figure 4. Among the two interactions, Cd/DEP had the strongest association, and in all the models, the WQS chemical mixture remained statistically significant, with just a slight change in the ORs. ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/02/04/2023.02.02.23285222/F5.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2023/02/04/2023.02.02.23285222/F5) Figure 4: Results from nested linear models with WQS and discovered interaction indicators (cutoff set at 75th percentile) and WQS chemical mixture. In the top three models, the WQS chemical mixture and the two interaction indicators were used in separate models. Both interaction indicators were adjusted with the WQS chemical mixture in the following two models. In the last model, interaction indicators and the WQS chemical mixture were put in the same model. All models were adjusted for covariates. In the sensitivity analyses, (1) the interactions Cd/DEP and TCP-246/DEP were replicated when the WQS-SiRF algorithm was re-fitted without chemicals whose % of detection above LOD was less than 60% (Supplemental Table S5). (2) Furthermore, the gradual increase in the number of bootstraps, from 250, 500, to 1000, did not alter the results. Both the discovered interactions remained unaltered when the whole dataset was used to test the model trained on 75% data (Supplemental Table S6). Moreover, (3) the directionality of the ORs did not alter even when the interaction threshold of the 75th percentile was changed to the 67th percentile (Supplemental Figure S1), and (4) the interactions Cd/DEP and TCP-246/DEP were not found in the permutation tests. ## Discussion We leveraged data from the CHARGE study to assess the synergistic interactions between environmental chemicals, pesticides, phthalates, phenols, and trace elements and ASD. Utilizing WQS-SiRF, we found two suggestive synergistic interactions associated with increased odds of ASD diagnosis between (1) Cd and DEP and (2) 2,4,6-trichlorophenol and DEP among children with the urinary concentration of interacting chemicals over certain thresholds. When controlled for the main WQS mixture and the necessary covariates, cadmium/DEP and TCP-246/DEP were associated with increased odds of ASD, respectively. Between the two interactions, cadmium/DEP had the strongest association and was previously shown to form chemical complexes. The identified interactions could be experimentally testable and, therefore, biologically meaningful. This paper is a continuation of the study of the main effects by Bennet et al.13, which concluded that many urinary chemicals were associated with increased odds of ASD at 2-5 years of age. The present study adds value by examining multi-ordered synergistic interactions between exposures to pesticides, phthalates, phenols, and trace elements and ASD and providing evidence for suggestive synergistic interactions observed between Cd/DEP and TCP-246/DEP. There are few studies on interactions associated with ASD, including gene-environment70, social71, and chemical4 factors. Moreover, there is a lack of studies demonstrating chemical-chemical interactions in this context. Previous studies have shown an association between heavy metals, like cadmium, and ASD72, 73. Kern et al. discovered that cadmium and other trace elements were significantly lower in the hair of children with autism than others74. This supports the concept that children with autism may have issues excreting cadmium, resulting in a higher body burden that could contribute to symptoms of autism74, 75. Children could be exposed to cadmium through inhalation and ingestion. It is commonly found in the food chain, soil, cigarette smoke, and manufactured products73. Research on pesticide exposure during childhood, specifically glyphosate76, 77, chlorpyrifos77, diazinon77, and the development of ASD continues to emerge78-80. Potential routes of pesticide exposure in children include food contaminated with pesticides (ingestion), in utero or through breastmilk, and household exposures via dermal contact81, 82. However, there is a lack of studies showing any associations between the interaction of DEP and TCP-246 with ASD. Regarding possible biochemical significance, the cation, Cd2+ forms a complex with phosphate ester, particularly with DEP (C4H10O4P-), forming cadmium diethyl phosphate, C4H10CdO4P-83, 84. Although for the TCP-246+/DEP+ interaction, many details are not known, a chemical complex “2,4,6-trichlorophenyl dialkyl phosphate” was patented (in 1952) for use as parasiticides and control of agricultural and household pests through aqueous suspensions employed as sprays85. However, the activities of both chemical complexes in biological media are not known in detail. We acknowledge the study’s limitations. (1) The urine samples were collected post-diagnosis, i.e., months and sometimes years after their symptoms emerged. In rare cases, urine samples were collected at the diagnosis. Therefore we cannot rule out reverse causation, with the associations reflecting lifestyle patterns. In addition, urinary measurements of the organic compounds assessed in this study represent recent exposures due to their half-lives. Ideally, the urine samples should have been assessed with repeated samples collected at various time points13, 86-88. As a result, we are uncertain whether these chemical interactions directly contribute to ASD diagnosis. (2) Because of the limited sample size, we did not study potential sex-specific associations with ASD diagnosis, although sexually dimorphic effects are well documented.2 (3) Additionally, we used the same confounders used in the original analysis by Bennet et al.13. However, these confounders were selected based on them being confounders to MEPB because it has one of the strongest associations in the unadjusted model. (4) Similar to large case-control studies, residual confounding is possible. However, our results remained unaltered after adjusting for multiple confounders and covariates, negating residual confounding as the sole explanation. (5) The choice of cutoffs at the 75th or 67th percentile is ad-hoc and sample-specific and therefore needs to be replicated in a separate independent study population. Further, using random intersection trees within the SiRF algorithm makes it difficult to extract the absolute threshold cutoffs directly. Therefore given these methodological challenges, there lies a strong potential for further developments attuned to specific problems. (6) In the present analysis, the same chemicals were used in the WQS and then again in the SiRF, raising the possibility of over-fitting. A training, testing, and validation data split in an ideal large sample scenario would potentially guard against overfitting. However, in this moderate sample-sized study, the use of random subsets and repeated holdouts in training and testing samples of WQS and the drawing of a large number of bootstrapped samples with different training and testing splits in the SiRF could potentially induce a robust guard against overfitting. However, our study has several strengths. First, CHARGE is a well-established case-control study with extensive demographic and covariate data. Further, it allowed us to assess a wide range of environmental chemical exposures in children 2-5 years of age, along with available data on ASD with a moderate sample size. Second, this is the first study to combine exposure mixture methods and machine learning tools to discover interactions that mimic classical threshold-based toxicological dose-response interactions, providing a meaningful way to extract plausible mechanistic insight. Third, these toxicologically mimicking interactions are only present in a subset of the sample, therefore, can be thought of as “personalized and precision” interactions. Fourth, WQS-SiRF can efficiently search for high-order interactions; therefore, the intended order does not need to be specified beforehand. Fifth, in terms of practical implementation, the WQS-SiRF algorithm is relatively fast and user-friendly, with both having robust R packages. In conclusion, we introduced a novel way of discovering threshold-based interactions. To the best of our knowledge, this is the first paper that combines the inferential power of WQS and the predictive accuracy of a machine-learning algorithm to discover threshold-based personalized biologically suggestive interactions among urinary biomarkers associated with higher odds of ASD. ## Supporting information Supplementary Information [[supplements/285222_file02.docx]](pending:yes) ## Data Availability All data produced are available online at Human Health Exposure Analysis Resource (HHEAR) Data Center at the Icahn School of Medicine at Mount Sinai [https://hheardatacenter.mssm.edu/](https://hheardatacenter.mssm.edu/) ## Acknowledgment We want to thank the Human Health Exposure Analysis Resource (HHEAR) Data Center at the Icahn School of Medicine at Mount Sinai for the availability of open-source data and the CHARGE study participants and researchers for making this work possible. * Received February 2, 2023. * Revision received February 2, 2023. * Accepted February 4, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Association AP. DSM 5: American Psychiatric Association; 2013. 2. 2.Maenner MJ, Shaw KA, Bakian AV, Bilder DA, Durkin MS, Esler A, Furnier SM, Hallas L, Hall-Lande J and Hudson A. Prevalence and characteristics of autism spectrum disorder among children aged 8 years—autism and developmental disabilities monitoring network, 11 sites, United States, 2018. MMWR Surveillance Summaries. 2021;70:1. 3. 3.Maenner MJ, Shaw KA, Baio J, Washington A, Patrick M, DiRienzo M, Christensen DL, Wiggins LD, Pettygrove S and Andrews JG. Prevalence of autism spectrum disorder among children aged 8 years—autism and developmental disabilities monitoring network, 11 sites, United States, 2016. MMWR Surveillance summaries. 2020;69:1. 4. 4.Kalkbrenner AE, Schmidt RJ and Penlesky AC. Environmental chemical exposures and autism spectrum disorders: a review of the epidemiological evidence. Current problems in pediatric and adolescent health care. 2014;44:277-318. 5. 5.Lam J, Sutton P, Kalkbrenner A, Windham G, Halladay A, Koustas E, Lawler C, Davidson L, Daniels N and Newschaffer C. A systematic review and meta-analysis of multiple airborne pollutants and autism spectrum disorder. Plos One. 2016;11:e0161851. 6. 6.Gong T, Dalman C, Wicks S, Dal H, Magnusson C, Lundholm C, Almqvist C and Pershagen G. Perinatal exposure to traffic-related air pollution and autism spectrum disorders. Environ Health Persp. 2017;125:119–126. 7. 7.Raz R, Levine H, Pinto O, Broday DM and Weisskopf MG. Traffic-related air pollution and autism spectrum disorder: a population-based nested case-control study in Israel. American journal of epidemiology. 2018;187:717–725. 8. 8.Ritz B, Liew Z, Yan Q, Cuia X, Virk J, Ketzel M and Raaschou-Nielsen O. Air pollution and autism in Denmark. Environmental Epidemiology (Philadelphia, Pa). 2018;2:e028. 9. 9.Pagalan L, Bickford C, Weikum W, Lanphear B, Brauer M, Lanphear N, Hanley GE, Oberlander TF and Winters M. Association of prenatal exposure to air pollution with autism spectrum disorder. JAMA pediatrics. 2019;173:86–92. 10. 10.Rossignol DA, Genuis SJ and Frye RE. Environmental toxicants and autism spectrum disorders: a systematic review. Translational psychiatry. 2014;4:e360–e360. 11. 11.Grandjean P and Landrigan PJ. Developmental neurotoxicity of industrial chemicals. The Lancet. 2006;368:2167–2178. 12. 12.Wang M, Hossain F, Sulaiman R and Ren X. Exposure to inorganic arsenic and lead and autism spectrum disorder in children: a systematic review and meta-analysis. Chemical research in toxicology. 2019;32:1904–1919. 13. 13.Bennett DH, Busgang SA, Kannan K, Parsons PJ, Takazawa M, Palmer CD, Schmidt RJ, Doucette JT, Schweitzer JB, Gennings C and Hertz-Picciotto I. Environmental exposures to pesticides, phthalates, phenols and trace elements are associated with neurodevelopment in the CHARGE study. Environ Int. 2022;161:107075. 14. 14.Harley KG, Gunier RB, Kogut K, Johnson C, Bradman A, Calafat AM and Eskenazi B. Prenatal and early childhood bisphenol A concentrations and behavior in school-aged children. Environ Res. 2013;126:43–50. [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000326135200006&link_type=ISI) 15. 15.Braun JM. Early-life exposure to EDCs: role in childhood obesity and neurodevelopment. Nat Rev Endocrinol. 2017;13:161–173. 16. 16.Philippat C, Nakiwala D, Calafat AM, Botton J, De Agostini M, Heude B and Slama R. Prenatal Exposure to Nonpersistent Endocrine Disruptors and Behavior in Boys at 3 and 5 Years. Environ Health Perspect. 2017;125:097014. 17. 17.Landrigan PJ. What causes autism? Exploring the environmental contribution. Current opinion in pediatrics. 2010;22:219–225. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/MOP.0b013e328336eb9a&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20087185&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 18. 18.Engel SM and Daniels JL. On the complex relationship between genes and environment in the etiology of autism. Epidemiology. 2011;22:486–488. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/EDE.0b013e31821daf1c&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21642774&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 19. 19.Curtin P, Austin C, Curtin A, Gennings C, Arora M, Tammimies K, Willfors C, Berggren S, Siper P, Rai D, Meyering K, Kolevzon A, Mollon J, David AS, Lewis G, Zammit S, Heilbrun L, Palmer RF, Wright RO, Bolte S and Reichenberg A. Dynamical features in fetal and postnatal zinc-copper metabolic cycles predict the emergence of autism spectrum disorder. Sci Adv. 2018;4:eaat1293. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo4OiJhZHZhbmNlcyI7czo1OiJyZXNpZCI7czoxMjoiNC81L2VhYXQxMjkzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDIvMDQvMjAyMy4wMi4wMi4yMzI4NTIyMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 20. 20.Austin C, Curtin P, Arora M, Reichenberg A, Curtin A, Iwai-Shimada M, Wright RO, Wright RJ, Remnelius KL, Isaksson J, Bölte S and Nakayama SF. Elemental Dynamics in Hair Accurately Predict Future Autism Spectrum Disorder Diagnosis: An International Multi-Center Study. Journal of Clinical Medicine. 2022;11:7154. 21. 21.Gennings C. ON TESTING FOR DRUG/CHEMICAL INTERACTIONS: DEFINITIONS AND INFERENCE. Journal of Biopharmaceutical Statistics. 2000;10:457–467. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1081/BIP-100101978&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11104387&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 22. 22.Lee M, Rahbar MH, Samms-Vaughan M, Bressler J, Bach MA, Hessabi M, Grove ML, Shakespeare-Pellington S, Coore Desai C, Reece J-A, Loveland KA and Boerwinkle E. A generalized weighted quantile sum approach for analyzing correlated data in the presence of interactions. Biometrical Journal. 2019;61:934–954. 23. 23.Rahbar MH, Samms-Vaughan M, Kim S, Saroukhani S, Bressler J, Hessabi M, Grove ML, Shakspeare-Pellington S and Loveland KA. Detoxification Role of Metabolic Glutathione S-Transferase (GST) Genes in Blood Lead Concentrations of Jamaican Children with and without Autism Spectrum Disorder. Genes. 2022;13:975. 24. 24.Colicino E, Pedretti NF, Busgang SA and Gennings C. Per- and poly-fluoroalkyl substances and bone mineral density: Results from the Bayesian weighted quantile sum regression. Environ Epidemiol. 2020;4:e092. 25. 25.Kowal DR, Bravo M, Leong H, Bui A, Griffin RJ, Ensor KB and Miranda ML. Bayesian variable selection for understanding mixtures in environmental exposures. Statistics in Medicine. 2021;40:4850–4871. 26. 26.Carrico C, Gennings C, Wheeler DC and Factor-Litvak P. Characterization of Weighted Quantile Sum Regression for Highly Correlated Data in a Risk Analysis Setting. J Agric Biol Environ Stat. 2015;20:100–120. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 27. 27.Keil AP, Buckley JP, O’Brien KM, Ferguson KK, Zhao S and White AJ. A quantile-based g-computation approach to addressing the effects of exposure mixtures. Environmental health perspectives. 2020;128:047004. 28. 28.Bobb JF, Valeri L, Claus Henn B, Christiani DC, Wright RO, Mazumdar M, Godleski JJ and Coull BA. Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics. 2015;16:493–508. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biostatistics/kxu058&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25532525&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 29. 29.Bellavia A, Dickerson AS, Rotem RS, Hansen J, Gredal O and Weisskopf MG. Joint and interactive effects between health comorbidities and environmental exposures in predicting amyotrophic lateral sclerosis. International Journal of Hygiene and Environmental Health. 2021;231:113655. 30. 30.Antonelli J, Mazumdar M, Bellinger D, Christiani D, Wright R and Coull B. Estimating the health effects of environmental mixtures using Bayesian semiparametric regression and sparsity inducing priors. The Annals of Applied Statistics. 2020;14:257-275, 19. 31. 31.McGee G, Wilson A, Webster TF and Coull BA. Bayesian multiple index models for environmental mixtures. Biometrics. 2021;n/a. 32. 32.Liu JZ, Deng W, Lee J, Lin P-iD, Valeri L, Christiani DC, Bellinger DC, Wright RO, Mazumdar MM and Coull BA. A Cross-Validated Ensemble Approach to Robust Hypothesis Testing of Continuous Nonlinear Interactions: Application to Nutrition-Environment Studies. Journal of the American Statistical Association. 2022;117:561–573. 33. 33.Ferrari F and Dunson DB. Bayesian Factor Analysis for Inference on Interactions. Journal of the American Statistical Association. 2021;116:1521–1532. 34. 34.Karl Kumbier SB, James B. Brown, Susan Celniker, Bin Yu. Refining interaction search through signed iterative Random Forests. 2018. 35. 35.Hamm AK, Hans Carter Jr W and Gennings C. Analysis of an interaction threshold in a mixture of drugs and/or chemicals. Statistics in Medicine. 2005;24:2493–2507. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15889451&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 36. 36.Yeatts SD, Gennings C, Wagner ED, Simmons JE and Plewa MJ. Detecting Departure From Additivity Along a Fixed-Ratio Mixture Ray With a Piecewise Model for Dose and Interaction Thresholds. J Agric Biol Environ Stat. 2010;15:510–522. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21359103&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 37. 37.Gennings C, Schwartz P, Carter WH and Simmons JE. Detection of Departures from Additivity in Mixtures of Many Chemicals with a Threshold Model. Journal of Agricultural, Biological, and Environmental Statistics. 1997;2:198–211. 38. 38.Gibson EA. Statistical and Machine Learning Methods for Pattern Identification in Environmental Mixtures. 2021:182. 39. 39.Joubert BR, Kioumourtzoglou M-A, Chamberlain T, Chen HY, Gennings C, Turyk ME, Miranda ML, Webster TF, Ensor KB, Dunson DB and Coull BA. Powering Research through Innovative Methods for Mixtures in Epidemiology (PRIME) Program: Novel and Expanded Statistical Methods. International Journal of Environmental Research and Public Health. 2022;19:1378. 40. 40.Colicino E, Ferrari F, Cowell W, Niedzwiecki MM, Foppa Pedretti N, Joshi A, Wright RO and Wright RJ. Non-linear and non-additive associations between the pregnancy metabolome and birthweight. Environment International. 2021;156:106750. 41. 41.Galit S. To Explain or to Predict? Statistical Science. 2010;25:289–310. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1214/10-STS330&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000286550700002&link_type=ISI) 42. 42.Lampa E, Lind L, Lind PM and Bornefalk-Hermansson A. The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees. Environmental Health. 2014;13:57. 43. 43.Stingone JA, Pandey OP, Claudio L and Pandey G. Using machine learning to identify air pollution exposure profiles associated with early cognitive skills among U.S. children. Environmental Pollution. 2017;230:730–740. 44. 44.Gass K, Klein M, Chang HH, Flanders WD and Strickland MJ. Classification and regression trees for epidemiologic research: an air pollution example. Environmental Health. 2014;13:17. 45. 45.Ouidir M, Lepeule J, Siroux V, Malherbe L, Meleux F, Rivière E, Launay L, Zaros C, Cheminat M, Charles M-A and Slama R. Is atmospheric pollution exposure during pregnancy associated with individual and contextual characteristics? A nationwide study in France. Journal of Epidemiology and Community Health. 2017;71:1026. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiamVjaCI7czo1OiJyZXNpZCI7czoxMDoiNzEvMTAvMTAyNiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzAyLzA0LzIwMjMuMDIuMDIuMjMyODUyMjIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 46. 46.Li Y-C, Hsu H-HL, Chun Y, Chiu P-H, Arditi Z, Claudio L, Pandey G and Bunyavanich S. Machine learning–driven identification of early-life air toxic combinations associated with childhood asthma outcomes. The Journal of Clinical Investigation. 2021;131. 47. 47.Shah RD and Meinshausen N. Random intersection trees. The Journal of Machine Learning Research. 2014;15:629–654. 48. 48.Basu S, Kumbier K, Brown JB and Yu B. Iterative random forests to discover predictive and stable high-order interactions. Proceedings of the National Academy of Sciences. 2018;115:1943–1948. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMDoiMTE1LzgvMTk0MyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzAyLzA0LzIwMjMuMDIuMDIuMjMyODUyMjIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 49. 49.Hertz-Picciotto I, Croen LA, Hansen R, Jones CR, Van de Water J and Pessah IN. The CHARGE study: an epidemiologic investigation of genetic and environmental factors contributing to autism. Environ Health Persp. 2006;114:1119–1125. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1289/ehp.8483&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16835068&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000239035100046&link_type=ISI) 50. 50.Asimakopoulos AG, Thomaidis NS and Kannan K. Widespread occurrence of bisphenol A diglycidyl ethers, p-hydroxybenzoic acid esters (parabens), benzophenone type-UV filters, triclosan, and triclocarban in human urine from Athens, Greece. Sci Total Environ. 2014;470-471:1243–9. 51. 51.Li AJ, Xue J, Lin S, Al-Malki AL, Al-Ghamdi MA, Kumosani TA and Kannan K. Urinary concentrations of environmental phenols and their association with type 2 diabetes in a population in Jeddah, Saudi Arabia. Environ Res. 2018;166:544–552. 52. 52.Rocha BA, Asimakopoulos AG, Honda M, da Costa NL, Barbosa RM, Barbosa F, Jr.. and Kannan K. Advanced data mining approaches in the assessment of urinary concentrations of bisphenols, chlorophenols, parabens and benzophenones in Brazilian children and their association to DNA damage. Environ Int. 2018;116:269–277. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.envint.2018.04.023&link_type=DOI) 53. 53.Li AJ, Martinez-Moral M-P, Al-Malki AL, Al-Ghamdi MA, Al-Bazi MM, Kumosani TA and Kannan K. Mediation analysis for the relationship between urinary phthalate metabolites and type 2 diabetes via oxidative stress in a population in Jeddah, Saudi Arabia. Environ Int. 2019;126:153–161. 54. 54.Rocha BA, Asimakopoulos AG, Barbosa Jr F and Kannan K. Urinary concentrations of 25 phthalate metabolites in Brazilian children and their association with oxidative DNA damage. Sci Total Environ. 2017;586:152–162. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.scitotenv.2017.01.193&link_type=DOI) 55. 55.Li AJ, Banjabi AA, Takazawa M, Kumosani TA, Yousef JM and Kannan K. Serum concentrations of pesticides including organophosphates, pyrethroids and neonicotinoids in a population with osteoarthritis in Saudi Arabia. Sci Total Environ. 2020;737:139706. 56. 56.Minnich MG, Miller DC and Parsons PJ. Determination of As, Cd, Pb, and Hg in urine using inductively coupled plasma mass spectrometry with the direct injection high efficiency nebulizer. Spectrochimica Acta Part B: Atomic Spectroscopy. 2008;63:389–395. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.sab.2007.11.033&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000254368400005&link_type=ISI) 57. 57.Hauser R, Meeker JD, Park S, Silva MJ and Calafat AM. Temporal variability of urinary phthalate metabolite levels in men of reproductive age. Environmental health perspectives. 2004;112:1734–1740. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1289/ehp.7212&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15579421&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000225781300052&link_type=ISI) 58. 58.Le Couteur A, Lord C and Rutter M. The autism diagnostic interview-revised (ADI-R). Los Angeles, CA: Western Psychological Services. 2003:659-85. 59. 59.Lord C, Rutter M and Le Couteur A. Autism Diagnostic Interview-Revised: a revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. Journal of autism and developmental disorders. 1994;24:659–685. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF02172145&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7814313&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1994PM02100009&link_type=ISI) 60. 60.Lord C, Pickles A, McLennan J, Rutter M, Bregman J, Folstein S, Fombonne E, Leboyer M and Minshew N. Diagnosing autism: analyses of data from the Autism Diagnostic Interview. Journal of autism and developmental disorders. 1997;27:501–517. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/A:1025873925661&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9403369&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1997YJ08100002&link_type=ISI) 61. 61.Lord C, Risi S, Lambrecht L, Cook EH, Leventhal BL, DiLavore PC, Pickles A and Rutter M. The Autism Diagnostic Observation Schedule—Generic: A standard measure of social and communication deficits associated with the spectrum of autism. Journal of autism and developmental disorders. 2000;30:205–223. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/A:1005592401947&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11055457&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000089674100004&link_type=ISI) 62. 62.Risi S, Lord C, Gotham K, Corsello C, Chrysler C, Szatmari P, Cook Jr EH, Leventhal BL and Pickles A. Combining information from multiple sources in the diagnosis of autism spectrum disorders. Journal of the American Academy of Child & Adolescent Psychiatry. 2006;45:1094–1103. 63. 63.Rutter M, Bailey A and Lord C. The social communication questionnaire: Manual: Western Psychological Services; 2003. 64. 64.Tanner EM, Bornehag C-G and Gennings C. Repeated holdout validation for weighted quantile sum regression. MethodsX. 2019;6:2855–2860. 65. 65.Curtin P, Kellogg J, Cech N and Gennings C. A random subset implementation of weighted quantile sum (WQSRS) regression for analysis of high-dimensional mixtures. Communications in Statistics - Simulation and Computation. 2021;50:1119–1134. 66. 66.Agresti A. Categorical data analysis: John Wiley & Sons; 2003. 67. 67.Breiman L. Random forests. Machine learning. 2001;45:5–32. 68. 68.van Buuren S and Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software. 2011;45:1–67. 69. 69.Krakowiak P, Walker CK, Bremer AA, Baker AS, Ozonoff S, Hansen RL and Hertz-Picciotto I. Maternal Metabolic Conditions and Risk for Autism and Other Neurodevelopmental Disorders. Pediatrics. 2012;129:e1121–e1128. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1542/peds.2011-2583&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22492772&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000304709600002&link_type=ISI) 70. 70.Tordjman S, Somogyi E, Coulon N, Kermarrec S, Cohen D, Bronsard G, Bonnot O, Weismann-Arcache C, Botbol M and Lauth B. Gene× Environment interactions in autism spectrum disorders: role of epigenetic mechanisms. Frontiers in psychiatry. 2014;5:53. 71. 71.Assaf M, Hyatt CJ, Wong CG, Johnson MR, Schultz RT, Hendler T and Pearlson GD. Mentalizing and motivation neural function during social interactions in autism spectrum disorders. Neuroimage Clin. 2013;3:321–31. 72. 72.Shiani A, Sharafi K, Omer AK, Kiani A, Karamimatin B, Massahi T and Ebrahimzadeh G. A systematic literature review on the association between exposures to toxic elements and an autism spectrum disorder. Sci Total Environ. 2023;857:159246. 73. 73.Sulaiman R, Wang M and Ren X. Exposure to Aluminum, Cadmium, and Mercury and Autism Spectrum Disorder in Children: A Systematic Review and Meta-Analysis. Chem Res Toxicol. 2020;33:2699–2718. 74. 74.Kern JK, Grannemann BD, Trivedi MH and Adams JB. Sulfhydryl-reactive metals in autism. J Toxicol Environ Health A. 2007;70:715–21. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/15287390601188060&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17365626&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 75. 75.Yorbik O, Kurt I, Haşimi A and Oztürk O. Chromium, cadmium, and lead levels in urine of children with autism and typically developing controls. Biol Trace Elem Res. 2010;135:10–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s12011-009-8494-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19688188&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 76. 76.Ongono JS, Béranger R, Baghdadli A and Mortamais M. Pesticides used in Europe and autism spectrum disorder risk: can novel exposure hypotheses be formulated beyond organophosphates, organochlorines, pyrethroids and carbamates? - A systematic review. Environ Res. 2020;187:109646. 77. 77.von Ehrenstein OS, Ling C, Cui X, Cockburn M, Park AS, Yu F, Wu J and Ritz B. Prenatal and infant exposure to ambient pesticides and autism spectrum disorder in children: population based case-control study. Bmj. 2019;364. 78. 78.Shelton JF, Hertz-Picciotto I and Pessah IN. Tipping the balance of autism risk: potential mechanisms linking pesticides and autism. Environ Health Persp. 2012;120:944–951. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1289/ehp.1104553&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22534084&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000306035300017&link_type=ISI) 79. 79.Miani A, Imbriani G, De Filippis G, De Giorgi D, Peccarisi L, Colangelo M, Pulimeno M, Castellone MD, Nicolardi G, Logroscino G and Piscitelli P. Autism Spectrum Disorder and Prenatal or Early Life Exposure to Pesticides: A Short Review. Int J Environ Res Public Health. 2021;18. 80. 80.Biosca-Brull J, Pérez-Fernández C, Mora S, Carrillo B, Pinos H, Conejo NM, Collado P, Arias JL, Martín-Sánchez F, Sánchez-Santed F and Colomina MT. Relationship between Autism Spectrum Disorder and Pesticides: A Systematic Review of Human and Preclinical Models. Int J Environ Res Public Health. 2021;18. 81. 81.Liu J and Schelar E. Pesticide exposure and child neurodevelopment: summary and implications. Workplace Health Saf. 2012;60:235–42; quiz 243. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22587699&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 82. 82.Chalupka S and Chalupka AN. The impact of environmental and occupational exposures on reproductive health. J Obstet Gynecol Neonatal Nurs. 2010;39:84–102. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1552-6909.2009.01091.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20409107&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) 83. 83.Information NCfB. PubChem Compound Summary for CID 129652268, Cadmium diethylphosphate. 2022;2022 84. 84.Miner VW, Prestegard JH and Faller JW. Cadmium diethyl phosphate: structure determination and comparison to cation phospholipid complexes. Inorganic Chemistry. 1983;22:1862–1865. 85. 85.Drake LR and Erbel AJ. 2,4,6-trichlorophenyl dialkylphosphates. [https://patentsgooglecom/patent/US2599375A/en#citedBy.US2599375A](https://patentsgooglecom/patent/US2599375A/en#citedBy.US2599375A). 86. 86.Hoppin JA, Brock JW, Davis BJ and Baird DD. Reproducibility of urinary phthalate metabolites in first morning urine samples. Environ Health Perspect. 2002;110:515–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1289/ehp.02110515&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12003755&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000175626400027&link_type=ISI) 87. 87.Barr DB, Wang RY and Needham LL. Biologic monitoring of exposure to environmental chemicals throughout the life stages: requirements and issues for consideration for the National Children’s Study. Environ Health Perspect. 2005;113:1083–91. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1289/ehp.7617&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16079083&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000230941100050&link_type=ISI) 88. 88.Perrier F, Giorgis-Allemand L, Slama R and Philippat C. Within-subject Pooling of Biological Samples to Reduce Exposure Misclassification in Biomarker-based Studies. Epidemiology. 2016;27:378–88. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/EDE.0000000000000460&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27035688&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F04%2F2023.02.02.23285222.atom)