PT - JOURNAL ARTICLE
AU - Karamarie Fecho
AU - Perry Haaland
AU - Ashok Krishnamurthy
AU - Bo Lan
AU - Stephen A. Ramsey
AU - Patrick L. Schmitt
AU - Priya Sharma
AU - Meghamala Sinha
AU - Hao Xu
TI - An Approach for Open Multivariate Analysis of Integrated Clinical and Environmental Exposures Data
AID - 10.1101/2021.06.30.21259727
DP - 2021 Jan 01
TA - medRxiv
PG - 2021.06.30.21259727
4099 - http://medrxiv.org/content/early/2021/07/05/2021.06.30.21259727.short
4100 - http://medrxiv.org/content/early/2021/07/05/2021.06.30.21259727.full
AB - The Integrated Clinical and Environmental Exposures Service (ICEES) provides regulatory-compliant open access to sensitive patient data that have been integrated with public exposures data. ICEES was designed initially to support dynamic cohort creation and bivariate contingency tests. The objective of the present study was to develop an open approach to support multivariate analyses using existing ICEES functionalities and abiding by all regulatory constraints. We first developed an open approach for generating a multivariate table that maintains contingencies between clinical and environmental variables using programmatic calls to the open ICEES application programming interface. We then applied the approach to data on a large cohort (N = 22,365) of patients with asthma or related conditions and generated an eight-feature table. Due to regulatory constraints, data loss was incurred with the incorporation of each successive feature variable, from a starting sample size of N = 22,365 to a final sample size of N = 4,556 (20.5%), but data loss was < 10% until the addition of the final two feature variables. We then applied a generalized linear model to the resulting dataset and focused on the impact of seven select feature variables on asthma exacerbations, defined as annual emergency department or inpatient visits for respiratory issues.
We identified five feature variables (sex, race, obesity, prednisone, and airborne particulate exposure) as significant predictors of asthma exacerbations. We discuss the advantages and disadvantages of ICEES open multivariate analysis and conclude that, despite limitations, ICEES can provide a valuable resource for open multivariate analysis and can serve as an exemplar for regulatory-compliant informatics solutions to open patient data, with capabilities to explore the impact of environmental exposures on health outcomes.
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: This project was funded with awards from the National Center for Advancing Translational Sciences, National Institutes of Health [OT3TR002020, OT2TR003430, UL1TR002489, UL1TR002489-03S4] and the Clinical Research Branch, Intramural Research Program of the National Institute of Environmental Health Sciences, National Institutes of Health [ZID ES103354-01].
Author Declarations:
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below: All study procedures have been approved by the Institutional Review Board at the University of North Carolina at Chapel Hill (protocol #16-2978).
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes
The ICEES API openly exposed clinical data that have been integrated with environmental exposures data