Adjusting for the progressive digitization of health records: working examples on a multi-hospital clinical data warehouse

Adam Remaki; Benoît Playe; Paul Bernard; Simon Vittoz; Matthieu Doutreligne; Gilles Chatelier; Etienne Audureau; Emmanuelle Kempf; Raphaël Porcher; Romain Bey

doi:10.1101/2023.08.17.23294220

Abstract

Objectives To propose a new method to account for time-dependent data missingness caused by the increasing digitization of health records in the analysis of large-scale clinical data.

Materials and Methods Following a data-driven approach we modeled the progressive adoption of a common electronic health record in 38 hospitals. To this end, we analyzed data collected between 2013 and 2022 and made available in the clinical data warehouse of the Greater Paris University Hospitals. Depending on the category of data, we worked either at the hospital, department or unit level. We evaluated the performance of this model with a retrospective cohort study. We measured the temporal variations of some quality and epidemiological indicators by successively applying two methods, either a naive analysis or a novel complete-source-only analysis that accounts for digitization-induced missingness.

Results Unrealistic temporal variations of quality and epidemiological indicators were observed when a naive analysis was performed, but this effect was either greatly reduced or disappeared when the complete-source-only method was applied.

Discussion We demonstrated that a data-driven approach can be used to account for missingness induced by the progressive digitization of health records. This work focused on hospitalization, emergency department and intensive care units records, along with diagnostic codes, discharge prescriptions and consultation reports. Other data categories may require specific modeling of their associated data sources.

Conclusions Electronic health records are constantly evolving and new methods should be developed to debias studies that use these unstable data sources.

1 Background and Significance

The ongoing digitization of healthcare is generating more and more routine data. These data contain rich and diverse information about patients, opening up new prospects for research, innovation, quality monitoring and public health surveillance.[1–4] However, their analysis also raises new methodological challenges that differ in many ways from those associated with data collected specifically for research purposes.[5–8] In particular, the rapid pace of digitization leads to frequent variations in data availability, format, and coverage, thereby hampering the conduct of multicenter longitudinal studies, as records collected in different contexts are not always comparable.[9–22] To address this pervasive problem, some studies have proposed tools to facilitate the detection of drifts or sudden changes in statistical distributions,[9, 23–26] while others have explored methods to homogenize datasets before their analysis, such as imputing missing data or dropping incomplete cases.[27–29] These seminal studies have paved the way for the analysis of routine data, but many issues remain to be addressed.

First, temporal variations in statistical distributions can be induced by technological changes, but also by other mechanisms. Disentangling technology-induced drifts from variations caused, for example, by reorganizations of care or changes in patients populations seems to be a prerequisite for answering many questions of interest.

Second, the technological changes that need to be accounted for are increasingly complex. For example, clinical data warehouses (CDWs) include more and more healthcare sites, and within those sites, each department may have its own characteristics. In addition, the data being analyzed are increasingly diverse, including diagnostic codes, clinical reports, laboratory tests, imaging, medications, etc., thus multiplying the number of dynamics to consider. Controlling these complex drifts of data distributions by mere visual inspection would therefore mobilize enormous resources, and there is a clear need for some degree of automation.[29]

2 Objective

In this study, we developed and evaluated a new method to automatically handle time-dependent missingness induced by the progressive digitization of hospital health records. We analyzed data collected in the Greater Paris University Hospitals (Assistance Publique-Hôpitaux de Paris, APHP). Among the many mechanisms that may induce missing data, we focused on the effect of the progressive adoption of functionalities used to collect administrative records, diagnostic codes, discharge prescriptions, and consultation reports. Using a data-driven approach, we modeled the pace of their adoption at different levels (hospitals, departments, units). We then evaluated the performance of this detailed model by measuring some quality and epidemiological indicators. We evaluated their stability over time using two different methods, either a naive approach that did not take into account the progressive digitization of health records, or a novel approach, complete-source-only analysis, that took advantage of the previous modeling of this mechanism. This work aimed to answer the following questions:

Can we automatically model the progressive digitization of health records using a data-only approach?
Can we use such a model to adjust for the time-dependent missingness induced by this mechanism when conducting archetypal observational studies?

3 Materials and Methods

The study was reviewed and approved by the institutional review board of the AP-HP (IRB00011591, decision CSE21-33). French regulations do not require written patient consent for this type of research. In accordance with the European General Data Protection Regulation, patients were informed and those who objected to the secondary use of their data for research were excluded from the study. This report follows the REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) reporting guideline in the Section F of the Supplementary Materials.[30]

3.1 Data source

AP-HP includes 38 hospitals spread across the Paris region (22 000 beds, 1.5 million hospitalizations per year). A common electronic health record (EHR) software, ORBIS Dedalus Healthcare, has been progressively adopted since 2012. Each of the six data categories considered in this study is collected using a different EHR functionality (i.e., administrative records related to hospitalizations, emergency departments -ED- or intensive care units -ICU-, diagnostic codes, discharge prescriptions, and consultation reports). The CDW follows the Observational Medical Outcomes Partnership-Common Data Model version 5.4 standard.[31] Data are integrated into the CDW on a daily basis, and this study was conducted on July 31^st, 2023.

3.2 Modeling of the EHR adoption

The adoption of AP-HP’s common EHR is done by functionality, and the collection of each category of data depends on the use of a particular functionality. The functionalities dedicated to hospitalization records are adopted at the hospital level, while other functionalities are typically adopted at the department or unit level (e.g., ED or ICU visits, diagnostic codes, or clinical reports). No curated knowledge base currently provides information on the effective use of each EHR functionality, so we took a data-driven approach to generate it. Extracting the dynamics of EHR adoption from data is not straightforward because of the interplay of different mechanisms: i) the technical introduction of EHR functionalities causes a sudden increase in data availability, but ii) their gradual adoption by clinicians smooths this effect, which is also sometimes blurred by iii) the copying of data collected in an earlier software into the new EHR. Such mechanisms influence the shape of data availability curves, which rarely follow an ideal step shape when plotted over time. However, compared to variations induced by other mechanisms such as changes in clinical practice, the introduction and adoption of new EHR functionalities usually induces abrupt variations in data availability that are more localized in time and within healthcare sites. Therefore, to automatically detect EHR adoption we computed c(t), an estimate of completeness per EHR functionality and per healthcare site (i.e., hospital, department, or unit) for month t, and we fitted step functions to these abruptly changing time series (see Section A.1 of the Supplementary Materials for details).

We adopted two definitions of c(t) depending on the EHR functionality: either the proportion of hospitalization records with at least one data point (to study the EHR functionalities used to collect diagnostic codes and discharge prescriptions) or, when such a denominator was not available, the monthly number of data points divided by its highest value measured during the study period (to study hospitalization records, emergency records, intra-hospitalization visits to ICUs and consultation reports). This modeling provided, for each healthcare site and each EHR functionality, t₀, the estimated date of adoption, and c₀, the stabilized average value of completeness after that date. In the case of visits to ICUs, we also evaluated an alternative modeling that consisted of fitting rectangular functions to the completeness estimates instead of step functions, thus additionally providing t₁, an extinction date corresponding to the end of data collection in a unit. This alternative modeling was motivated by the observation that data collected in a single ICU sometimes disappeared after a certain date due to hospital reorganization (see Figure S1 in the Supplement). Finally, to assess the goodness of fit of our models, we computed an error term defined by the mean squared error between c₀ and c(t) after t₀ (see Figure S2 in the Supplement).

3.3 Quality and epidemiological indicators

EHR data can be used to study retrospectively or to monitor prospectively quality or epidemiological indicators. However, time-dependent missingness can lead to biased estimates of these indicators. In this article, quality indicators were defined as the monthly proportion of hospitalizations for which some outcomes were observed, and epidemiological indicators were defined as the weekly number of hospitalizations related to some seasonal epidemics. Table 1 lists the indicators we considered and the categories of data we used to select each cohort and calculate the outcomes. Bronchiolotis-related and flu-related hospitalizations were selected using J21 and J09, J10, J11 International Classification of Diseases 10^th Revision (ICD-10) codes, respectively.

View this table:

Table 1: Quality indicators and epidemiological indicators.

3.4 Statistical analysis

Continuous variables were reported as medians with interquartile ranges (IQRs), and qualitative variables were reported as numbers and proportions (%). Quality and epidemiological indicators were calculated for the period from t_init, a variable start date, to May 2022, a fixed end date. We chose this end date because of some technical issues affecting the integration of clinical reports into the CDW after this date. The temporal variations of quality indicators were modeled by linear functions: with QI the quality indicator, t the month, α₀ and α₁ parameters characterizing respectively the origin and the linear trend, and ϵ(t) a random error. We estimated model coefficients and 95% confidence intervals (CIs) using ordinary least squares regression. We discussed the linear trend α₁ because it characterizes the temporal variation and could be affected by time-dependent data missingness. The temporal variation of the epidemiological indicators was discussed qualitatively, with particular emphasis on the post-COVID-19 period. Indeed, seasonal epidemics of bronchiolitis and flu were affected by COVID-19 and epidemiological indicators were used to adjust the response of healthcare organizations.

We performed these analyses using either a naive approach (N), which did not take into account the progressive digitalization of healthcare, or a novel approach, which we termed complete-source-only (CSO). To calculate the outcomes, we either used all available data (N) or restricted the analysis to healthcare sites (hospitals, departments, or units) where the EHR functionalities used to collect the required data were considered fully adopted before the start of the study period (CSO method, t₀ ≤ t_init for each healthcare site and each data category used to calculate the outcome, as mentioned in Table 1). For the sake of simplicity, quality indicators were computed using the same denominator for both methods, considering cohorts of hospitalizations that occurred in the 28 of the 38 hospitals where hospitalization records were collected since January 2013 (i.e., with t₀ for hospitalization records prior to that date).

We expected to observe indicators with increasing values when using the naive method, as the progressive adoption of EHR functionalities induces a temporally improving detection of outcomes. On the contrary, the CSO method aimed at stabilizing the data source used to detect outcomes to avoid this spurious temporal drift.

We conducted two sensitivity analyses to examine the quality indicators. First, we varied the value of t_init in [January 2013; January 2016; January 2019]. Second, we performed a subgroup analysis per hospital. Statistical analysis was performed using the Python package statmodels v0.13.5.[37] The modeling of EHR adoption was realized using the Python library EDS-TeVa v0.2.4 that has been made freely available.[38]

4 Results

4.1 Modeling of the EHR adoption

The APHP CDW contains data related to 14.5 million patients. The total amount of data available in the CDW increased over time, following dynamics that varied by data category and reflected the progressive digitization of healthcare (see Figure 1). While administrative records related to hospitalizations showed a stable collection for a decade, the other data categories showed a monotonic increase in collection, reflecting the progressive adoption of new EHR functionalities.

Figure 1:

Temporal evolution of the amount of data collected within the electronic health record (EHR), per functionality. To obtain a common scale, for each functionality the monthly number of data points was divided by the highest value measured during the study period.

As shown in Figure2, we modeled the adoption of these functionalities in each healthcare site by fitting step functions with a level of description appropriate to the adoption mechanism (i.e., hospital, department, or unit). The digitization of hospitalization, ED, and ICU records was abrupt, whereas the adoption of EHR functionalities to capture prescription reports or consultation reports was more gradual. While the data availability curves per hospital and per department were mostly step-shaped, when we looked at the smaller unit level, here looking at ICU records, we observed adoptions that were rectangular (see Figure S1).

Figure 2:

Completeness estimates (plain curves) of arbitrarily selected hospitals and departments considering from top to bottom hospitalization records, emergency department records, consultation reports, discharge prescriptions, and diagnostic codes. Department 1 and Department 2 are located within Hospital 1 and Hospital 2, respectively. Completeness estimate is defined either as the monthly number of data points divided by its maximum value during the study period (hospitalization and emergency department records, and consultation reports), or by the proportion of hospitalization records with at least one data point (diagnostic codes and prescription report). The modeling of the completeness estimate is shown as dashed curves.

4.2 Quality indicators

Figure 3 shows the temporal variations of the quality indicators using either the naive or the complete-source-only method, and choosing three different starting dates t_init. Both methods resulted in similar curves for 30-day rehospitalization, but discrepancies appeared for other indicators. For 30-day ED consultation, 30-day consultation, and discharge prescription, a monotonic increase was observed with the naive method, which disappeared when the CSO method was applied. However, the application of the CSO method reduced the absolute value of the indicators, since the data used to determine the outcomes were filtered to obtain temporal stability of the data sources. The amplitude of this reduction was smaller for more recent starting dates. ICU readmission also showed a monotonic increase when using the naive method, but the application of the Step function CSO method resulted in a monotonically decreasing indicator (Figure S3). The use of a rectangular function CSO, which also takes into account the disappearance of units, resulted in a more stable indicator, as expected.

Figure 3:

Combined effect of varying the initial observation date (t_init) and the analysis (naive in blue and complete-source-only in orange) on the longitudinal study of quality indicators. The linear modeling of temporal variations is indicated by dashed lines. Insets indicate the number of healthcare sites selected in the complete-source-only analysis, along with the proportion they represent of healthcare sites with at least one data point collected by the EHR functionality during the study: January 2013 - May 2022.

As shown in the sensitivity analysis (Figure 4), the stabilizing effect of the CSO method was still observed when the 28 hospitals used for the cohort selection were considered separately. This sensitivity analysis also showed a strong reduction in the amplitude of the indicators induced by the CSO method, consistent with the observations in Figure 3. For 30-day ICU readmission, the sensitivity analysis on the 28 hospitals showed a positive slope for the naive analysis for older baseline data, a negative slope for the Step function CSO method, and no slope for the rectangular-function CSO method (Figure S4). Similar to the other quality indicators, this sensitivity analysis showed that both CSO methods reduced the amplitude of the indicator assessment. All fitted parameters are available in Tables S1 to S6 in the Supplementary Materials.

Figure 4:

Left: slopes α₁ of the linear model (Eq 1) estimated either on all the hospitals (diamond) or considering separately each one of the hospitals (IQR is the box, min and max are the lower and upper whiskers respectively), adopting either the naive (blue) or the complete-source-only (orange) method. Three different initial dates were considered. Right: values of the indicators averaged over the study period.

4.3 Epidemiological indicators

Figure 5 shows the weekly number of bronchiolitis- and flu-related hospitalizations estimated using either all diagnostic codes collected in the EHR (N), or only those collected in departments where the EHR functionality was fully adopted since the beginning of observation (CSO). While the strong impact of the COVID-19 context on these epidemics was observed in both cases, it seemed difficult to interpret the curves when using the naive method, as the COVID-19 effect could not be disentangled from the effect of the progressive EHR adoption. In fact, as can be seen by considering the years before the COVID-19 outbreak, this mechanism caused a spurious increase in the amplitude of the seasonal epidemics.

Figure 5:

Epidemiological indicators computed adopting either the naive (blue) or the complete-source-only (orange) approach and varying the initial observation date (t_init). The start of the COVID-19 epidemic is indicated by a gray line (March 2020). Insets indicate the number of healthcare sites selected in the complete-source-only analysis, along with the proportion they represent of healthcare sites with at least one data point collected by the EHR functionality during the study: January 2013 - May 2022.

5 Discussion

We showed that a multiscale modeling of digitization could be used to mitigate spurious temporal variations in the study of quality and epidemiological indicators. To this end, we discarded all healthcare sites that had not fully adopted the required EHR functionalities since the beginning of the measurement period, and referred to this approach as the complete-source-only method.

The time-dependent variation in completeness that we observed before any correction is consistent with previous work.[11] Our adjustment method is close to the well-known complete-case method, with the main difference being that we adopted a per-healthcare site approach instead of a per-case approach in order to leverage our understanding of the missingness mechanism induced by progressive digitization.[27] By taking advantage of a specific pattern that emerged when adopting this description, i.e., the abrupt increase or decrease in data availability, we were able to specifically adjust for the time-dependent bias induced by the progressive adoption of EHR functionalities, thus complementing other works that focused on the overall drifts or shifts of data distributions.[9, 10, 25] As discussed by Finlayson et al., many mechanisms can cause temporal variations, and we emphasize that this work addresses only one of them.[17]

The CSO method is not a panacea. On the one hand, its impact depends strongly on the time span examined. While too long a period leads to a massive discarding of data, too short a period can severely limit the scientific relevance of a longitudinal study. Therefore, an optimal trade-off should be found. Furthermore, this method is only useful when temporal drifts of statistical distributions are detrimental to the objective of the study. For example, it is not appropriate if the goal is instead to maximize the number of events/outcomes/patients detected while withstanding time-dependent biases. On the other hand, omitting data sources is not always harmless, as it may cause a selection bias by altering the population under study, e.g., omitting hospital units with patients with different severity. Consequently, the interpretation of results obtained using this strategy should remain cautious.

In addition to the specific advantages and limitations of the CSO method, this study illustrates some of the organizational challenges associated with real-world data platforms. Due to privacy concerns, investigators only have access to minimized cohorts of patients extracted from the full database which contains millions of records. However, the application of the analysis workflow presented in this study requires background information such as completeness estimated on the overall database. Such indicators should therefore be precomputed and provided to investigators in addition to patient-level data, requiring close coordination between platform operators and research teams. To address this challenge, we have structured the computer code of this project as an open source library that can be extended by investigators while being applied to the entire database by platform operators (see eFigure5 for the proposed workflow).

Our study has several limitations. First, we used a single, highly simplified definition of completeness. In some cases, it could be refined to match a more intuitive definition or a more elaborate definition of plausibility.[28] Second, we used a crude modeling of EHR adoption that relied on the detection of abrupt changes in data availability. The actual dynamics of a hospital’s information system exhibit diverse behaviors that may induce more complex patterns in statistical distributions.[39] Third, we considered highly simplified quality and epidemiological indicators. As such, the indicators we calculated are not directly applicable to epidemiological surveillance or quality monitoring. In particular, it seems important to further characterize the patients included in the analysis in order to be able to compare facilities with similar case mix.[32] Fourth, our analysis focused on some administrative and clinical data categories that do not cover the wide variety of data found in a CDW. It should therefore be expanded to support a larger range of studies conducted on real-world data.

6 Conclusion

EHR studies require working with data collected by a constantly evolving system. The current pace of technological innovation is unlikely to slow anytime soon, and this complexity will continue to increase. By focusing on a specific mechanism, the adoption of new EHR functionalities, we have shown that various metadata can be used to meaningfully analyze EHR data at scale. Moreover, automating the computation of such metadata was critical to avoid an explosion in data management burden. Our work is a step in the direction of developing tools and methods to address these challenges. Much work remains to be done, particularly to fully integrate information about technological change into the statistical design of studies.

Competing interests

No competing interest is declared.

Conflict of interest

No conflict of interest is declared.

Author contributions statement

A.R., S.V and R.B. had full access to all the data in the study. They take responsibility for the integrity of the data and the accuracy of the data analysis.

All authors designed the study.
A.R and R.B. drafted the manuscript.
All authors interpreted data and made critical intellectual revisions of the manuscript.
R.B. did the literature review.
A.R., B.P., P.B., S.V. developed the algorithms.
R.B. supervised the project.

Data availibility

Access to the clinical data warehouse’s raw data can be granted following the process described on its website: www.eds.aphp.fr. A prior validation of the access by the local institutional review board is required. In the case of non-AP-HP researchers, the signature of a collaboration contract is mandatory.

Code availibility

The analyses of this article make extensive use of EDS-TeVa, a library developed in the context of AP-HP’s clinical data warehouse to automate the computation of indicators. It has been made publicly available under an open source license (BSD 3-clause): https://github.com/aphp/edsteva. A technical documentation of the library has been published to facilitate its use: https://aphp.github.io/edsteva/latest/. Moreover, the code developed to run the experiments of this study has also been made available freely in a separate github repository.[40]

Fundings

This study has been supported by grants from the AP-HP Foundation.

Role of the funder/sponsor

The funder was involved neither during the design and conduct of the study nor during the preparation, submission or review of the manuscript.

Supplementary Materials

A Details on the modeling of EHR adoption

We used two alternative models of the adoption of elcetronic health record (EHR) functionalities: either a Step function model, which takes into account only the adoption of a functionality (most analyses, see Figure 2 in the main article), or a rectangular function model, which also takes into account the end of use of a functionality in a healthcare unit (complementary analyses related to the collection of intensive care unit -ICU-records, see Figure S1). This second modeling seems to be particularly appropriate when working at the unit level, since units are constantly appearing and disappearing due to hospital reorganizations.

Figure S1:

Completeness estimates of intensive care unit records (plain curve) considering two arbitrarily selected units (colors) along with their modeling (dashed lines) using either step functions (top) or rectangular functions (down). Completeness estimate is defined as the monthly number of data points divided by its maximum value during the study period.

A.1 Step function model

We used the following error definition to measure the distance between completeness estimates, c(t), and their modelings by step functions (c₀ and t₀ parameters): with t_max the end of the study period, i.e., May 2022. We underline that this definition leads to high values of error for healthcare sites that disappeared before the end of the study’s period.

A.2 Rectangular function model

We modified the fitted parameters and the definition of error when using rectangular functions to model EHR usage. In that case, in addition to c₀ and t₀ parameters the fit moreover provides t₁, the end of functionality usage, and the error function is replaced by:

B Goodness-of-fit of the Step function modeling

In the analyses presented in this article, we have not yet assessed the goodness-of-fit of our modeling. Figure S2 shows the dispersion of completeness estimates around the fitted Step function curve for each of the six data categories. Although the dispersion is not negligible, the pattern of an abrupt increase in data availability appears robust. Nevertheless, these curves show that the average value of the completeness estimate decreases after EHR adoption in the case of ICU records, consistent with the need to use rectangular functions as discussed earlier. For the sake of simplicity, we chose to remove healthcare sites with a low mean value of completeness after the estimated adoption date (c₀ ≤ 0.15), which represent about 2% of the total number of administrative records, because their normalized completeness estimates would have spiked and made the figure unreadable.

Figure S2:

Completeness estimates centered on their adoption date t₀ and normalized by their stabilized value c₀, from top to bottom for each one of the six EHR functionalities and shown as their average (plain curve) and standard deviation (shaded area). The reference Step function is shown as dashed curves.

C The special case of ICU-readmission

Since the model of ICU records is different from the other EHR functionalities, the analysis of 30-day ICU readmissions was treated separately. Figure S3 shows the temporal variation of the quality indicator using either the naive or the CSO method and choosing three different starting dates t_init. The naive method resulted in a monotonous increase, while the Step function CSO method resulted in a monotonous decrease of the indicator. However, the rectangular function CSO, which also accounts for unit disappearance, discarded many small units and resulted in a stabilized indicator.

Figure S3:

Combined effect of varying the initial observation date (t_init) and the analysis (naive in blue and complete-source-only in orange) on the study of 30-day readmission in intensive care units. Two modelings of the adoption of the EHR functionality used to collect intensive care units records were compared, using either a step function (up) or a rectangular function (down). The linear modeling of temporal variations is indicated by dashed lines. Insets indicate the number of healthcare sites selected in the complete-source-only analysis, along with the proportion they represent of healthcare sites with at least one data point collected by the EHR functionality during the study: January 2013 - May 2022.

Figure S4 provides a sensitivity analysis of the slope α₁ and the mean of the indicator for the 28 hospitals used for the cohort selection. Similar to the observations of Figure S3, it shows a positive slope for the naive analysis for older initial data, a negative slope for the step function CSO method, and no slope for the rectangular function CSO method. This sensitivity analysis also indicates that both CSO methods reduce the amplitude of the indicator.

Figure S4:

Left: slopes α₁ of the linear model (Eq 1) estimated either on all the hospitals (diamond) or considering separately each one of the hospitals (IQR is the box, min and max are the lower and upper whiskers respectively), adopting either the naive (blue) or the complete-source-only (orange) method. Right: values of the indicators averaged over the study period. Two modelings of the adoption of intensive care units records were compared, using either a step function (up) or a rectangular function (down). Three different initial dates were considered.

D Additional results

Tables S1 to S6 shows the detailed results of the modeling of quality and epidemiological indicators.

View this table:

Table S1: Parameters resulting from the modeling of the temporal variations of the 30-day rehospitalization (origin α₀ and slope α₁, see Eq 1 in the main article) along with the number and proportion of considered hospitals for each method (naive and complete-source-only) and each start date.

View this table:

Table S2: Parameters resulting from the modeling of the temporal variations of the 30-day emergency departments consultation (origin α₀ and slope α₁, see Eq 1 in the main article) along with the number and proportion of considered hospitals for each method (naive and complete-source-only) and each start date.

View this table:

Table S3: Parameters resulting from the modeling of the temporal variations of the 30-day consultation (origin α₀ and slope α₁, see Eq 1 in the main article) along with the number and proportion of considered departments for each method (naive and complete-source-only) and each start date.

View this table:

Table S4: Parameters resulting from the modeling of the temporal variations of the discharge prescription (origin α₀ and slope α₁, see Eq 1 in the main article) along with the number and proportion of considered departments for each method (naive and complete-source-only) and each start date.

View this table:

Table S5: Parameters resulting from the modeling of the temporal variations of the 30-day intensive care units readmission (origin α₀ and slope α₁, see Eq 1 in the main article) along with the number and proportion of considered units for each method (naive, step function complete-source-only and rectangular function complete-source-only) and each start date.

View this table:

Table S6: Number of departments considered for the computation of the epidemiological indicators for each method and each start date.

E Delivery of indicators in the context of a clinical data warehouse

The indicators used in this study’s analyses were computed on the entire database of the clinical data warehouse, to which access is never granted to investigators that are not members of the IT department in charge of the platform operation (i.e., data custodian) in order to protect patient privacy (data minimization principle). In fact, investigators analyze data extracts related to specific cohorts that are provided by data custodians in isolated and secure environments (see Figure S5). The provision of pre-computed indicators describing EHR adoption, in addition to patient-level data, therefore appears necessary for investigators to use the complete-source-only method.

Figure S5:

Data flows in the context of a clinical data warehouse. Blue: from left to right, patient-level data collected in clinical softwares are integrated and standardized by data custodians and data relative to cohorts of patients are extracted to be analyzed by investigators in per-project isolated environments. Red: in order to adjust for the progressive EHR adoption, dedicated indicators are computed either by investigators on their cohorts or by data custodians on the total dataset. Data custodians’ and investigators’ data accesses are represented by dashed red boxes.

F The RECORD’s statement checklist

View this table:

Acknowledgments

We thank C. Taille, C. Chau and C. Baudoin for fruitful conversations, T. Petit-Jean for code proofreading and the clinical data warehouse of the Greater Paris University Hospitals for its support and the realization of data management and data curation tasks.

References

[1].↵
Joan A. Casey, Brian S. Schwartz, Walter F. Stewart, and Nancy E. Adler. Using Electronic Health Records for Population Health Research: A Review of Methods and Applications. Annu. Rev. Public Health, 37(1):61–81, March 2016.
OpenUrl CrossRef PubMed
[2].
Henry J Lowe, Todd A Ferris, Penni M Hernandez Nd, and Susan C Weber. STRIDE - An Integrated Standards-Based Translational Research Informatics Platform. page 5, 2009.
[3].
S. N. Murphy, G. Weber, M. Mendis, et al. Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2). Journal of the American Medical Informatics Association, 17(2):124–130, March 2010.
OpenUrl CrossRef PubMed
[4].↵
Somalee Datta, Jose Posada, Garrick Olson, et al. A new paradigm for accelerating clinical data science at Stanford Medicine. page 44, 2020.
[5].↵
Isaac S Kohane, Bruce J Aronow, Paul Avillach, et al. What Every Reader Should Know About Studies Using Electronic Health Record Data but May Be Afraid to Ask. J Med Internet Res, 23(3):e22219. March 2021.
OpenUrl PubMed
[6].
William R. Hersh, Mark G. Weiner, Peter J. Embi, et al. Caveats for the Use of Operational Electronic Health Record Data in Comparative Effectiveness Research. Medical Care, 51(Supplement 8Suppl 3):S30–S37, August 2013.
[7].
Denis Agniel, Isaac S Kohane, and Griffin M Weber. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study. BMJ, page k1479, April 2018.
[8].↵
Anis Sharafoddini, Joel A Dubin, David M Maslove, and Joon Lee. A New Insight Into Missing Data in Intensive Care Unit Patient Profiles: Observational Study. JMIR Med Inform, 7(1):e11605. January 2019.
OpenUrl
[9].↵
Carlos Sáez, Pedro Pereira Rodrigues, João Gama, et al. Probabilistic change detection and visualization methods for the assessment of temporal stability in biomedical data quality. Data Min Knowl Disc, 29(4):950–975, July 2015.
OpenUrl
[10].↵
Francisco Javier Pérez-Benito, Carlos Sáez, J. Alberto Conejero, et al. Temporal variability analysis reveals biases in electronic health records due to hospital process reengineering interventions over seven years. PLoS ONE, 14(8):e0220369. August 2019.
OpenUrl
[11].↵
Nicole G. Weiskopf, George Hripcsak, Sushmita Swaminathan, and Chunhua Weng. Defining and measuring completeness of electronic health records for secondary use. Journal of Biomedical Informatics, 46(5):830–836, October 2013.
OpenUrl CrossRef PubMed
[12].
Jiang Bian, Tianchen Lyu, Alexander Loiacono, et al. Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data. Journal of the American Medical Informatics Association, 27(12):1999–2010, December 2020.
OpenUrl CrossRef
[13].
Michael G. Kahn, Tiffany J. Callahan, Juliana Barnard, et al. A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data. eGEMs, 4(1):18, September 2016.
OpenUrl
[14].
Jonathan H. Chen, Muthuraman Alagappan, Mary K. Goldstein, et al. Decaying relevance of clinical data towards future decisions in data-driven inpatient clinical order sets. International Journal of Medical Informatics, 102:71–79, June 2017.
OpenUrl CrossRef
[15].
Sharon E Davis, Thomas A Lasko, Guanhua Chen, et al. Calibration drift in regression and machine learning models for acute kidney injury. Journal of the American Medical Informatics Association, 24(6):1052–1061, November 2017.
OpenUrl CrossRef PubMed
[16].
Jérôme Dockès, Gaël Varoquaux, and Jean-Baptiste Poline. Preventing dataset shift from breaking machine-learning biomarkers. GigaScience, 10(9):giab055. September 2021.
OpenUrl
[17].↵
Samuel G. Finlayson, Adarsh Subbaswamy, Karandeep Singh, et al. The Clinician and Dataset Shift in Artificial Intelligence. N Engl J Med, 385(3):283–286, July 2021.
OpenUrl CrossRef PubMed
[18].
Joseph Futoma, Morgan Simons, Trishan Panch, et al. The myth of generalisability in clinical research and machine learning in health care. The Lancet Digital Health, 2(9):e489.#x2013;e492, September 2020.
OpenUrl
[19].
Xintong Li, Lana YH Lai, Anna Ostropolets, et al. Bias, Precision and Timeliness of Historical (Background) Rate Comparison Methods for Vaccine Safety Monitoring: An Empirical Multi-Database Analysis. Front. Pharmacol., 12:773875, November 2021.
OpenUrl CrossRef PubMed
[20].
Siaw-Teng Liaw, Jason Guan Nan Guo, Sameera Ansari, et al. Quality assessment of real-world data repositories across the data life cycle: A literature review. Journal of the American Medical Informatics Association, 28(7):1591–1599, July 2021.
OpenUrl PubMed
[21].
Vincent Looten, Liliane Kong Win Chang, Antoine Neuraz, et al. What can millions of laboratory test results tell us about the temporal aspect of data quality? Study of data spanning 17 years in a clinical data warehouse. Computer Methods and Programs in Biomedicine, 181:104825, November 2019.
OpenUrl
[22].↵
Patrick Rockenschaub, Vincent Nguyen, Robert W Aldridge, et al. Data-driven discovery of changes in clinical code usage over time: a case-study on changes in cardiovascular disease recording in two English electronic health records databases (2001–2015). BMJ Open, 10(2):e034396. February 2020.
OpenUrl PubMed
[23].↵
Grégoire Rey, Albertine Aouba, Gérard Pavillon, et al. Cause-specific mortality time series analysis: a general method to detect and correct for abrupt data production changes. Popul Health Metrics, 9(1):52, December 2011.
OpenUrl
[24].
Carlos Sáez, Oscar Zurriaga, Jordi Pérez-Panadés, et al. Applying probabilistic temporal and multisite data quality control methods to a public health mortality registry in Spain: a systematic approach to quality control of repositories. Journal of the American Medical Informatics Association, 23(6):1085–1095, November 2016.
OpenUrl CrossRef PubMed
[25].↵
Carlos Sáez, Alba Gutiérrez-Sacristán, Isaac Kohane, et al. EHRtemporalVariability: delineating temporal data-set shifts in electronic health records. GigaScience, 9(8):giaa079. August 2020.
OpenUrl
[26].↵
Simona Bottani, Ninon Burgos, Aurélien Maire, et al. Automatic quality control of brain T1-weighted magnetic resonance images for a clinical data warehouse. Medical Image Analysis, 75:102219, January 2022.
OpenUrl
[27].↵
Brett K Beaulieu-Jones, Daniel R Lavage, John W Snyder, et al. Characterizing and Managing Missing Structured Data in Electronic Health Records: Data Analysis. JMIR Med Inform, 6(1):e11. February 2018.
OpenUrl
[28].↵
Hui Wang, Ilana Belitskaya-Levy, Fan Wu, et al. A statistical quality assessment method for longitudinal observations in electronic health record data with an application to the VA million veteran program. BMC Med Inform Decis Mak, 21(1):289, December 2021.
OpenUrl
[29].↵
Júlio Souza, Ismael Caballero, João Vasco Santos, et al. Multisource and temporal variability in Portuguese hospital administrative datasets: Data quality implications. Journal of Biomedical Informatics, 136:104242, December 2022.
OpenUrl
[30].↵
Eric I. Benchimol, Liam Smeeth, Astrid Guttmann, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) Statement. PLoS Med, 12(10):e1001885. October 2015.
OpenUrl CrossRef PubMed
[31].↵
George Hripcsak, Jon D Duke, Nigam H Shah, et al. Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers. 2015.
[32].↵
Claudia Fischer, Hester F. Lingsma, Perla J. Marang-van de Mheen, et al. Is the Readmission Rate a Valid Quality Indicator? A Review of the Evidence. PLoS ONE, 9(11):e112282. November 2014.
OpenUrl CrossRef PubMed
[33].
Qian Gu, Lane Koenig, Jennifer Faerberg, et al. The Medicare Hospital Readmissions Reduction Program: Potential Unintended Consequences for Hospitals Serving Vulnerable Populations. Health Serv Res, 49(3):818– 837, June 2014.
OpenUrl CrossRef PubMed
[34].
Christopher A. Beadles, Alan R. Ellis, Jesse C. Lichstein, et al. First Outpatient Follow-Up After Psychiatric Hospitalization: Does One Size Fit All? PS, 66(4):364–372, April 2015.
OpenUrl
[35].
Camille Windsor, Camille Hua, Quentin De Roux, et al. Healthcare trajectory of critically ill patients with necrotizing soft tissue infections: a multicenter retrospective cohort study using the clinical data warehouse of Greater Paris University Hospitals. Ann. Intensive Care, 12(1):115, December 2022.
OpenUrl
[36].
A.-L. Soilly, C. Ferdynus, O. Desplanches, et al. Paediatric intensive care admissions for respiratory syncytial virus bronchiolitis in France: results of a retrospective survey and evaluation of the validity of a medical information system programme. Epidemiol. Infect., 140(4):608–616, April 2012.
OpenUrl PubMed
[37].↵
Skipper Seabold and Josef Perktold. Statsmodels: Econometric and Statistical Modeling with Python. pages 92–96, Austin, Texas, 2010.
[38].↵
Adam REMAKI, J Charline, Vincent M, and svittoz. aphp/edsteva: v0.2.4, July 2023.
[39].↵
Samaneh Aminikhanghahi and Diane J. Cook. A survey of methods for time series change point detection. Knowl Inf Syst, 51(2):339–367, May 2017.
OpenUrl
[40].↵
Adam REMAKI. aphp-datascience/cse_210033: v1.0.0, August 2023.

View the discussion thread.

Posted August 21, 2023.

Download PDF

Citation Tools

Subject Area

Health Informatics

Subject Areas

All Articles

Addiction Medicine (412)
Allergy and Immunology (726)
Anesthesia (214)
Cardiovascular Medicine (3107)
Dentistry and Oral Medicine (349)
Dermatology (263)
Emergency Medicine (463)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1100)
Epidemiology (13046)
Forensic Medicine (13)
Gastroenterology (862)
Genetic and Genomic Medicine (4866)
Geriatric Medicine (449)
Health Economics (751)
Health Informatics (3068)
Health Policy (1108)
Health Systems and Quality Improvement (1135)
Hematology (410)
HIV/AIDS (962)
Infectious Diseases (except HIV/AIDS) (14351)
Intensive Care and Critical Care Medicine (885)
Medical Education (453)
Medical Ethics (120)
Nephrology (502)
Neurology (4631)
Nursing (247)
Nutrition (689)
Obstetrics and Gynecology (847)
Occupational and Environmental Health (764)
Oncology (2393)
Ophthalmology (677)
Orthopedics (270)
Otolaryngology (333)
Pain Medicine (306)
Palliative Medicine (88)
Pathology (516)
Pediatrics (1243)
Pharmacology and Therapeutics (521)
Primary Care Research (522)
Psychiatry and Clinical Psychology (3976)
Public and Global Health (7201)
Radiology and Imaging (1606)
Rehabilitation Medicine and Physical Therapy (958)
Respiratory Medicine (944)
Rheumatology (460)
Sexual and Reproductive Health (478)
Sports Medicine (403)
Surgery (514)
Toxicology (65)
Transplantation (222)
Urology (190)

[1] [1].↵
Joan A. Casey, Brian S. Schwartz, Walter F. Stewart, and Nancy E. Adler. Using Electronic Health Records for Population Health Research: A Review of Methods and Applications. Annu. Rev. Public Health, 37(1):61–81, March 2016.
OpenUrl CrossRef PubMed

[2] [2].
Henry J Lowe, Todd A Ferris, Penni M Hernandez Nd, and Susan C Weber. STRIDE - An Integrated Standards-Based Translational Research Informatics Platform. page 5, 2009.

[3] [3].
S. N. Murphy, G. Weber, M. Mendis, et al. Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2). Journal of the American Medical Informatics Association, 17(2):124–130, March 2010.
OpenUrl CrossRef PubMed

[4] [4].↵
Somalee Datta, Jose Posada, Garrick Olson, et al. A new paradigm for accelerating clinical data science at Stanford Medicine. page 44, 2020.

[5] [5].↵
Isaac S Kohane, Bruce J Aronow, Paul Avillach, et al. What Every Reader Should Know About Studies Using Electronic Health Record Data but May Be Afraid to Ask. J Med Internet Res, 23(3):e22219. March 2021.
OpenUrl PubMed

[6] [6].
William R. Hersh, Mark G. Weiner, Peter J. Embi, et al. Caveats for the Use of Operational Electronic Health Record Data in Comparative Effectiveness Research. Medical Care, 51(Supplement 8Suppl 3):S30–S37, August 2013.

[7] [7].
Denis Agniel, Isaac S Kohane, and Griffin M Weber. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study. BMJ, page k1479, April 2018.

[8] [8].↵
Anis Sharafoddini, Joel A Dubin, David M Maslove, and Joon Lee. A New Insight Into Missing Data in Intensive Care Unit Patient Profiles: Observational Study. JMIR Med Inform, 7(1):e11605. January 2019.
OpenUrl

[9] [9].↵
Carlos Sáez, Pedro Pereira Rodrigues, João Gama, et al. Probabilistic change detection and visualization methods for the assessment of temporal stability in biomedical data quality. Data Min Knowl Disc, 29(4):950–975, July 2015.
OpenUrl

[10] [10].↵
Francisco Javier Pérez-Benito, Carlos Sáez, J. Alberto Conejero, et al. Temporal variability analysis reveals biases in electronic health records due to hospital process reengineering interventions over seven years. PLoS ONE, 14(8):e0220369. August 2019.
OpenUrl

[11] [11].↵
Nicole G. Weiskopf, George Hripcsak, Sushmita Swaminathan, and Chunhua Weng. Defining and measuring completeness of electronic health records for secondary use. Journal of Biomedical Informatics, 46(5):830–836, October 2013.
OpenUrl CrossRef PubMed

[12] [12].
Jiang Bian, Tianchen Lyu, Alexander Loiacono, et al. Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data. Journal of the American Medical Informatics Association, 27(12):1999–2010, December 2020.
OpenUrl CrossRef

[13] [13].
Michael G. Kahn, Tiffany J. Callahan, Juliana Barnard, et al. A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data. eGEMs, 4(1):18, September 2016.
OpenUrl

[14] [14].
Jonathan H. Chen, Muthuraman Alagappan, Mary K. Goldstein, et al. Decaying relevance of clinical data towards future decisions in data-driven inpatient clinical order sets. International Journal of Medical Informatics, 102:71–79, June 2017.
OpenUrl CrossRef

[15] [15].
Sharon E Davis, Thomas A Lasko, Guanhua Chen, et al. Calibration drift in regression and machine learning models for acute kidney injury. Journal of the American Medical Informatics Association, 24(6):1052–1061, November 2017.
OpenUrl CrossRef PubMed

[16] [16].
Jérôme Dockès, Gaël Varoquaux, and Jean-Baptiste Poline. Preventing dataset shift from breaking machine-learning biomarkers. GigaScience, 10(9):giab055. September 2021.
OpenUrl

[17] [17].↵
Samuel G. Finlayson, Adarsh Subbaswamy, Karandeep Singh, et al. The Clinician and Dataset Shift in Artificial Intelligence. N Engl J Med, 385(3):283–286, July 2021.
OpenUrl CrossRef PubMed

[18] [18].
Joseph Futoma, Morgan Simons, Trishan Panch, et al. The myth of generalisability in clinical research and machine learning in health care. The Lancet Digital Health, 2(9):e489.#x2013;e492, September 2020.
OpenUrl

[19] [19].
Xintong Li, Lana YH Lai, Anna Ostropolets, et al. Bias, Precision and Timeliness of Historical (Background) Rate Comparison Methods for Vaccine Safety Monitoring: An Empirical Multi-Database Analysis. Front. Pharmacol., 12:773875, November 2021.
OpenUrl CrossRef PubMed

[20] [20].
Siaw-Teng Liaw, Jason Guan Nan Guo, Sameera Ansari, et al. Quality assessment of real-world data repositories across the data life cycle: A literature review. Journal of the American Medical Informatics Association, 28(7):1591–1599, July 2021.
OpenUrl PubMed

[21] [21].
Vincent Looten, Liliane Kong Win Chang, Antoine Neuraz, et al. What can millions of laboratory test results tell us about the temporal aspect of data quality? Study of data spanning 17 years in a clinical data warehouse. Computer Methods and Programs in Biomedicine, 181:104825, November 2019.
OpenUrl

[22] [22].↵
Patrick Rockenschaub, Vincent Nguyen, Robert W Aldridge, et al. Data-driven discovery of changes in clinical code usage over time: a case-study on changes in cardiovascular disease recording in two English electronic health records databases (2001–2015). BMJ Open, 10(2):e034396. February 2020.
OpenUrl PubMed

[23] [23].↵
Grégoire Rey, Albertine Aouba, Gérard Pavillon, et al. Cause-specific mortality time series analysis: a general method to detect and correct for abrupt data production changes. Popul Health Metrics, 9(1):52, December 2011.
OpenUrl

[24] [24].
Carlos Sáez, Oscar Zurriaga, Jordi Pérez-Panadés, et al. Applying probabilistic temporal and multisite data quality control methods to a public health mortality registry in Spain: a systematic approach to quality control of repositories. Journal of the American Medical Informatics Association, 23(6):1085–1095, November 2016.
OpenUrl CrossRef PubMed

[25] [25].↵
Carlos Sáez, Alba Gutiérrez-Sacristán, Isaac Kohane, et al. EHRtemporalVariability: delineating temporal data-set shifts in electronic health records. GigaScience, 9(8):giaa079. August 2020.
OpenUrl

[26] [26].↵
Simona Bottani, Ninon Burgos, Aurélien Maire, et al. Automatic quality control of brain T1-weighted magnetic resonance images for a clinical data warehouse. Medical Image Analysis, 75:102219, January 2022.
OpenUrl

[27] [27].↵
Brett K Beaulieu-Jones, Daniel R Lavage, John W Snyder, et al. Characterizing and Managing Missing Structured Data in Electronic Health Records: Data Analysis. JMIR Med Inform, 6(1):e11. February 2018.
OpenUrl

[28] [28].↵
Hui Wang, Ilana Belitskaya-Levy, Fan Wu, et al. A statistical quality assessment method for longitudinal observations in electronic health record data with an application to the VA million veteran program. BMC Med Inform Decis Mak, 21(1):289, December 2021.
OpenUrl

[29] [29].↵
Júlio Souza, Ismael Caballero, João Vasco Santos, et al. Multisource and temporal variability in Portuguese hospital administrative datasets: Data quality implications. Journal of Biomedical Informatics, 136:104242, December 2022.
OpenUrl

[30] [30].↵
Eric I. Benchimol, Liam Smeeth, Astrid Guttmann, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) Statement. PLoS Med, 12(10):e1001885. October 2015.
OpenUrl CrossRef PubMed

[31] [31].↵
George Hripcsak, Jon D Duke, Nigam H Shah, et al. Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers. 2015.

[32] [32].↵
Claudia Fischer, Hester F. Lingsma, Perla J. Marang-van de Mheen, et al. Is the Readmission Rate a Valid Quality Indicator? A Review of the Evidence. PLoS ONE, 9(11):e112282. November 2014.
OpenUrl CrossRef PubMed

[33] [33].
Qian Gu, Lane Koenig, Jennifer Faerberg, et al. The Medicare Hospital Readmissions Reduction Program: Potential Unintended Consequences for Hospitals Serving Vulnerable Populations. Health Serv Res, 49(3):818– 837, June 2014.
OpenUrl CrossRef PubMed

[34] [34].
Christopher A. Beadles, Alan R. Ellis, Jesse C. Lichstein, et al. First Outpatient Follow-Up After Psychiatric Hospitalization: Does One Size Fit All? PS, 66(4):364–372, April 2015.
OpenUrl

[35] [35].
Camille Windsor, Camille Hua, Quentin De Roux, et al. Healthcare trajectory of critically ill patients with necrotizing soft tissue infections: a multicenter retrospective cohort study using the clinical data warehouse of Greater Paris University Hospitals. Ann. Intensive Care, 12(1):115, December 2022.
OpenUrl

[36] [36].
A.-L. Soilly, C. Ferdynus, O. Desplanches, et al. Paediatric intensive care admissions for respiratory syncytial virus bronchiolitis in France: results of a retrospective survey and evaluation of the validity of a medical information system programme. Epidemiol. Infect., 140(4):608–616, April 2012.
OpenUrl PubMed

[37] [37].↵
Skipper Seabold and Josef Perktold. Statsmodels: Econometric and Statistical Modeling with Python. pages 92–96, Austin, Texas, 2010.

[38] [38].↵
Adam REMAKI, J Charline, Vincent M, and svittoz. aphp/edsteva: v0.2.4, July 2023.

[39] [39].↵
Samaneh Aminikhanghahi and Diane J. Cook. A survey of methods for time series change point detection. Knowl Inf Syst, 51(2):339–367, May 2017.
OpenUrl

[40] [40].↵
Adam REMAKI. aphp-datascience/cse_210033: v1.0.0, August 2023.

Adjusting for the progressive digitization of health records: working examples on a multi-hospital clinical data warehouse

Abstract

1 Background and Significance

2 Objective

3 Materials and Methods

3.1 Data source

3.2 Modeling of the EHR adoption

3.3 Quality and epidemiological indicators

3.4 Statistical analysis

4 Results

4.1 Modeling of the EHR adoption

4.2 Quality indicators

4.3 Epidemiological indicators

5 Discussion

6 Conclusion

Competing interests

Conflict of interest

Author contributions statement

Data availibility

Code availibility

Fundings

Role of the funder/sponsor

Supplementary Materials

A Details on the modeling of EHR adoption

A.1 Step function model

A.2 Rectangular function model

B Goodness-of-fit of the Step function modeling

C The special case of ICU-readmission

D Additional results

E Delivery of indicators in the context of a clinical data warehouse

F The RECORD’s statement checklist

Acknowledgments

References

Citation Manager Formats

Subject Area