PT - JOURNAL ARTICLE AU - Jacqueline K. Kueper AU - Jennifer Rayner AU - Merrick Zwarenstein AU - Daniel J. Lizotte TI - Describing a complex primary health care population in a learning health system to support future decision support and artificial intelligence initiatives AID - 10.1101/2022.03.01.22271714 DP - 2022 Jan 01 TA - medRxiv PG - 2022.03.01.22271714 4099 - http://medrxiv.org/content/early/2022/04/22/2022.03.01.22271714.short 4100 - http://medrxiv.org/content/early/2022/04/22/2022.03.01.22271714.full AB - Introduction Learning health systems (LHS) use data to improve care. Descriptive epidemiology to reveal health states and needs of the LHS population is essential for informing LHS initiatives, including development of decision support tools. To properly characterize complex populations, both simple statistical and artificial intelligence techniques can be useful. We present the first large-scale description of the population served by one of the first primary care LHS in North America.Objectives Our objective is to describe sociodemographic, clinical, and health care use characteristics of adult primary care clients served by the Alliance for Healthier Communities, which provides team-based primary health care through Community Health Centres (CHCs) across Ontario, Canada.Methods Using electronic health record data from 2009-2019 for all CHCs, we perform table-based summaries for each characteristic; and apply unsupervised leaning techniques to explore patterns of common condition co-occurrence, care provider teams, and care frequency.Results Of the 221,047 eligible clients, those at CHCs that primarily serve those most at risk (homeless, mental health, addictions) tend to have more chronic conditions and social determinants of health, which are also prominent in clients with multimorbidity. Most care is provided by physician and nursing providers, with heterogeneous combinations of other provider types. A subset of clients have many issues addressed within single-visits and there is within- and between-client variability in care frequency. Example methodological considerations learned for future LHS initiatives include the need to carefully consider the level of analysis and associated implications for data quality and target population, heterogeneity in conditions and care characteristics, and non-uniform risk profiles across the care history.Conclusions We demonstrate the use of methods from statistics and artificial intelligence, applied with an epidemiological lens, to provide an overview of a complex primary care population. In addition to substantive findings, we discuss implications for future LHS initiatives.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was supported by the Canadian Institutes of Health Research Canadian Graduate Scholarship-Doctoral.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study was approved by Western University Review Ethics Board project ID 111353.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data underlying this article were securely accessed from the Alliance for Healthier Communities. The data cannot be shared publicly due to their sensitive nature, as agreed upon in the ethics agreement.CHCCommunity Health CentreEHRElectronic Health RecordENCODE-FMElectronic Nomenclature and Classification Of Disorders and Encounters for Family MedicineICD-10International Classification of Disease - Version 10LHSLearning Health SystemNMFnon-negative matrix factorizationPCPrimary CareUARUrban At-Risk