PT - JOURNAL ARTICLE
AU - Blay, Natalia
AU - Carrasco-Ribelles, Lucía A
AU - Farré, Xavier
AU - Iraola-Guzmán, Susana
AU - Danés-Castells, Marc
AU - Violán, Concepción
AU - de Cid, Rafael
TI - Disease prevalence, health-related and socio-demographic factors in the GCAT cohort. A comparison with the general population of Catalonia
AID - 10.1101/2023.09.08.23295239
DP - 2023 Jan 01
TA - medRxiv
PG - 2023.09.08.23295239
4099 - http://medrxiv.org/content/early/2023/09/08/2023.09.08.23295239.short
4100 - http://medrxiv.org/content/early/2023/09/08/2023.09.08.23295239.full
AB - Background: Population-based cohorts play a key role in epidemiological studies. However, volunteer cohorts are known to carry a healthy volunteer bias. This bias must be assessed and characterized before results can be extrapolated to the general population. Here, we assess the bias of the population-based cohort GCAT, encompassing 20 000 adult participants from Catalonia with electronic health record data. The aim of this study is to compare the GCAT cohort with the age-matched Catalan population, to assess its representativeness, and to determine the weights needed to make results generalisable. Methods: Statistical comparisons, using data up to 2019, of multiple variables across the sociodemographic, lifestyle, disease and medication domains were performed by stratified analysis with Fisher’s exact test and the t-test. Electronic health records from Catalonia (SIDIAP) and registers from the statistics institutes of Catalonia (IDESCAT) and Spain (INE) were used to make the comparisons. We generated weights accounting for sociodemographic, lifestyle and multimorbidity factors. Results: The GCAT cohort is enriched in women and younger individuals, with higher socioeconomic status, more health-conscious and healthier in terms of mortality and chronic disease prevalence.
We have shown that this bias can be corrected with weighting techniques, providing a more representative sample of the general population. Conclusions: The application of multidomain weights, encompassing not only sociodemographic aspects but also lifestyle and health-related variables, has effectively diminished the observed bias in disease prevalence estimates within the GCAT cohort. This correction has improved the cohort’s representativeness, bringing it closer to the general population of Catalonia. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: GCAT was funded by Accion de Dinamizacion del ISCIII-MINECO and the Ministry of Health of the Generalitat of Catalunya [ADE 10/00026], and has additional support from the Agencia de Gestio d'Ajuts Universitaris i de Recerca (AGAUR) [SGR 01537] and a Spanish National Grant [PI18/01512]. Xavier Farre is supported by the VEIS project [001-P-001647] (co-funded by the European Regional Development Fund, "A way to build Europe"). The SIDIAP project received a research grant from the Carlos III Institute of Health, Ministry of Economy and Competitiveness (Spain), awarded in 2019 under the Health Strategy Action 2013-2016, within the National Research Programme oriented to Societal Challenges, within the Technical, Scientific and Research National Plan 2013-2016 [PI19/00535], and the PFIS Grant [FI20/00040], co-funded by the European Union's European Regional Development Fund. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: Ethics committee of Germans Trias i Pujol University Hospital gave ethical approval for this work.
Ethics committee of Fundacio Institut Universitari per a la recerca a l'Atencio Primaria de Salut Jordi Gol i Gurina gave ethical approval for this work. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes.