PT - JOURNAL ARTICLE
AU - Fiandrino, Stefania
AU - Doná, Daniele
AU - Giaquinto, Carlo
AU - Poletti, Piero
AU - Tira, Micheal Davis
AU - Di Chiara, Costanza
AU - Paolotti, Daniela
TI - Exploring clinical characteristics of COVID-19 in children and adolescents using a machine-learning approach
AID - 10.1101/2024.12.04.24318465
DP - 2024 Jan 01
TA - medRxiv
PG - 2024.12.04.24318465
4099 - http://medrxiv.org/content/early/2024/12/06/2024.12.04.24318465.short
4100 - http://medrxiv.org/content/early/2024/12/06/2024.12.04.24318465.full
AB - Introduction: The epidemiology and clinical characteristics of COVID-19 evolved due to new SARS-CoV-2 variants of concern (VOCs). The Omicron VOC’s higher transmissibility increased pediatric COVID-19 cases and hospital admissions. Most research during the Omicron period has focused on hospitalized cases, leaving a gap in understanding the disease’s evolution in community settings. This study targets children with mild to moderate COVID-19 during pre-Omicron and Omicron periods. It aims to identify patterns in COVID-19 morbidity by clustering individuals based on symptom similarities and duration of symptoms, and to develop a machine-learning tool to classify new cases into risk groups. Methods: We propose a data-driven approach to explore changes in COVID-19 characteristics by analyzing data collected within a pediatric cohort at the University Hospital of Padua. First, we apply an unsupervised machine-learning algorithm to cluster individuals into different groups. Second, we classify new patients into risk groups using a Random-Forest classifier model based on sociodemographic information, pre-existing medical conditions, vaccination status, and the VOC as predictive variables. Third, we explore the key features influencing the classification. Results: The unsupervised clustering identified three severity risk profile groups.
The classification model effectively distinguished these groups, with age, gender, COVID-19 vaccination, VOC, and presence of comorbidities as top predictive features. A high number and longer duration of symptoms were associated with younger age groups, males, unvaccinated individuals, Omicron infections, and those with comorbidities. These results are consistent with evidence of severe COVID-19 in infants, older children with comorbidities, and unvaccinated children. Conclusion: Our classification model has the potential to provide clinicians with insights into the children’s risk profile of COVID-19 using readily available data. This approach can support public health efforts by clarifying disease burden and improving patient care strategies. Furthermore, it underscores the importance of integrating risk classification models to monitor and manage infectious diseases. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: This work is part of the VERDI project (101045989), which is funded by the European Union. Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Health and Digital Executive Agency. Neither the European Union nor the granting authority can be held responsible for them. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: The study protocol was reviewed and approved by the Ethics Committee of the University Hospital of Padova, Italy (Prot. Nr. 0070714 of November 24th, 2020; last amendment Prot. Nr. 0024018 of April 5th, 2022).
Parents/legally authorized representatives were informed of the research proposal and provided written consent to participate in the study and use the collected patient data. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes. Data are available from the corresponding author upon reasonable request.
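
The first step described in the Methods, grouping individuals by number and duration of symptoms, can be illustrated with a minimal sketch. The abstract does not name the unsupervised algorithm, so plain k-means with a deterministic farthest-first initialization is used here purely as an assumption; the function names (`farthest_first`, `kmeans`) and the `toy_cases` values are hypothetical and are not study data. Only `k=3` is taken from the record, which reports three severity risk profile groups.

```python
from math import dist  # Euclidean distance, Python 3.8+


def farthest_first(points, k):
    """Deterministic seeding: start at the first point, then repeatedly
    add the point farthest from all centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(dist(p, c) for c in centroids))
        )
    return centroids


def kmeans(points, k, n_iter=100):
    """Plain Lloyd's k-means. Returns (centroids, labels)."""
    centroids = farthest_first(points, k)
    labels = None
    for _ in range(n_iter):
        # Assignment step: index of the nearest centroid for each point.
        new_labels = [
            min(range(k), key=lambda j: dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:  # assignments stable, so we have converged
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(
                    sum(col) / len(members) for col in zip(*members)
                )
    return centroids, labels


# HYPOTHETICAL toy values, not study data:
# (number of symptoms, symptom duration in days) per child.
toy_cases = [
    (1, 1), (1, 2), (2, 1), (2, 2),        # few, short-lived symptoms
    (4, 5), (5, 4), (5, 6), (4, 6),        # intermediate
    (8, 12), (9, 14), (8, 15), (9, 13),    # many, long-lasting symptoms
]

centroids, labels = kmeans(toy_cases, k=3)
for center, size in zip(centroids, (labels.count(j) for j in range(3))):
    print(f"centroid={center}, n={size}")
```

In a real analysis the features would normally be standardized before clustering, and the resulting cluster labels would then serve as the target for the Random-Forest classifier that the record describes as the second step.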