Abstract
Objective To develop a vocal biomarker for fatigue monitoring in people with COVID-19.
Design Prospective cohort study.
Setting Predi-COVID data between May 2020 and May 2021.
Participants A total of 1772 voice recordings was used to train an AI-based algorithm to predict fatigue, stratified by gender and smartphone’s operating system (Android/iOS). The recordings were collected from 296 participants tracked for two weeks following SARS-CoV-2 infection.
primary and secondary outcome measures Four machine learning algorithms (Logistic regression, k-nearest neighbors, support vector machine, and soft voting classifier) were used to train and derive the fatigue vocal biomarker. A t-test was used to evaluate the distribution of the vocal biomarker between the two classes (Fatigue and No fatigue).
Results The final study population included 56% of women and had a mean (±SD) age of 40 (±13) years. Women were more likely to report fatigue (P<.001). We developed four models for Android female, Android male, iOS female, and iOS male users with a weighted AUC of 79%, 85%, 86%, 82%, and a mean Brier Score of 0.15, 0.12, 0.17, 0.12, respectively. The vocal biomarker derived from the prediction models successfully discriminated COVID-19 participants with and without fatigue (t-test P<.001).
Conclusions This study demonstrates the feasibility of identifying and remotely monitoring fatigue thanks to voice. Vocal biomarkers, digitally integrated into telemedicine technologies, are expected to improve the monitoring of people with COVID-19 or Long-COVID.
Competing Interest Statement
The authors have declared no competing interest.
Clinical Trial
NCT04380987
Clinical Protocols
https://bmjopen.bmj.com/content/10/11/e041834
Funding Statement
This study was funded by the Luxembourg National Research Fund (FNR) (Predi-COVID, grant number 14716273), the Andre Losch Foundation, and the Luxembourg Institute of Health.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The National Research Ethics Committee of Luxembourg (study number 202003/07) gave a favorable opinion to the study in April 2020.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
ClinicalTrials.gov Identifier: NCT04380987
Data Availability
All data produced are available online at https://zenodo.org/record/5937844#.Yhi2oejMJPY
https://github.com/LIHVOICE/Predi_COVID_Fatigue_Vocal_Biomarker