Abstract
Background The COVID-19 pandemic has challenged healthcare systems and research worldwide. Data is collected all over the world and needs to be integrated and made available to other researchers quickly. However, the various heterogeneous information systems that are used in hospitals can result in fragmentation of health data over multiple data ‘silos’ that are not interoperable for analysis. Consequently, clinical observations in hospitalised patients are not prepared to be reused efficiently and timely. There is a need to adapt the research data management in hospitals to make COVID-19 observational patient data machine actionable, i.e. more Findable, Accessible, Interoperable and Reusable (FAIR) for humans and machines. We therefore applied the FAIR principles in the hospital to make patient data more FAIR.
Results In this paper, we present our FAIR approach to transform COVID-19 observational patient data collected in the hospital into machine actionable digital objects to answer medical doctors’ research questions. With this objective, we conducted a coordinated FAIRification among stakeholders based on ontological models for data and metadata, and a FAIR based architecture that complements the existing data management. We applied FAIR Data Points for metadata exposure, turning investigational parameters into a FAIR dataset. We demonstrated that this dataset is machine actionable by means of three different computational activities: federated query of patient data along open existing knowledge sources across the world through the Semantic Web, implementing Web APIs for data query interoperability, and building applications on top of these FAIR patient data for FAIR data analytics in the hospital.
Conclusions Our work demonstrates that a FAIR research data management plan based on ontological models for data and metadata, open Science, Semantic Web technologies, and FAIR Data Points is providing data infrastructure in the hospital for machine actionable FAIR digital objects. This FAIR data is prepared to be reused for federated analysis, linkable to other FAIR data such as Linked Open Data, and reusable to develop software applications on top of them for hypothesis generation and knowledge discovery.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
N. Queralt-Rosinach, R. Kaliyaperumal, C. Bernabe, Q. Long, A. Jacobsen and M. Roos are supported by funding from the European Union's Horizon 2020 research and innovation program under the EJP RD COFUND-EJP N 825575. We would also like to thank to the EJP RD, the GO FAIR VODAN, and the ZonMW Health Holland under the Trusted World of Corona, for supporting the research on FAIR data stewardship that was reused here. We would like to acknowledge that work in the BEAT-COVID project was partly funded by the Wake Up To Corona crowdfunding initiated by the Leiden University Fund (LUF).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethical approval was obtained from the Medical Ethical Committee Leiden-Den Haag-Delft (NL73740.058.20).
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All data and code are publicly available under license terms.
Abbreviations
- FAIR
- Findable, Accessible, Interoperable and Reusable
- VODAN
- Virus Outbreak Data Network
- TWOC
- Trusted World of Corona
- FDO
- FAIR Digital Object
- GDPR
- General Data Protection Regulation
- LUMC
- Leiden University Medical Center
- RDM
- Research Data Management
- EJP RD
- European Joint Programme Rare Diseases
- DCAT
- Data Catalogue Vocabulary
- FDP
- FAIR Data Point
- LOD
- Linked Open Data
- EDC
- Electronic Data Capture
- OBO
- Open Biological Biomedical Ontologies
- OWL
- Web Ontology Language
- RDF
- Resource Description Framework
- ICU
- Intensive Care Unit
- GUI
- Graphical User Interface
- URI
- Uniform Resource Identifier
- CQ
- Competency Question