PT - JOURNAL ARTICLE AU - Gouthamchand, Varsha AU - Dekker, Andre AU - Wee, Leonard AU - van Soest, Johan TI - FAIR-IFICATION OF STRUCTURED CLINICAL DATA AID - 10.1101/2021.07.23.21261032 DP - 2022 Jan 01 TA - medRxiv PG - 2021.07.23.21261032 4099 - http://medrxiv.org/content/early/2022/02/23/2021.07.23.21261032.short 4100 - http://medrxiv.org/content/early/2022/02/23/2021.07.23.21261032.full AB - One of the common concerns in clinical research is improving the infrastructure to facilitate the reuse of clinical data and deal with interoperability issues. FAIR (Findable, Accessible, Interoperable and Reusable) Data Principles enables reuse of data by providing us with descriptive metadata, explaining what the data represents and where the data can be found. In addition to aiding scholars, FAIR guidelines also help in enhancing the machine-readability of data, making it easier for machine algorithms to find and utilize the data. Hence, the feasibility of accurate interpretation of data is higher and this helps in obtaining maximum results from research work. FAIR-ification is done by embedding knowledge on data. This could be achieved by annotating the data using terminologies and concepts from Web Ontology Language (OWL). By attaching a terminological value, we add semantics to a specific data element, increasing the interoperability and reuse. However, this FAIR-ification of data can be a complicated and a time-consuming process. Our main objective is to disentangle the process of making data FAIR by using both domain and technical expertise. We apply this process in a workflow which involves FAIR-ification of four independent public HNSCC datasets from The Cancer Imaging Archive (TCIA). This approach converts the data from the four datasets into Linked Data using RDF triples, and finally annotates these datasets using standardized terminologies. By annotating them, we link all the four datasets together using their semantics and thus a single query would get the intended information from all the datasets.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThe work of author Varsha Gouthamchand has been supported financially by the Dutch Research Council (NWO) Indo-Dutch grant TRAIN (629.002.212). The work of author Leonard Wee has been funded by the Hanarth Foundation.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Exemption - data is public open access and already freely availableI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesData is already publicly available. https://wiki.cancerimagingarchive.net/display/Public/HNSCC https://wiki.cancerimagingarchive.net/pages/viewpage.action?pageId=33948764 https://wiki.cancerimagingarchive.net/display/Public/Head-Neck-Radiomics-HN1 https://wiki.cancerimagingarchive.net/display/Public/Head-Neck-PET-CT