Data Availability
The datasets supporting the results of this article are available as follows: The CADEC dataset can be accessed at https://data.csiro.au/collection/csiro:10948. The TAC dataset is available at https://bionlp.nlm.nih.gov/tac2017adversereactions/. The SMM4H 2023 dataset is available upon request at https://codalab.lisn.upsaclay.fr/competitions/12941#participate-get-data. All datasets are provided in their raw formats, and preprocessing scripts to replicate our experiments and prepare the data for the CONORM framework are available at https://github.com/ds4dh/CONORM. The MedDRA files used for entity normalization and data preprocessing can be obtained upon request from https://www.meddra.org/. Specifically, we used MedDRA English version 16.0 for CADEC, version 24.0 for SMM4H 2023, and version 18.1 for TAC, ensuring consistency with the original dataset annotations.
https://data.csiro.au/collection/csiro:10948
https://bionlp.nlm.nih.gov/tac2017adversereactions/
https://codalab.lisn.upsaclay.fr/competitions/12941#participate-get-data