Abstract
Objectives The use of synthetic data to supplement clinical trial placebo groups or for trial planning is rapidly gaining interest. However, there is not yet an established framework for generating synthetic data for these purposes. In this work we test two approaches to generating synthetic placebo arms for ALS trials with survival being the primary outcome variable.
Methods For the first approach, we extracted sample subsets from the UK MND register (n = 308) using an evolutionary algorithm such that the subset baseline variables matched a target trials group, either people enrolled in LiCALS (n = 106) or people included in the PRO-ACT database (n = 171). We also applied trial specific exclusion criteria where possible or alternatively we applied a custom ‘time filter’. For the second approach, survival was predicted for LiCALS participants using the ENCALS model. Survival probabilities from each method were compared to real placebo participants using Kaplan-Meier analysis and the log rank test.
Results We found that the synthetic placebo groups derived from the MND register matched the target trials outcomes very well. The ENCALS model produces synthetic placebo groups that are significantly different to the real placebo groups. However, when participants are censored at 6 month intervals, the ENCALS synthetic group matches the target group very well between 24 and 48 months, indicating a possible timeframe that this method could be utilised.
Conclusion Both the register based approach and the ENCALS prognostic model generated synthetic placebo groups that matched placebo groups from historical trials. These methods need to be validated in prospective trials.
Competing Interest Statement
AAC is a consultant for Mitsubishi-Tanabe Pharma, GSK, and Chronos Therapeutics, and chief investigator for clinical trials for Cytokinetics and OrionPharma.; AAC serves on scientific advisory boards for Mitsubishi Tanabe, Roche, Denali Pharma, Cytokinetics, Lilly, and Amylyx and has received a reeseach grant from Biogen.
Funding Statement
AAK is funded by The Motor Neurone Disease Association (MNDA), NIHR Maudsley Biomedical Research Centre and ALS Association Milton Safenowitz Research Fellowship.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Harry Bowles is a PhD student working on bioinformatics and clinical trial optimisation in amyotrophic lateral sclerosis. His research is focused on genome sequence analysis and synthetic data.
Dr Sarah Opie-Martin is an epidemiologist working on data collection and analysis of epidemiology projects in amyotrophic lateral sclerosis. Her research focusses on environmental risk factors for ALS, clinical subgroups, and disease prognosis.
Ali Shojaie is a medical doctor and clinical researcher. He is doing his PhD in clinical neuroscience. His PhD focused on different aspects of non-motor symptoms in ALS.
Laura Libonati is a PhD student working in the clinical setting with patients with Amyotrophic Lateral Sclerosis. Her research is focused on analyzing clinical, electrophysiological, and serological markers with a possible role in the disease’s development and progression.
After a career in Biophysics, Dr. Alfredo Iacoangeli accepted his PhD “cum laude” in Life Sciences in January 2016 from “Sapienza” University of Rome where he worked on structural bioinformatics with a particular focus on protein structure prediction and protein-peptide interaction. He joined King’s College London in March 2016 and he is now the Bioinformatics lead of a joint programme between the Department of Basic and Clinical Neuroscience and the Health Informatics Unit at the Institute of Psychiatry, Psychology & Neuroscience at King’s College London. The aim of this project is twofold: 1) the development of a high throughput gene, environment and epigenetics database and analysis system for international MND/ALS research; 2) the use of large multi-omics datasets to identify subgroups MND/ALS patients with homogeneous disease causes and clinical phenotype, and to gain new insights into the disease pathogenesis. Dr. Iacoangeli is the primary author of several scientific articles in the field of Structural and Genomic Bioinformatics and MND/ALS genetics.
Dr Ahmad Al Khleifat is a clinician and scientist currently working with Professor Al-Chalabi’s group. His main focus of research is on ALS clinical staging, clinical trial data analysis, and disease gene identification through next generation sequencing and coupling this with advanced data analysis to deliver diagnostic tools for complex disease genetics. Since 2017 Dr Al Khleifat has been a co-chair of genomic structural variations group responsible for the collection of the largest genomic single disease cohort in the world; Project MinE. Dr Al Khleifat is the leading the analysis for the structural variation genome wide association analysis using 10,500 samples from the Project MinE initiative.
Data Availability
All data produced in the present study are available upon request to the authors