Predicting Long COVID in the National COVID Cohort Collaborative Using Super Learner
Zachary Butzin-Dozier, Yunwen Ji, Haodong Li, Jeremy Coyle, Junming (Seraphina) Shi, Rachael V. Philips, Andrew Mertens, Romain Pirracchio, Mark J. van der Laan, Rena Patel, John M. Colford Jr., Alan E. Hubbard, the National COVID Cohort Collaborative (N3C) Consortium
doi: https://doi.org/10.1101/2023.07.27.23293272
This article is a preprint and has not been peer-reviewed. It reports new medical research that has yet to be evaluated and so should not be used to guide clinical practice.
Zachary Butzin-Dozier
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Yunwen Ji
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Haodong Li
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Jeremy Coyle
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Junming (Seraphina) Shi
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Rachael V. Philips
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Andrew Mertens
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Romain Pirracchio
2 Department of Anesthesiology and Perioperative Care, University of California, San Francisco, San Francisco, CA USA
Mark J. van der Laan
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Rena Patel
3 Department of Medicine, University of Washington, Seattle, WA USA
John M. Colford Jr.
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA
Alan E. Hubbard
1 School of Public Health, University of California, Berkeley, Berkeley, CA USA

Data Availability
All data analyzed and produced in the manuscript are accessible via the National COVID Cohort Collaborative Data Enclave. A version of the manuscript analysis, using synthetic data rather than de-identified data, can be accessed via GitHub.
Posted September 19, 2023.
medRxiv 2023.07.27.23293272; doi: https://doi.org/10.1101/2023.07.27.23293272