Abstract
The Collaborative Cohort of Cohorts for COVID-19 Research (C4R) is a national prospective study of adults at risk for coronavirus disease 2019 (COVID-19) comprising 14 established United States (US) prospective cohort studies. For decades, C4R cohorts have collected extensive data on clinical and subclinical diseases and their risk factors, including behavior, cognition, biomarkers, and social determinants of health. C4R will link this pre-COVID phenotyping to information on SARS-CoV-2 infection and acute and post-acute COVID-related illness. C4R is largely population-based, has an age range of 18-108 years, and broadly reflects the racial, ethnic, socioeconomic, and geographic diversity of the US. C4R is ascertaining severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection and COVID-19 illness using standardized questionnaires, ascertainment of COVID-related hospitalizations and deaths, and a SARS-CoV-2 serosurvey via dried blood spots. Master protocols leverage existing robust retention rates for telephone and in-person examinations, and high-quality events surveillance. Extensive pre-pandemic data minimize referral, survival, and recall bias. Data are being harmonized with research-quality phenotyping unmatched by clinical and survey-based studies; these will be pooled and shared widely to expedite collaboration and scientific findings. This unique resource will allow evaluation of risk and resilience factors for COVID-19 severity and outcomes, including post-acute sequelae, and assessment of the social and behavioral impact of the pandemic on long-term trajectories of health and aging.
The adverse effects of the coronavirus disease 2019 (COVID-19) pandemic on United States (US) health, economy, and society are widespread and will likely continue well beyond the initial waves of infections (1). Lack of preparedness and inadequate implementation and uptake of standard public health interventions in the US has already contributed to over 23 million cases, one million hospitalizations and over 500,000 deaths from COVID-19 (2, 2), making COVID-19 the third-leading cause of death in the United States in 2020 and the second-leading cause of death in those over 85 years of age (4, 2). Furthermore, prolonged symptoms and clinical abnormalities are observed in some COVID-19 survivors, raising concerns that post-acute sequelae of COVID-19 could pose an additional long-term health burden (6).
Epidemiologists have marshalled the strengths of numerous complementary study designs to identify the incidence and major clinical and socio-demographic risk factors for COVID-19 illness, as well as to describe post-COVID-19 outcomes. In particular, case-based registries and large-scale electronic health record (EHR)- and health systems-based cohorts provided critical early insights into disease susceptibility and short- and long-term sequelae. Among these were findings that socio-economic disadvantage (5, 7-9) and pre-existing clinical conditions, such as obesity, heart conditions, or lung disease (10-19), are associated with greater risk of severe illness.
Nonetheless, clinical and survey databases pose several problems for COVID-19 epidemiology. Clinical case series lack rigorous control groups, have non-standardized, limited data collection, and are subject to ascertainment biases – including, but not limited to, reduced health care access and quality among vulnerable communities. EHRs typically lack detailed information on health-related behaviors, such as smoking, so that controlling for confounders is challenging. Moreover, in the course of usual clinical care, clinically actionable diagnostic testing is performed for sick persons, but not well persons; hence, subclinical disease is not well detected, and genomic and other mechanistic biomarkers are generally lacking. Although inception cohorts with longitudinal follow-up of clinically ascertained cases of COVID-19 cases can address some of these knowledge gaps, survival bias, recall biases, and non-randomly missing data regarding pre-COVID health and behaviors are inevitable. In this context, strong assumptions are required to define phenotypes identified in COVID-19 survivors (e.g., fibrotic lung disease) as “sequelae” when they may have been present prior to the pandemic, and actually be antecedent risk factors or effect modifiers.
The Collaborative Cohort of Cohorts for COVID-19 Research (C4R) was established as a national, prospective study of adults at risk for incident COVID-19 that is relatively free of referral, survival, and recall biases. C4R includes fourteen US prospective cohort studies that, collectively, constitute a large, well-characterized, population-based sample that ranges in age from young adults to centenarians, and reflects the racial, ethnic, socioeconomic, and geographic diversity of the US. Using standardized protocols, C4R is aggressively attempting full ascertainment of SARS-CoV-2 infection and COVID-19 illness across all cohorts. C4R offers the additional major advantages of standardized data collection protocols, including high-quality clinical events surveillance dating back as far as 1971 in some studies, and robust retention rates.
For decades, the C4R cohorts have collected extensive longitudinal data on clinical and subclinical disease, behaviors, cognition, biomarkers, and social determinants of health. C4R will link this “pre-COVID” phenotyping to information on SARS-CoV-2 infection and acute and post-acute COVID-related illness. The integration of antecedent and illness-related data will provide a unique opportunity to understand mechanisms and modifiers of risk and resilience for SARS-CoV-2 infection and adverse COVID-19 outcomes. C4R will also support comparisons of longitudinal changes in health measures over the course of the pandemic in persons with varying degrees of COVID-19 severity. Furthermore, the availability of well-characterized participants unaffected by COVID-19 will allow the assessment and differentiation of the effects of infection, illness, and pandemic-related social, economic, and behavioral changes.
Overall, C4R aims to provide a valuable scientific resource to (1) evaluate risk and resilience factors for adverse COVID-19 outcomes, including severe COVID-19 illness and long-term complications, (2) assess the social and behavioral impact of the COVID-19 pandemic on long- term outcomes and trajectories of health and disease, and (3) examine disparities in COVID-19 risk and outcomes according to race, ethnicity, geography, and other social determinants of health.
METHODS
Cohort of cohorts
Fourteen prospective cohorts are collaborating in C4R (Table 1). Eight of the cohorts were designed to study cardiovascular disease epidemiology: Atherosclerosis Risk in Communities (ARIC) Study (20), Coronary Artery Risk Development in Young Adults (CARDIA) Study (21), Framingham Heart Study (FHS) (22), Hispanic Community Health Study/Study of Latinos (HCHS/SOL) (23-25), Jackson Heart Study (JHS) (26-28), Mediators of Atherosclerosis in South Asians Living in America (MASALA) Study (29, 2), Multi-Ethnic Study of Atherosclerosis (MESA) (31), and the Strong Heart Study (SHS) (32, 2). These cohorts generally recruited population- based samples, although only three (ARIC, CARDIA, FHS, HCHS/SOL) used representational sampling techniques at some or all sites. Four of the cardiovascular studies (ARIC, CARDIA, FHS, MESA) recruited multi-racial participants, and four were designed to study primarily specific race or ethnic groups (Hispanic/Latino participants in HCHS/SOL, Black participants in JHS, South Asian participants in MASALA, American Indian participants in SHS). Four multi-ethnic cohorts were established to study respiratory epidemiology: the Genetic Epidemiology of COPD (COPDGene) Study (34) and the SubPopulations and InteRmediate Outcome Measures in COPD Study (SPIROMICS) (35) were established as longitudinal case-control studies of cigarette smokers with and without COPD; Prevent Pulmonary Fibrosis (PrePF) is a study of early and established interstitial lung disease; and, the Severe Asthma Research Program (SARP) is a study of the entire range of mild to severe asthma, enriched for severe disease (36). Two studies – the Northern Manhattan Study (NOMAS) and the REasons for Geographic and Racial Differences in Stroke (REGARDS) – were established to study primarily neurological outcomes, including stroke and cognition. NOMAS is a multi-ethnic community study (37) and REGARDS is a biracial (non-Hispanic Black, White) national sample of the continental US that oversampled Black people and those residing in the southeast (38).
The cohorts that comprise C4R have collected detailed data on their participants’ health and behavior for as long as fifty years of follow-up (Figure 1). As summarized in Table 2, C4R cohorts have performed extensive longitudinal phenotyping of subclinical and clinical disease as well as assessments of laboratory biomarkers, ‘Omics, imaging, diet, behavior, and social determinants of health, and they have extensive biorepositories of stored specimens. Twelve cohorts have geocoding available, supporting participant-level assessment of neighborhood socioeconomic status, exposures to systemic racism, and environmental exposures such as air pollution. All C4R cohorts use similar or identical adjudication protocols to ascertain all-cause mortality. Ten cohorts ascertain cardiovascular events including myocardial infarction, stroke, and heart failure. Eight cohorts ascertain respiratory events such as COPD and asthma exacerbations. Seven cohorts ascertain incident cognitive impairment and/or dementia.
Collaboration
Most of the cohorts have a long history of collaboration in the genomics-oriented Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium (39), the NHLBI Pooled Cohorts Study focusing on respiratory epidemiology (40), the Cross-Cohort Collaboration (CCC) for cardiovascular epidemiology (41), the Blood Pressure and Cognition (BP COG) Study (42), and the genetic sequencing and multi-omics-focused Trans-Omics for Precision Medicine (TOPMed) Project (43). C4R is building and expanding upon these successes to advance COVID-19 research.
Planning for C4R began in March 2020, when the need for a coordinated, cross-cohort response to the knowledge gaps posed by the COVID-19 pandemic became self-evident and urgent. Cohort investigators initiated discussions regarding approaches to ascertain SARS-CoV-2 infections and COVID-related illnesses within the context of unprecedented cohort operational challenges associated with the outbreak. The National Heart, Lung, and Blood Institute (NHLBI) funded C4R via an Other Transactional Authority (OTA) mechanism in October 2020. Additional funding for inclusion of the neurology-focused cohorts was provided via the OTA by the National Institutes of Neurological Disorders and Stroke (NINDS) and the National Institute of Aging (NIA).
Leadership for C4R is provided by an organizing committee that includes leading – and often, founding – principal investigators (PIs) from all C4R cohorts, PIs from the C4R Data Coordination and Harmonization Center (DCHC), PIs from the C4R Biorepository and Central Laboratory (BCL), and program officers from the NHLBI, NINDS, and NIA. This organizing committee developed master C4R protocols for COVID-19 data collection.
Consistent with an ancillary studies model, each cohort in C4R is directly responsible for accomplishing its own data collection in accordance with the master protocol and under the supervision of its respective Observational Study Monitoring Board (OSMB), Steering Committee, and any other applicable regulatory authorities.
To promote and sustain this broad collaborative effort, C4R PIs invited additional investigators and cohort personnel to participate in C4R committees and working groups. Study materials, including protocols and meeting materials, are posted regularly on a password-protected investigator section of the C4R website (c4r-nih.org).
Participants
Cohort participants previously consented for in-person, telephone, and/or email contact and for the abstraction of medical records. Additional consent for ascertainment of COVID-19 data, including the serosurvey, is being obtained according to cohort-specific procedures, including verbal, remote, and traditional written informed consent.
Of 73,119 active participants across the fourteen cohorts, 53,972 participants were readily available for recruitment into C4R. Anticipated socio-demographic characteristics of potential C4R participants, estimated from current active cohort participants, are shown in Table 3. Fifty- eight percent of potential participants are 65 years or older, and thus at high risk for severe COVID-19. The anticipated sample is racially and ethnically diverse, based on self-report (44), with approximately 6% American Indian participants, 2% Asian participants, 26% Black participants, and 20% Hispanic/Latino participants.
All forty-eight continental states are represented among C4R participants, including rural, suburban, and urban communities (Figure 2). In all, C4R is being conducted across forty field/clinical centers, many of which are associated with more than one C4R cohort; one cohort with extensive geographic reach, the REGARDS, operates via telephone and in-home exams only (38).
Data collection
COVID-19 questionnaires
C4R is ascertaining self-reported COVID-related experiences by questionnaire. Each cohort will deploy C4R questionnaires twice within 18 months following the initial outbreak in March 2020 via telephone, mail-in, online, email, or smartphone apps. Wave 1 questionnaires were developed as early as March 2020 in certain cohorts (45) and urgently administered in spring and summer 2020. Although these efforts pre-dated C4R, early informal cross-cohort collaborations ensured that many cohorts used identical questionnaires, and all of them generated common data elements regarding infection, testing, hospitalization, and recovery. Wave 2 questionnaires were fully standardized to include domains on COVID-19 infection, testing, hospitalization, symptoms, recovery, re-infection, contacts, vaccination, behavioral changes, sleep, memory loss, depression, anxiety, fatigue, and resilience. The C4R questionnaire was developed collaboratively to include validated and PhenX toolkit instruments (https://www.phenxtoolkit.org) (46-55) in order to optimize comparability with pre-pandemic assessments and across C4R and other epidemiology cohorts. The C4R questionnaire, including translations into Spanish and Mandarin, are available on PhenX; Research Electronic Data Capture (REDCap (56, 2)) programming may be available on request.
COVID-related events ascertainment
C4R is ascertaining COVID-related hospitalizations and deaths that are identified via the C4R questionnaire or other surveillance methods available to the cohorts, including EHR linkages, where available. Each cohort is using its own established infrastructure for ascertainment of medical records and death certificates, including use of the National Death Index (NDI), the Centers for Medicare & Medicaid Services (CMS), International Classification of Diseases (ICD) codes (58), and linkage to records from local departments of health. Cohorts may review events locally at their Field/Coordinating Centers or transfer records for central review by C4R. The C4R events review is designed to assess severity and major complications of COVID-19 illness, including pneumonia, myocardial infarction, stroke, thromboembolism, and acute renal failure. The protocols use, or are modeled after, longstanding cohort protocols to classify and validate cardiovascular, respiratory (19), and thromboembolic (59) events. Protocols for ascertainment, review, and classification are available on the study website (c4r-nih.org).
Dried blood spot collection
C4R is ascertaining serostatus by dried blood spot (DBS) in 2021. Cohort field centers receive DBS collection kits from the BCL and are responsible for recruitment, consent, and distribution to participants. Updated details regarding vaccination status are obtained at the time of DBS consent and immediately prior to mailing the DBS kit to the participant. Participants mail the completed kits directly to the BCL or to the cohort field or coordinating center as an intermediary step. Participant instructions, including a video, are provided by the cohort and via the C4R website (c4r-nih.org) and/or cohort-specific websites. In cohorts with upcoming in-person exams, the DBS may be collected in-person by research staff.
C4R Common Data Elements
C4R data collection will define a spectrum of COVID-19 outcomes. Ascertainment of COVID- related hospitalizations and deaths will characterize, classify, and validate moderate-to-severe COVID-19 illnesses. In addition to identifying these events, questionnaires are being used to obtain self-reported information on the nature, severity, and duration of symptoms during acute infection and in the post-acute setting. This will support classification of symptomatic and asymptomatic infections, as well as cases of prolonged recovery or post-acute sequelae of SARS-CoV-2 infection (PASC). Data on behaviors, attitudes, psychosocial impacts, and vaccinations will also be collected. Seropositive individuals without self-reported infection will be reclassified as infected, whereas seronegative individuals with prior positive testing by self- report or health records will be classified as sero-reverted.
Data management
C4R data collection is coordinated centrally at the DCHC at Columbia University Irving Medical Center. Electronic data collection forms are being programmed into REDCap for use or adaptation by the cohort coordinating centers. Metadata on completion of questionnaires, events ascertainment, and DBS kit collection status are reported and reviewed bi-weekly to ensure operational milestones are met. Participants are assigned a C4R study identifier by cohort-specific coordinating centers that is used for participant-level data transfers and analyses.
Biorepository and central laboratory and serology measurement
The C4R BCL at the University of Vermont is responsible for establishing a C4R biorepository of DBS, plus other biospecimens that may be collected in the future, and for performing and/or coordinating performance of any centralized clinical and biomarker assays and serology assays. Individual DBS Collection kits are produced by the BCL and shipped to the cohorts (either to the individual field centers or the cohort coordinating center, based on cohort preference). Kits and DBS cards are labeled with a biospecimen identifier, which is linked to C4R identifiers that are maintained centrally and not shared with the BCL, through the use of a “linking key.” Filled DBS cards are returned to the BCL, and batches prepared for serology assays performed by the New York State Wadsworth Center’s Bloodborne Viruses Laboratory (BVL) under CLIA and New York State certification. The BVL performs a SARS-CoV-2 IgG Microsphere Immunoassay using Luminex bead technology for qualitative detection of human IgG antibodies to SARS-CoV-2 nucleocapsid (N) and spike subunit 1 (S1) antigens. Based on testing 730 pre-COVID DBS and >1100 DBS from individuals with laboratory-confirmed infection, specificity is 99.5% for both N and S1 and sensitivity ranged from 90 to 96% for symptomatic individuals and 77 to 91% for asymptomatic individuals. Sensitivity increased for both groups with time from positive PCR test, accounting for the range. This assay was used successfully to test over 57,000 DBS for statewide serosurveys from April-June as part of New York State’s public health response. Serology results are reported by the BVL to the C4R BCL, and then to the cohort coordinating centers, which are responsible for a) recombining the results with the proper participants based on the “linking key”, and b) reporting results to participants according to usual cohort practices. Serological results are not believed to have clinical relevance, and the CDC does not currently recommend modifications to individual behavior or clinical care based on antibody status alone (60); hence, no protocols for “alert” findings have been established, and participants may opt out of results return. Protocols for the serosurvey are available on the study website (c4r-nih.org).
Since all current vaccines in use in the U.S. generate an immune response to the Spike protein, we anticipate being able to distinguish vaccination from viral infection by the use of the anti- nucleocapsid assay results (61).
Harmonization
Harmonization of COVID-19 and pre-pandemic data will be performed centrally to define COVID-19 common data elements and to align pre-pandemic data for large-scale, longitudinal analyses. This effort will leverage prior harmonization efforts across C4R cohorts in the TOPMed Project, the NHLBI Pooled Cohorts Study, the BP COG Study, and the CHARGE Working Groups (10, 40, 42, 62-67). Due to their significance to COVID-19 epidemiology, particular emphasis will be placed on harmonizing the large amount of deep pre-pandemic physiologic (40), neurocognitive (68-74), and imaging-based (75-82) phenotyping collected within the decade prior to the outbreak using deep-learning (18, 83-85) and other methods (Table 4).
Quality control
C4R cohorts have established protocols for checking data completeness and accuracy at the field center and coordinating center levels. Dual data entry is encouraged but not required, since it will not be feasible in all settings due to local impediments and COVID-related exigencies. Ten percent of event reviews will be randomly selected for re-review. Reviewers not meeting standards will receive regular feedback with recommendations for retraining and/or protocol modifications, as appropriate. Serological assays will be repeated on a random 5% sub- sample of blind duplicates.
Data sharing
The C4R Commons Agreement, modeled on the CHARGE Analysis Commons Consortium Agreement (86), will expedite cross-cohort data harmonization and sharing, as allowed (87). Following review and approval, cohort-specific agreements would permit COVID-19 and pre- pandemic data to be uploaded to the NIH-supported cloud computing platform, hosted by BioData Catalyst. Access to the pooled C4R dataset would be granted to investigators involved in core harmonization efforts and those with manuscript proposals approved by C4R publications and cohort coordinating committees. Once harmonization and related quality control is completed, C4R common data elements will be transferred as a limited dataset for public access on BioData Catalyst in accord with cohort-specific consents and commitments.
Governance
The administrative coordinating center for C4R is the NHBLI CONNECTS program (nhlbi- connects.org). Metadata on operational progress is submitted biweekly to CONNECTS for tracking and review purposes. Central functions of C4R are overseen by a C4R OSMB convened by CONNECTS.
DISCUSSION
C4R will leverage existing American cohort studies to develop a large, multi-ethnic, pooled cohort of participants with incident COVID-19 and COVID-free participants that is relatively free of referral, survival, and recall biases compared to clinically based inception cohorts of COVID- 19 patients. C4R includes a highly diverse population of US adults, including older and socially disadvantaged populations that have especially high risk of adverse COVID-19 outcomes. C4R is distinguished from other large studies of COVID-19 by its unparalleled wealth of pre-pandemic phenotyping, providing unique opportunities to evaluate a range of risk and resilience factors for SARS-CoV-2 infection and adverse COVID-19 outcomes, including severe COVID-19 illness, PASC, and other long-term effects of the pandemic response. Unlike case registries and EHR-based studies, C4R’s repeated exams and cognitive assessments before and after COVID-19 also provide important opportunities to estimate the social and behavioral impact of the COVID-19- related pandemic response on changes in long-term mental and physical health across multiple domains.
C4R will provide important opportunities for future studies using a range of epidemiologic study designs. For example, nested within C4R, longitudinal cohort studies of COVID-affected and unaffected participants could repeat a variety of subclinical measures (e.g., echocardiography, lung imaging, neuro-cognitive assessment) to define reliably the consequences of COVID-19 infection. Ongoing high-quality events follow-up will allow assessment of long-term clinical health outcomes following COVID-19 and the pandemic period. The extensive biobanks maintained by the cohorts could support measurement of prior viral infections, immune- phenotypes, metabo-types, ‘Omics, and other pre-COVID characteristics that may be risk determinants or modifiers for COVID-19 susceptibility and vaccine effectiveness. The fact that the cohorts continue to follow their participants provides a dynamic resource to study emerging questions in COVID-19 epidemiology, including but not limited to viral variants and vaccination. And, C4R provides a model for cross-cohort collaboration and active data sharing that will promote consortium-based epidemiologic work on biological, social, and epidemiologic questions beyond the COVID-19 pandemic, in alignment with recommendations for the strategic transformation of population studies (88).
Data Availability
Following review and approval, cohort-specific agreements would permit COVID-19 and pre-pandemic data from C4R to be uploaded to the NIH-supported cloud computing platform, hosted by BioData Catalyst.
Author affiliations
Albert Einstein College of Medicine
Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, New York, United States (Carmen R Isasi, Robert C Kaplan)
Children’s Hospital/ Harvard Medical School
Boston Children’s Hospital/Harvard Medical School
Department of Medicine, Division of Allergy and Immunology, Boston Children’s Hospital, Boston Massachusetts (Wanda Phipatanakul)
Boston University
Department of Pathology & Laboratory Medicine, Boston University School of Medicine, Boston, Massachusetts, United States (Joel M Henderson)
Departments of Medicine and Epidemiology, Boston University, Boston, Massachusetts, United States (Ramachandran S Vasan)
Department of Medicine, School of Medicine, and Department of Biostatistics, School of Public Health, Boston University, Boston, Massachusetts, United States (Vanessa Xanthakis)
Brigham and Women’s Hospital/ Harvard Medical School
Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, Massachusetts (Bruce Levy, Matthew R Moll)
Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA (Matthew Moll).
Colorado Anschutz Medical Campus
Division of Pulmonary Sciences and Critical Care, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado, United States (Alyssa Asaro, Joyce Lee, David Schwartz)
Department of Epidemiology, School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, Colorado, United States (Gregory Kinney)
Columbia University
Division of General Medicine, Department of Medicine, Columbia University Medical Center, New York, New York, United States (Pallavi P Balte, R Graham Barr, Akshaya Krishnaswamy, Elizabeth C Oelsner, Priya Palta, Yiyi Zhang)
Columbia Data Coordinating Center, New York State Psychiatric Institute and Columbia College of Physicians and Surgeons, Columbia University Medical Center, New York, New York, United States (Howard F Andrews)
Department of Neurology, Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, United States (Mitchell Elkind, Jose Gutierrez, Jennifer J Manly)
Department of Epidemiology, Mailman School of Public Health, New York, NY United States (Ryan T Demmer [adjunct], Mitchell Elkind)
Georgetown University
MedStar Health Research Institute and Department of Medicine, Georgetown University, Washington, District of Columbia, United States (Jason G Umans)
Johns Hopkins University
Department of Epidemiology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland, United States (Josef Coresh)
Division of Cardiology, Department of Medicine, Johns Hopkins University, Baltimore, Maryland, United States (Wendy S Post)
Lundquist Institute
Lundquist Institute, Los Angeles, California, United States (Jerome I Rotter)
National Institutes of Health (intramural)
Division of Intramural Research, National Heart Lung and Blood Institute of the National Institutes of Health (Veronique Roger)
National Jewish Health
Division of Pulmonary, Critical Care & Sleep Medicine, Department of Medicine, National Jewish Health, Denver, Colorado, United States (Barry Make)
Division of Rheumatology, Department of Medicine, National Jewish Health, Denver, Colorado, United States (Elizabeth A Regan)
National Jewish Health, Denver, Colorado, United States (Grace Chen)
New York State Department of Health
Bloodborne Viruses Laboratory, Wadsworth Center, New York State Department of Health, Albany, New York, United States (Monica Parker)
Northwestern
Center for Epidemiology and Population Health, Department of Preventive Medicine, Northwestern University, Chicago, Illinois, United States (Norinna Bai Allen, Namratha R Kandula)
Division of General Internal Medicine, Department of Medicine, Northwestern University, Chicago, Illinois, United States (Namratha R Kandula)
Pennsylvania State University
Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Pennsylvania State University, State College, Pennsylvania, United States (Dave Mauger)
San Diego State University
South Bay Latino Research Center, Department of Psychology, San Diego State University, San Diego, California, United States (Linda Gallo, Gregory A Talavera)
Texas Biomedical Research Institute
Population Health Program, Texas Biomedical Research Institute, San Antonio, Texas, United States (Shelley A Cole)
University of Alabama at Birmingham
Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, United States (Suzanne E Judd)
Department of Epidemiology, University of Alabama at Birmingham, Birmingham, Alabama, United States (Kelley Pettee Gabriel, Virginia J Howard, Cora E Lewis, Emily B Levitan)
Division of Preventive Medicine, School of Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States (James M Shikany)
UCLA
The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA (Jerome I Rotter)
Division of Cardiology, Department of Medicine, UCLA Medical Center, Los Angeles, California, United States (Karol E Watson)
University of California San Francisco
Department of Medicine, University of California San Francisco, San Francisco, California, United States (Alka M. Kanaya, Arunee A. Chang)
Department of Obstetrics, Gynecology & Reproductive Sciences, University of California San Francisco, San Francisco, California, United States (Michael Schembri)
Division of Pulmonary, Critical Care, Allergy, and Sleep Medicine, Department of Medicine, University of California San Francisco, San Francisco, California, United States (Prescott G. Woodruff)
University of Kentucky
Department of Epidemiology, College of Public Health, University of Kentucky, Lexington, KY (Anna M Kucharska-Newton).
University of Miami
Department of Neurology and Evelyn F. McKnight Brain Institute, University of Miami, Miami, Florida, United States (Tatjana Rundek, Ralph L. Sacco)
University of Michigan
Division of Pulmonary and Critical Care, Department of Medicine, University of Michigan, Ann Arbor, Michigan, United States (MeiLan K. Han)
Division of General Medicine, Department of Medicine, University of Michigan, Ann Arbor, Michigan, United States (Deborah A Levine)
University of Minnesota
Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, Minnesota, United States (Ryan T Demmer, Aaron R Folsom, David R Jacobs Jr)
University of Mississippi Medical Center
Departments of Medicine and Population Health Science, School of Public Health, University of Mississippi Medical Center, Jackson, Mississippi, United States (Pramod Anugu, Adolfo Correa, Mario Sims, Yuan-I Min)
School of Nursing, University of Mississippi Medical Center, Jackson, Mississippi, United States (Karen Winters)
University of North Carolina
Collaborative Studies Coordinating Center, Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States (David Couper, Kimberly Ring)
Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States (Anna Kucharska-Newton)
Department of Nutrition, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States (Katie Meyer)
University of Oklahoma Health Sciences Center
Center for American Indian Health Research, Department of Biostatistics and Epidemiology, Hudson College of Public Health, University of Oklahoma Health Sciences Center, Oklahoma City, Oklahoma, United States (Tauqeer Ali, Kimberly Malloy, Ying Zhang)
University of Pittsburgh
Department of Environmental and Occupational Health, Graduate School of Public Heath, University of Pittsburgh, Pittsburgh, Pennsylvania, United States (Sally E Wenzel)
Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States (Jessica Bon)
UT Health San Antonio
Glenn Biggs Institute for Alzheimer’s & Neurodegenerative Diseases, Graduate School of Biomedical Sciences, UT Health San Antonio, San Antonio, Texas, United States (Sudha Seshadri)
University of Vermont
Laboratory for Clinical Biochemistry Research, Department of Pathology & Laboratory Medicine, Larner College of Medicine, University of Vermont, Burlington, Vermont, United States (Rebekah Boyle, Elaine Cornell, Russell Tracy)
Department of Medicine, Vermont Center for Cardiovascular and Brain Health, Larner College of Medicine at the University of Vermont, Burlington, Vermont, United States (Mary Cushman, Debora Kamin Mukaz)
University of Washington
Department of Epidemiology, School of Public Health, University of Washington, Seattle, Washington, United States (Amanda M Fretts, Robert Kaplan, Bruce Psaty)
Department of Health Services, School of Public Health, University of Washington, Seattle, Washington, United States (Bruce M Psaty)
Division of General Medicine, Department of Medicine, University of Washington, Seattle, Washington, United States (Bruce Psaty)
Department of Biostatistics, School of Public Health, University of Washington, Seattle, Washington, United States (Karen Hinckley Stukovsky)
Wake Forest School of Medicine
Department of Epidemiology and Prevention, Department of Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina, United States (Alain G Bertoni)
Division of Pulmonary, Critical Care, Allergy, and Immunologic Diseases, Department of Medicine, Wake Forest School of Medicine (Wendy C Moore, Victor E Ortega)
Author contributions
All authors meet ICJME criteria for authorship.
Funding
Thank you’s
We thank the participants of each cohort for their dedication to the studies.
Members of the study group
Ali, Tauqueer
Allen, Nori
Andrews, Howard
Anugu, Pramod
Arora, Komal
Arynchyn, Alex
Asaro, Alyssa
Balte, Pallavi P
Bancks, Mike
Barr, R Graham
Bateman, Lori
Bertoni, Alain
Bick, Alexander G
Bleecker, Eugene
Bluemke, David
Boyle, Rebekah
Budoff, Matt
Cardwell, Jonathan
Carr, Jeffrey
Causey, Jackie
Chang, Ann
Chen, Grace
Coady, Sean
Cole, Shelley
Coresh, Josef
Cornell, Elaine
Correa, Adolfo
Couper, David
Culbertson, Emily
Curtis, Jeffrey L
Cushman, Mary
DeCarli, Charles
Demmer, Ryan T
Devereux, Richard B
DiPersio, Nicholas
Dolezal, Brett
Doyle, Margaret
Elkind, Mitch
Fain, Sean
Feinstein, Matt
Floyd, James
Freeman, Christine
Fretts, Amanda
Gabriel, Kelley
Gallo, Linda
Gharib, Sina
Goode, Caroline
Goyal, Parag
Griswold, Mike
Han, MeiLan
Hanna, David
Hastie, Annette
Heckbert, Susan R.
Henderson, Joel
Hinckley Stukovsky, Karen
Hoffman, Eric
Hoffman, Udo
Hornig, Mady
Howard, Virginia J
Isasi, Carmen R
Jacobs, David R Jr
Jaquis, Cashell
Judd, Suzanne E
Kamin Mukaz, Debora
Kanaya, Alka M
Kandula, Namratha R
Kaplan, Robert
Khan, Sadiya
King, Jonathan
Kinney, Gregory L
Kucharska-Newton, Anna
Kumi, Smith
Laine, Andrew F.
Lee, Joyce
Lemaitre, Rozenn
Levin, Bonnie
Levine, Deborah
Levitan, Emily B.
Levy, Bruce
Lima, Joao
Lynch, David Make, Barry
Malloy, Kimberly
Manly, Jennifer J
Mauger, Dave
Melius, Katy
Mendoza-Puccini, Carolina
Merkin, Sharon
Meyer, Katie
Meyers, Deborah
Min, Yuan-I (Nancy)
Minotti, Melissa
Moise, Nathalie
Moore, Wendy
Moy, Claudia
Mutalik, Karen
Nasrallah, Ilya
Nelson, Cheryl
Nelson, Lauren
Noel, Patricia
Nordvig, Anna
O’Connor, George
Odden, Michelle
Oelsner, Elizabeth
O’Leary, Marcia
O’Neal, Wanda
Ortega, Victor
Palta, Priya
Pamir, Nathalie
Papanicolaou, George
Phillips, Brenda
Phipatanakul, Wanda
Plante, Tim
Pokharel, Yashashwi
Post, Wendy
Postow, Lisa
Psaty, Bruce
Purcell, Shaun M.
Raffield, Laura
Ramachandran, Vasan
Redline, Susan
Regan, Liz
Reiner, Alex
Rodriguez, Carlos
Roger, Veronique
Rotter, Jerry
Sacco, Ralph
Safford, Monika M.
Schembri, Michael
Schwartz, David
Seshadri, Sudha
Shah, Amil
Shah, Sanjiv J
Shikany, James
Sims, Mario
Smith, Kumi
Smoller, Sylvia
Soliman, Elsayed
Solomon, Scott
Sotres-Alvarez, Daniela
Stewart, Meg
Strobino, Kevin
Sutherland, Patrice
Swenor, Bonnie
Talavera, Gregory
Terry, Greg
Tracy, Russ
Tse, Janis
Umans, Jason
Vasan, Ramachandran S
Vuga, Louis
Wagster, Molly
Wang, Henry
Washko, George
Wentzel, Sally
West, Cynthia
Wilson, Carla
Woodruff, Prescott
Wright, Jackie
Xanthakis, Vanessa
Yuan, Ya
Zakai, Neil
Zhang, Ying
Zhang, Yiyi
Disclaimer
The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Conflict of Interest Statement
Ali, Asaro, Bertoni, Boyle, Cole, Coresh, Cornell, Correa, Cushman, Demmer, Folsom, Fretts, Howard, Isasi, Jacobs, Kamin Mukaz, Kandula, Malloy, Meyer, Oelsner, Parker, Pettee, Ring, Roger, Schembri, Seshadri, Shikany, Tracy, Vasan, Winters, Zhang: None
Mitchell Elkind receives royalties from UpToDate for a chapter on neurological complications of COVID-19; receives study drug in kind from the BMS-Pfizer Alliance for Eliquis and ancillary funding from Roche, both for an NIH-funded trial of stroke prevention.
MeiLan K Han reports consulting for BI, GSK, AZ, Merck, Mylan, Verona, Teva, Cipla, Chiesi and Sanofi. She reports research support from Novartis and Sunovion.
Emily B. Levitan has research funding from Amgen and has received consulting fees for a research project sponsored by Novartis.
Wanda Phipatanakul has funding and trial medication support from Astra Zeneca, GSK, Merck, Genentech, Novartis, Regeneron, Sanofi for asthma studies and consulting from GSK, Genentech, Novartis, Sanofi, Regeneron for asthma therapeutics.
Bruce M Psaty serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson.
David Schwartz is founder and chief scientific officer of Eleven P15, a company dedicated to the diagnosis, prevention, and treatment of early presentations of pulmonary fibrosis. Joyce Lee serves as a consultant for Eleven P15.
Sally Wenzel receives funding for consulting and clinical trials from AstraZeneca, GSK, Sanofi- Genzyme, Novartis, Knopp; she also receives research support from Pieris and Regeneron.
Prescott Woodruff has a research grant from Genentech and is a consultant for Sanofi, Regeneron, Clarus ventures, 23andMe, Theravance, Astra Zeneca, Glenmark Pharmaceuticals, Amgen, and NGM Pharma, outside the submitted work.
- Abbreviations
- ARIC
- Atherosclerosis Risk in Communities
- BCL
- Biorepository and Central Laboratory
- C4R
- Collaborative Cohort of Cohorts for COVID-19 Research
- CARDIA
- Coronary Artery Risk Development in Young Adults
- CONNECTS
- Collaborating Network of Networks for Evaluating COVID-19 and Therapeutic Strategies
- COPDGene
- Genetic Epidemiology of COPD
- FHS
- Framingham Heart Study
- HCHS/SOL
- Hispanic Community Health Study/Study of Latinos
- JHS
- Jackson Heart Study
- MASALA
- Mediators of Atherosclerosis in South Asians Living in America
- MESA
- Multi-Ethnic Study of Atherosclerosis
- NOMAS
- Northern Manhattan Study
- PASC
- Post-Acute Sequelae of SARS-CoV-2 Infection
- PrePF
- Prevent Pulmonary Fibrosis
- REGARDS
- REasons for Geographic and Racial Differences in Stroke
- SARP
- Severe Asthma Research Program
- SPIROMICS
- Subpopulations and Intermediate Outcome Measures in COPD Study
- SHS
- Strong Heart Study
References
- 1.↵
- 2.↵
- 3.↵
- 4.↵
- 5.↵
- 6.↵
- 7.↵
- 8.
- 9.↵
- 10.↵
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.↵
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.
- 25.↵
- 26.↵
- 27.
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.
- 48.
- 49.
- 50.
- 51.
- 52.
- 53.
- 54.
- 55.↵
- 56.↵
- 57.↵
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.
- 64.
- 65.
- 66.
- 67.↵
- 68.↵
- 69.
- 70.
- 71.
- 72.
- 73.
- 74.↵
- 75.↵
- 76.
- 77.
- 78.
- 79.
- 80.
- 81.
- 82.↵
- 83.↵
- 84.
- 85.↵
- 86.↵
- 87.↵
- 88.↵