RT Journal Article SR Electronic T1 A large, curated, open-source stroke neuroimaging dataset to improve lesion segmentation algorithms JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.12.09.21267554 DO 10.1101/2021.12.09.21267554 A1 Liew, Sook-Lei A1 Lo, Bethany A1 Donnelly, Miranda R. A1 Zavaliangos-Petropulu, Artemis A1 Jeong, Jessica N. A1 Barisano, Giuseppe A1 Hutton, Alexandre A1 Simon, Julia P. A1 Juliano, Julia M. A1 Suri, Anisha A1 Ard, Tyler A1 Banaj, Nerisa A1 Borich, Michael R. A1 Boyd, Lara A. A1 Brodtmann, Amy A1 Buetefisch, Cathrin M. A1 Cao, Lei A1 Cassidy, Jessica M. A1 Ciullo, Valentina A1 Conforto, Adriana B. A1 Cramer, Steven C. A1 Dacosta-Aguayo, Rosalia A1 Rosa, Ezequiel de la A1 Domin, Martin A1 Dula, Adrienne N. A1 Feng, Wuwei A1 Franco, Alexandre R. A1 Geranmayeh, Fatemeh A1 Gramfort, Alexandre A1 Gregory, Chris M. A1 Hanlon, Colleen A. A1 Hordacre, Brenton G. A1 Kautz, Steven A. A1 Khlif, Mohamed Salah A1 Kim, Hosung A1 Kirschke, Jan S. A1 Liu, Jingchun A1 Lotze, Martin A1 MacIntosh, Bradley J. A1 MatarĂ³, Maria A1 Mohamed, Feroze B. A1 Nordvik, Jan E. A1 Park, Gilsoon A1 Pienta, Amy A1 Piras, Fabrizio A1 Redman, Shane M. A1 Revill, Kate P. A1 Reyes, Mauricio A1 Robertson, Andrew D. A1 Seo, Na Jin A1 Soekadar, Surjo R. A1 Spalletta, Gianfranco A1 Sweet, Alison A1 Telenczuk, Maria A1 Thielman, Gregory A1 Westlye, Lars T. A1 Winstein, Carolee J. A1 Wittenberg, George F. A1 Wong, Kristin A. A1 Yu, Chunshui YR 2021 UL http://medrxiv.org/content/early/2021/12/11/2021.12.09.21267554.abstract AB Accurate lesion segmentation is critical in stroke rehabilitation research for the quantification of lesion burden and accurate image processing. Current automated lesion segmentation methods for T1-weighted (T1w) MRIs, commonly used in rehabilitation research, lack accuracy and reliability. Manual segmentation remains the gold standard, but it is time-consuming, subjective, and requires significant neuroanatomical expertise. We previously released a large, open-source dataset of stroke T1w MRIs and manually segmented lesion masks (ATLAS v1.2, N=304) to encourage the development of better algorithms. However, many methods developed with ATLAS v1.2 report low accuracy, are not publicly accessible or are improperly validated, limiting their utility to the field. Here we present ATLAS v2.0 (N=955), a larger dataset of T1w stroke MRIs and manually segmented lesion masks that includes both training (public) and test (hidden) data. Algorithm development using this larger sample should lead to more robust solutions, and the hidden test data allows for unbiased performance evaluation via segmentation challenges. We anticipate that ATLAS v2.0 will lead to improved algorithms, facilitating large-scale stroke rehabilitation research.Competing Interest StatementA.G.B. serves on the Biogen Australia Dementia Scientific Advisory Committee and editorial boards of Neurology and International Journal of Stroke. S.C.C. serves as a consultant for Abbvie, Constant Therapeutics, MicroTransponder, Neurolutions, SanBio, Panaxium, NeuExcell, Elevian, Medtronic, and TRCare. E.d.L.R. is employed by icometrix. C.A.H. serves on the Advisory Board for Welcony Magstim and as a Consultant for Brainsway. B.G.H. has a clinical partnership with Fourier Intelligence. G.F.W. serves on the Scientific Advisory Board for Myomo, Inc. Funding StatementS.-L.L. is supported by the NIH (R01NS115845; K01HD091283; P2CHD06570). L.A.B. is supported by the Canadian Institutes for Health Research. A.G.B. is supported by the NHMRC (GNT1020526). C.M.B. is supported by the NIH (R21HD067906; R01NS090677). J.M.C. is supported by the NIH/NICHD (R00HD091375). A.B.C. is supported by the NIH (R01NS076348-01); Hospital Israelita Albert Einstein (grant 2250-14), CNPq/305568/2016-7. A.N.D. is supported by the Texas Legislature to the Lone Star Stroke Clinical Trial Network. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the Government of the United States or the State of Texas. F.G. is supported by the Wellcome Trust (093957). B.G.H. is supported the National Health and Medical Research Council fellowship (1125054). S.A.K. is supported by the NIH (P20 GM109040) and the VA (1IK6RX003075). M.S.K. is supported by the NHMRC. H.K. is supported by a BrightFocus Research Grant (A2019052S). B.J.M. is supported by the Canadian Partnership for Stroke Recovery. F.P. is supported by the Italian Ministry of Health; Ricerca Corrente 2020, 2021. K.P.R. is supported by the NIH (R21HD067906, R01NS090677). N.J.S. is supported by the NIH/NICHD (R01HD094731; I01RX003066) and the NIH/NIGMS (P20GM109040). S.R.S. is supported by the European Research Council (ERC) under the project NGBMI. G.S. is supported by the Italian Ministry of Health RC 20-21/A. M.T. is supported by the Center for Data Science. G.T. is supported by the Temple University sub-award of NIH R24. L.T.W. is supported by The European Research Council under the European Union's Horizon 2020 research and Innovation program (ERC StG, Grant 802998). C.J.W. is supported by the NIH (R01HD065438). G.F.W. is supported by the VA RR&D Merit Review Program. Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The data are derived from 33 different research studies that were approved by their local ethics committee and were conducted in accordance with the 1964 Declaration of Helsinki. Informed consent was obtained from all subjects. The ethics committee at the site receiving and processing all retrospective data (the University of Southern California) approved the receipt and sharing of the de-identified data, which do not contain any personal identifiers.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data are publicly available in preprocessed format (standardized to MNI-152 space) on INDI (http://fcon_1000.projects.nitrc.org/indi/retro/atlas.html), a free platform for neuroimaging data sharing. Raw data in native space are available on the Archive of Data on Disability to Enable Policy and research (ADDEP, http://doi.org/10.3886/ICPSR36684.v4), which has a more stringent data use agreement to maintain privacy of the raw data. http://fcon_1000.projects.nitrc.org/indi/retro/atlas.html http://doi.org/10.3886/ICPSR36684.v4