RT Journal Article SR Electronic T1 Three-dimensional cranio-facial landmark detection in CT slices from a publicly available database, using multi-phased regression networks on a personal computer JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.03.21.21253999 DO 10.1101/2021.03.21.21253999 A1 Nishimoto, Soh A1 Saito, Takuya A1 Ishise, Hisako A1 Fujiwara, Toshihiro A1 Kawai, Kenichiro A1 Kakibuchi, Maso YR 2021 UL http://medrxiv.org/content/early/2021/03/26/2021.03.21.21253999.abstract AB Aim Geometrical assessments to comprehend the shape of an object are done based on characteristic landmarks. Computer assisted tomography (CT) images, horizontal slices as two-dimensional pictures, can be digitally restructured into virtual three-dimensional objects. Automatic detection of the landmarks, if developed, will be a great help not only medically, but also for anthropologically. The aim of this study is to develop an automated system to predict three-dimensional coordinate values of cranio-facial landmarks in sequences of CT slices.Methods CT images were obtained from a publicly available database. Digital reconstruction was done to obtain three dimensional models. Sixteen landmarks were plotted on the models and coordinate values of them were recorded. Multi-phased deep learning system was constructed. For the first phase, 512 x 512 pixels images were resized to 96 x 96 pixels. A regression deep learning network was trained with 90 training data. For the second phase, for each landmark, 100 x 100 pixels images were cropped from the original images. Sixteen models were trained. For the third phase, 50 x 50 pixels images were cropped, and models were trained.Results Three-dimensional error for the first phase, testing 30 data, was 11.60 pixels in average. (1 pixel = 500 / 512 mm) For the second phase, it was significantly improved to 4.66 pixels. For the third phase, it was significantly progressed to 2.89. This was comparable to the gaps between the landmarks, plotted by two experienced practitioners.Discussion The calculation volume required to process three-dimensional pile of images is tremendous. One solution may be to compress the images, but detailed information may be lost during the process. Our proposing method of multi-phased prediction, coarse detection first and narrowing down the detection area, may be a possible solution, within the physical limitation of memory and computation.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was partly supported by Nakatani Foundation Research grant.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Not applicableAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesOriginal CT data was obtained from a publicly available database: The Cancer Imaging Archive Public Access (wiki.cancerimagingarchive.net), Head-Neck-Radiomics-HN1 https://wiki.cancerimagingarchive.net/display/Public/Head-Neck-Radiomics-HN1