Abstract
The whole genome karyotype refers to the sequence of large chromosomal segments that make up an individual’s genotype. karyotype analysis, which includes descriptions of aneuploidies and other rearrangements is crucial for understanding genetic risk factors, for diagnosis, treatment decisions, and genetic counseling linked to constitutional disorders. The current karyotyping standard is based on microscopic examination of chromosomes, a complex process that requires high expertise and offers Mb scale resolution.
Optical Genome Mapping (OGM) technology can identify large DNA lesions in a cost-effective manner. In this paper, we developed OMKar, a method that uses OGM data to create a virtual karyotype. OMKar processes Structural (SV) and Copy Number (CN) Variants as inputs and encodes them into a compact breakpoint graph. It recomputes copy numbers using Integer Linear Programming to maintain CN balance and then identifies constrained Eulerian paths representing entire donor chromosomes. In tests using 38 whole genome simulations of constitutional disorders, OMKar reconstructed the karyotype with 88% precision and 95% recall on SV concordance and 95% Jaccard score on CN concordance. We applied OMKar to 50 prenatal, 41 postnatal, and 63 parental samples from ten different sites. OMKar reconstructed the correct karyotype in 144 out of 154 samples, covering 25 of 25 aneuploidies, 32 of 32 balanced translocations, and 72 of 82 unbalanced variations. Detected constitutional disorders included Cri-du-chat, Wolf-Hirschhorn, Prader-Willi deletions, Down, and Turner syndromes. Importantly, it identified a plausible genetic mechanism for five cases of constitutional disorder that were not detected by other technologies.
Together, these results demonstrate the robustness of OMKar for OGM-based karyotyping. OMKar is publicly available at https://github.com/siavashre/OMKar.
Competing Interest Statement
This work was supported by the National Institutes of Health grant R01GM114362. VB is a co- founder, serves on the scientific advisory board of Boundless Bio, Inc. (BBI), and Abterra Inc., and holds equity in both companies. BBI and Abterra were not involved in the research.
Funding Statement
This work was supported by the National Institutes of Health (NIH) grant R01GM114362.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study involved human participants and was approved by the Ethics Committees of the Medical University of Vienna (ethical code: 2229/2019) and Kecioren Teaching and Research Hospital (ethical code: 2012-KAEK-15/2083). Informed consent was obtained from all participants prior to their inclusion. Additionally, the study was conducted in accordance with the Declaration of Helsinki and received approval from the Institutional Review Boards of Western IRB-Copernicus Group (WCG) under study numbers 20203726 and 20212956. This approval included provisions for informed consent or waived authorization for the use of de-identified, banked samples for research purposes. All protected health information (PHI) was removed, and data were anonymized (coded and double-blinded) before accessioning for the study.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
The Chan Zuckerberg Initiative, Cold Spring Harbor Laboratory, the Sergey Brin Family Foundation, California Institute of Technology, Centre National de la Recherche Scientifique, Fred Hutchinson Cancer Center, Imperial College London, Massachusetts Institute of Technology, Stanford University, University of Washington, and Vrije Universiteit Amsterdam.