RT Journal Article SR Electronic T1 MaSk-LMM: A Matrix Sketching Framework for Linear Mixed Models in Association Studies JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2023.11.13.23298469 DO 10.1101/2023.11.13.23298469 A1 Burch, Myson A1 Bose, Aritra A1 Dexter, Gregory A1 Parida, Laxmi A1 Drineas, Petros YR 2023 UL http://medrxiv.org/content/early/2023/11/13/2023.11.13.23298469.abstract AB Linear mixed models (LMMs) have been widely used in genome-wide association studies (GWAS) to control for population stratification and cryptic relatedness. Unfortunately, estimating LMM parameters is computationally expensive, necessitating large-scale matrix operations to build the genetic relatedness matrix (GRM). Over the past 25 years, Randomized Linear Algebra has provided alternative approaches to such matrix operations by leveraging matrix sketching, which often results in provably accurate fast and efficient approximations. We leverage matrix sketching to develop a fast and efficient LMM method called Matrix-Sketching LMM (MaSk-LMM) by sketching the genotype matrix to reduce its dimensions and speed up computations. Our framework comes with both theoretical guarantees and a strong empirical performance compared to current state-of-the-art.Competing Interest StatementThe authors have declared no competing interest.Funding StatementPD and MB were partially supported by NSF 10001674, NSF 10001225, an IBM Faculty Award to PD, and an NSF GRFP to MB. AB and LP were supported by IBM Research.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Data analysis was performed under UK Biobank application 50658 using existing publicly available and deidentified data and was IRB exempt. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesData analysis was performed under UK Biobank application 50658 using existing publicly available and deidentified data and was IRB exempt.