PT - JOURNAL ARTICLE
AU - Burch, Myson
AU - Bose, Aritra
AU - Dexter, Gregory
AU - Parida, Laxmi
AU - Drineas, Petros
TI - MaSk-LMM: A Matrix Sketching Framework for Linear Mixed Models in Association Studies
AID - 10.1101/2023.11.13.23298469
DP - 2023 Jan 01
TA - medRxiv
PG - 2023.11.13.23298469
4099 - http://medrxiv.org/content/early/2023/11/13/2023.11.13.23298469.short
4100 - http://medrxiv.org/content/early/2023/11/13/2023.11.13.23298469.full
AB - Linear mixed models (LMMs) have been widely used in genome-wide association studies (GWAS) to control for population stratification and cryptic relatedness. Unfortunately, estimating LMM parameters is computationally expensive, necessitating large-scale matrix operations to build the genetic relatedness matrix (GRM). Over the past 25 years, Randomized Linear Algebra has provided alternative approaches to such matrix operations by leveraging matrix sketching, which often results in provably accurate, fast, and efficient approximations. We leverage matrix sketching to develop a fast and efficient LMM method called Matrix-Sketching LMM (MaSk-LMM) by sketching the genotype matrix to reduce its dimensions and speed up computations. Our framework comes with both theoretical guarantees and strong empirical performance compared to the current state of the art.
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: PD and MB were partially supported by NSF 10001674, NSF 10001225, an IBM Faculty Award to PD, and an NSF GRFP to MB. AB and LP were supported by IBM Research.
Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes.
The details of the IRB/oversight body that provided approval or exemption for the research described are given below: Data analysis was performed under UK Biobank application 50658 using existing publicly available and deidentified data and was IRB exempt.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes.
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes.
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes.