PT - JOURNAL ARTICLE AU - Albright, Jack AU - Mick, Eran AU - Sanchez-Guerrero, Estella AU - Kamm, Jack AU - Mitchell, Anthea AU - Detweiler, Angela M. AU - Neff, Norma AU - Tsitsiklis, Alexandra AU - Serpa, Paula Hayakawa AU - Ratnasiri, Kalani AU - Havlir, Diane AU - Kistler, Amy AU - DeRisi, Joseph L. AU - Pisco, Angela Oliveira AU - Langelier, Charles R. TI - A 2-Gene Host Signature for Improved Accuracy of COVID-19 Diagnosis Agnostic to Viral Variants AID - 10.1101/2022.01.06.21268498 DP - 2022 Jan 01 TA - medRxiv PG - 2022.01.06.21268498 4099 - http://medrxiv.org/content/early/2022/01/07/2022.01.06.21268498.short 4100 - http://medrxiv.org/content/early/2022/01/07/2022.01.06.21268498.full AB - The continued emergence of SARS-CoV-2 variants is one of several factors that may cause false negative viral PCR test results. Such tests are also susceptible to false positive results due to trace contamination from high viral titer samples. Host immune response markers provide an orthogonal indication of infection that can mitigate these concerns when combined with direct viral detection. Here, we leverage nasopharyngeal swab RNA-seq data from patients with COVID-19, other viral acute respiratory illnesses and non-viral conditions (n=318) to develop support vector machine classifiers that rely on a parsimonious 2-gene host signature to predict COVID-19. Optimal classifiers achieve an area under the receiver operating characteristic curve (AUC) greater than 0.9 when evaluated on an independent RNA-seq cohort (n=553). We show that a classifier relying on a single interferon-stimulated gene, such as IFI6 or IFI44, measured in RT-qPCR assays (n=144) achieves AUC values as high as 0.88. Addition of a second gene, such as GBP5, significantly improves the specificity compared to other respiratory viruses. The performance of a clinically practical 2-gene RT-qPCR classifier is robust across common SARS-CoV-2 variants, including Omicron, and is unaffected by cross-contamination, demonstrating its utility for improving accuracy of COVID-19 diagnostics.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was supported by the UCSF Clinical and Translational Science Institute Catalyst Program, Chan Zuckerberg Biohub and the National Heart, Lung, and Blood Institute (1K23HL138461-01A1).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:University of California San Francisco (UCSF) IRB, protocol #17-24056I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesGene counts for all UCSF samples have been deposited under NCBI GEO accession GSE188678. The New York dataset can be obtained according to the Data Availability statement in the original publication12. Code for RNA-seq and qPCR SVM classifier development and validation is available at: https://github.com/czbiohub/Covid-Host-Classifier-Code. https://github.com/czbiohub/Covid-Host-Classifier-Code