Data Availability
Two RNA-seq datasets and two DNA microarray datasets from lung cancer patients were analyzed in this study, including a Caucasian RNA-seq dataset from TCGA (https://www.cancer.gov/tcga), an Asian RNA-seq dataset from Gene Expression Omnibus (GEO) with the accession number GSE40419, an Asian microarray dataset from GEO with the accession number GSE19804 and a Caucasian microarray dataset from GEO with the accession number GSE10072. In addition, we analyzed a GSE34450 microarray dataset of gene expression from small airway epithelium and large airway epithelium of 50 healthy nonsmokers and 71 healthy smokers. Also, two single-cell RNA sequencing (scRNA-seq) datasets available in GEO with accession numbers GSE12296011 and GSE13139112 were downloaded and analyzed.