Potential Blood Transfusion Adverse Events Can be Found in Unstructured Text in Electronic Health Records using the “Shakespeare Method” ============================================================================================================================================ * Roselie A Bright * Summer K Rankin * Katherine Dowdy * Sergey V Blok * Susan J Bright-Ponte * Lee Anne Palmer ## ABSTRACT **Background** Text in electronic health records (EHRs) and big data tools offer the opportunity for surveillance of adverse events (patient harm associated with medical care) (AEs) in the unstructured notes. Writers may explicitly state an apparent association between treatment and adverse outcome (“attributed”) or state the simple treatment and outcome without an association (“unattributed”). We chose the case of transfusion adverse events (TAEs) and potential TAEs (PTAEs) because real dates were obscured in the study data, and new TAE types were becoming recognized during the study data period. **Objective** Develop a new method to identify attributed and unattributed potential adverse events using the unstructured text of EHRs. **Methods** We used EHRs for adult critical care admissions at a major teaching hospital, 2001-2012. We formed a transfusion (T) group (21,443 admissions treated with packed red blood cells, platelets, or plasma), excluded 2,373 ambiguous admissions, and formed a comparison (C) group of 25,468 admissions. We concatenated the text notes for each admission, sorted by date, into one document, and deleted replicate sentences and lists. We identified statistically significant words in T vs. C. T documents were filtered to those words, followed by topic modeling on the T filtered documents to produce 45 topics. For each topic, the three documents with the maximum topic scores were manually reviewed to identify events that occurred shortly after the first transfusion; documents with clear alternative explanations for heart, lung, and volume overload problems (e.g., advanced cancer, lung infection) were excluded. We also reviewed documents with the most topics, as well as 20 randomly selected T documents without alternate explanations. **Results** Topics centered around medical conditions. The average number of significant topics was 6.1. Most PTAEs were not attributed to transfusion in the notes. Admissions with a top-scoring cardiovascular topic (heart valve repair, tapped pericardial effusion, coronary artery bypass graft, heart attack, or vascular repair) were more likely than random T admissions to have at least one heart PTAE (heart rhythm changes or hypotension, proportion difference = 0.47, p = 0.022). Admissions with a top-scoring pulmonary topic (mechanical ventilation, acute respiratory distress syndrome, inhaled nitric oxide) were more likely than random T admissions (proportion difference = 0.37, p = 0.049) to have at least one lung PTAE (hypoxia, mechanical ventilation, bilateral pulmonary effusion, or pulmonary edema). **Conclusions** The “Shakespeare Method” could be a useful supplement to AE reporting and surveillance of structured EHR data. Future improvements should include automation of the manual review process. KEYWORDS * Electronic health records * big data * patient harm * patient safety * public health * product surveillance, postmarketing * natural language processing * proof of concept study * critical care * blood transfusion * humans ## INTRODUCTION Avoidable patient harm continues to be a significant problem [1]. To learn of patient harm known as adverse events (AEs) related to products regulated by FDA, FDA relies on spontaneous reports from manufacturers, healthcare providers, and the general public. Published deficiencies of these reports [2-10], include nonstatistical representativeness of harm and problems. Now that electronic healthcare records (EHRs) are very common [11] and often more informative than billing codes from payment claims [7, 12, 13], we have an opportunity to leverage them for automated surveillance of patient harm [3, 7, 14, 15]. We had two inspirations for naming the method after Shakespeare: * In the play [16], Macbeth is surprised by an attack on his castle by soldiers camouflaged by trees, even though he had been warned that his downfall would come when the woods moved. * Scholars have been using word-frequency methods to discuss the true authorship of works during Shakespeare’s time [17]. ### EHRs for Postmarketing Surveillance Many methods for finding prespecified AEs in text [6, 7, 9, 18-40] rely on defining possible AEs in advance. Crucially, events can be described in text but not necessarily attributed to being medical care AEs [14, 25, 41], we wanted to develop an unstructured method that would identify them. There are many challenges to automated use of EHRs: * Diagnosis codes may be “invalid, insensitive or non-specific” [20]. * “Often the notes contain medical and non-medical abbreviations, acronyms, numbers and misspelled words, which make it difficult to recognize the critical information in the notes. In other words, certain types of information such as ADEs [adverse drug events], indications, and signs and symptoms are harder to detect than other information such as drug names” [24]. * Medical entities in EHRs notes “can span across multiple words” [24]. * “… there is a lot of ambiguity among relevant named entities. Depending upon the context, the same exact phrase can be an ADE, indication, or a sign and symptom” [24]. * Periods do not always indicate end of sentence (“Dr.”, “1.23”, etc.) [24]. * “…notes are frequently ungrammatical and are often inconsistently formatted. Ambiguity is common: MS, for example, can mean mitral stenosis or multiple sclerosis” [12]. * EHRs are “…subject to access restrictions…” [6]. * “…[N]ot all events and outcomes are consistently captured …” [15]. * We observed that different vocabulary was used by different medical specialties, nurses, and other health care providers. We used the Medical Information Mart for Intensive Care III (MIMIC-III) [42, 43] because it is available to scientists with human subjects research training. MIMIC III focuses on critical care in a major Boston teaching hospital. A published report using MIMIC-III noted: * “…several sentence segmentation tools available in popular NLP [natural language processing] toolkits, such as NLTK31 and spaCy, were tested and did not work well in clinical notes. In clinical notes, sentences do not always end with regular punctuation marks such as a period or question mark. More specifically, both regular punctuation marks and newline characters can serve as sentence breakers; however, newline characters can also be used for text wrap. Moreover, enumeration-like and list-like formats are also common in clinical notes, especially for physical exam and list of medications” [36]. Many medical care AEs occur at higher frequency in hospital critical care settings and are related to complex illnesses, invasive procedures, and relatively long lists of treatments [44, 45]. ### Selection of Case of Blood Transfusion We decided to compare critical care patient admissions that involved blood transfusion (T) vs. control (C) admissions that had no transfusion events. An earlier version of the dataset showed a higher risk of near-term mortality for patients receiving red blood cell transfusion compared to non-transfused patients [46]. During the time period covered by the dataset, the transfusion research community recognized new transfusion adverse event (TAE) types—transfusion-related acute lung injury (TRALI) and transfusion-associated circulatory overload (TACO)—that prompted new guidelines to reduce the use of transfusion [47]. At the same time, far fewer reports were coming to FDA than would have been expected, considering the level of professional concern [48-50]. ### Study Objective Our objective was to develop a method of using EHRs notes to find recognized and unrecognized potential TAEs (PTAEs), which incidentally might also find other anomalies. We wanted our method to operate in the setting of the above-noted challenges. ## METHODS ### Pre-processing We used EHRs for critical care admissions within an adult hospital, Beth Israel Deaconess Medical Center, Boston, MA. Massachusetts Institute of Technology (MIT) worked with the hospital to process EHRs from 2001 to 2012, including unstructured notes, into the MIMIC III dataset that is publicly available to those meeting certification requirements. The research was designated not human subjects research by the FDA Institutional Review Board under Code of Federal Regulations Title 45 Part 46 [51]. We removed admissions with patient age < 16 years and admissions without notes from the total of 58,976 hospital admissions, resulting in 49,284 admissions. For each admission, we created one document by concatenating all available notes in chronological order. The notes in the MIMIC-III database contained duplicated paragraphs, sentences, and lists. These duplications distort statistical analyses of terms used and hamper manual review of the notes. We applied the Bloatectomy package to remove the duplicate text from each admission document [52]. We removed the personally identifying information mask string, lowercased the text, and retained punctuation, numerals, and stop words because they convey clinical information and are sometimes components of abbreviations. ### The Shakespeare Method The Shakespeare Method has three parts: * Convert each document into a vector of n-gram frequencies. * Create two groups of vectors: target and comparison. * Trim the n-gram vectors in the target group to those that are significant for the target group. * Apply topic analysis to the trimmed target group vectors. * Interpret the original documents with topic scores of interest. ### Create *N*-gram Vectors We used the collocation detection skip-gram method for extracting the *n*-grams with n = 1-5 consecutive words [53, 54] (see Figure 1a). Each document was vectorized using a bag of words representation where each dimension is represented by the frequency (count) of each *n*-gram (see Figure 1b), resulting in a set of 7,422,044 words. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/01/06/2021.01.05.21249239/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/F1) Figure 1. Word selection and topic modeling process with truncated examples. ### Create Two Groups We used the blood transfusion (T, 21,443 admissions) and comparison (C, 25,468 admissions) groups described in prior work [55] (see Figure 1c). ### Feature Selection We selected words based on their ability to discriminate between T and C, via a combination of filter and embedded methods [56-60] resulting in 41,664 words (Figure 2). The T document vectors were reduced to include only the 41,664 words (see Figure 1.e for a truncated example). ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/01/06/2021.01.05.21249239/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/F2) Figure 2. Flowchart of the embedded-based and filter-based word selection processes. ### Topic Modeling Topic modeling (see Figure 1f-g) was performed by applying the Latent Dirichlet Allocation (LDA) model [61] to the trimmed document word vectors (Figure 1e) to find co-occurring words and group them into topics. The number of topics must be chosen *a priori*; for our analysis, we chose 45 topics based on 1 to 2 dozen known TAEs [62, 63], the clinical complexity of CCU patients [64], and the volume of data. This resulted in a matrix of scores for each word by each topic, which we refer to as word scores (see Figure 1f). An additional matrix shows the probability of fit for each topic (see Figure 1g); the topic document scores in each vector totals 1.0 (see Figure 3d). ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/01/06/2021.01.05.21249239/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/F3) Figure 3. Topic modeling results. a) Distribution of all maximum document topic scores for all T. b) Documents that have only one strong topic. c) Documents that have many weak topics. d) All topic scores for a single document that has multiple weak topics. e) Two documents had a score of 0.022 for every topic. ### Topic Interpretation To evaluate whether topics described PTAEs, we selected the following records for manual document review: the three top-scoring documents for each of the 45 topics (see Figures 1h, 3a-b), the 7 documents with the most topics with significant scores (≥0.03) (such as in Figure 3c), and 24 randomly selected documents from the T group. We abstracted events, observations, clinicians’ attributions of causality, and clinicians’ diagnoses, as well as their dates (where offered). Further abstractions and tabulations were made to protect patients’ confidentiality. We tested comparisons with the Fisher’s Exact Test [65]. ## RESULTS Despite the inclusion of 1- to 5-grams in the vectorization, the terms that we extracted during classification were unigrams. ### Distribution of Topic Document Scores A histogram of maximum topic scores (Figure 3a) shows the distribution of each document’s maximum (strongest) topic. There are few documents in this corpus with a high maximum topic probability score (right tail; Figure 3b). The left tail (Figure 3c) shows a small number of documents with a maximum topic probability score that is low, or less than 20%, suggesting these documents are comprised of many weak topics. Figure 3d illustrates this with the topic distribution of a single document from this left tail. The lowest maximum topic document score was 0.022. Two documents had topic document scores of 0.022 for every topic (Figure 3e). They each had only one short record: a brief electrocardiogram report. There was no strict relationship between top word score and the frequency distribution of document topic scores (see Figure 4). Table 1 shows the categories of maximum document topic scores per number of topics. It shows that if there is one topic, the score is over 0.50. As the number of topics increases, the maximum topic score declines. The average number of topics with a topic document score >0.03 was 6.1. The maximum topic document score was 0.994. View this table: [Table 1.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/T1) Table 1. Maximum document topic score for documents in relation to number of topics in a document. ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/01/06/2021.01.05.21249239/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/F4) Figure 4. Distribution of topic document scores and top word scores. ### Top Scoring Documents for each Topic For each topic, Table 2 shows the score for the top word, the top 20 words, top document score, and the distribution of documents by document score range. The rows are sorted by top document score. The maximum word score ranged from 26 to 91,911. The words with the top 20 scores included plain English words, clinical words, acronyms, shortened words, and misspellings. The maximum document score for a topic went as high as 0.994. The document scores were widely distributed. View this table: [Table 2.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/T2) Table 2. For each topic, score for the top word, top 20 words, top document score, and the distribution of documents by document score range. See Table 3 for summaries of 135 documents. As is expected when hyperparameters of the model are optimal, most (35) topics were “coherent,” in that the top documents had clear common themes within topics that were consistent with the lists of top 20 words in the topic. The coherent topics had higher top document scores and tended to be the maximum scoring topics. Among the least coherent topics, the tendency for documents was to have some other topic as the maximum-scoring topic. This is expected with LDA, as the words that don’t fit into a coherent topic will be allocated to separate ‘junk’ topics. View this table: [Table 3.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/T3) Table 3. Topic interpretations based on top 3 scoring documents in each topic and randomly selected documents. We present the 81 documents with T date information and without an obvious alternate explanation for the AE. Documents are the unit of analysis. Topics are grouped by similarity of interpretation (some did not have a coherent interpretation); the cardiovascular, lung failure, and volume overload topic groups are also grouped into a super PTAE topic group. “Codes” refers to billing codes. The tabulation of the presence/absence of the notes expected to have the most clinical information showed that 122 had a discharge summary, 66 had a nursing note, and 21 had a physician progress note. None of the documents attributed an adverse event to transfusion in the billing codes. New or worsening PTAEs occurring within a day or two of T were: * in the heart category: atrial fibrillation, tachycardia, bradycardia, other heart rhythm abnormalities, and hypotension * in the lung category: hypoxia, mechanical ventilation, bilateral pleural effusions, pulmonary edema * in the volume category: edema, diuresis therapy, acute kidney failure * fever or chills in the absence of other infection evidence Many documents (40) could not be evaluated for TAEs because either the dates of transfusion were missing or there was no identified treatment when transfusion could be presumed. For others, there was a clear alternate reason for heart or lung problems: advanced cancer (7), TTP present on admission (1), liver failure (1), and lung infection (1). Out of the remaining 85 documents with transfusion data, 52 had evidence of a PTAEs; the most common were heart PTAEs (35) and lung PTAEs (33), while non-infection-related fever or chills (12) and fluid overload (12) were less common. A few documents explicitly considered transfusion as causes of AEs: in topic 30 blood disease, one attributed DIC to transfusion, another document in topic 30 listed but discarded the possibility of TRALI or TACO, a document in topic 3 (bone trauma from MVA) proposed PTAE, and a document in topic 40 attributed a drop in platelets to transfusion. In two documents, the PTAEs were attributed to contrast (topic 37 kidney failure), a brand name for metronidazole (topic 38 colon problem), and surgery (3 bone trauma from MVA). Documents with transfusion timing but no apparent TAE were in the following topics: 10 (one of the mechanical ventilation topics), 2 (esophageal varices banding), 7 (spine surgery), 18 (GI bleeding), 31, and 8. For 10 documents, separate transfusion and PTAE codes were present but were not conceptually linked. We read 24 randomly selected documents to obtain 20 that did not have advanced cancer, cirrhosis, or severe lung trauma. They are summarized at the bottom of Table 3. The documents in the cardiovascular topic group were more likely than the random group to have any of the heart PTAEs (proportion difference= 0.47; p= 0.022). The analogous analysis for 14 documents in the lung failure topic group showed a higher rate of any lung PTAEs (proportion difference= 0.37; p= 0.049). Table 4 depicts characteristics of the eight documents that had 13 or 14 topics. Their document topic scores were distributed across a wide number of topics, and the notes described a large number of medical challenges to the patients. All of these documents had both discharge summaries and nurse progress notes. One physician wrote that the patient developed alloantibodies and had a delayed transfusion reaction. None of the billing codes linked transfusion to an adverse event, and in two records, the codes included an outcome code. All eight documents provided dates of transfusion, including three for which cancer was the more likely cause of the AE. Of the remaining five, three had pulmonary PTAEs: View this table: [Table 4.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/T4) Table 4. Topic interpretations based on the three top-scoring documents with 13 or 14 topics. “Codes” refers to billing codes. “NA”: not applicable. * The document with all three types of PTAE had only one topic with a score above 0.1, topic 42 heart attack, and the notes, but not codes, said the patient had delayed transfusion reaction. * The document with pulmonary and volume PTAEs had the following topics with scores ≥ 0.1: topic 42 heart attack, topic 24 TPA to lyse thrombus, topic 10 cirrhosis, and topic 1 x-ray confirmation of device placement. The notes attributed worsening acute kidney failure to an antibiotic. * The document with only pulmonary PTAE had the following topics with scores ≥ 0.1: topic 24 TPA to lyse thrombus, topic 10 mechanical ventilation, and topic 37 kidney failure. ## DISCUSSION The Shakespeare Method successfully identified PTAEs. The three top-scoring documents in cardiovascular topics (17 heart valve repair, 33 tapped pericardial effusion, 35 CABG, 42 heart attack, and 11 vascular repair) were associated with cardiovascular PTAE: atrial fibrillation, tachycardia, bradycardia, other heart rhythm abnormality, or hypotension, which are features of TAE [62, 63]. Mechanical ventilation and nitric oxide therapy (topics 9, 10, 16, and 26) were used to treat lung failure [66] which also was a topic (29 ARDS). The associated breathing PTAE (hypoxia, mechanical ventilation, bilateral pleural effusion, and pulmonary edema) are components of TRALI and TACO [62, 63]. Other PTAE that correspond with known TAE were also observed in the top 3 documents of topics: * features of the volume overload component of TACO (edema, acute renal failure, and diuresis) [63]. * a feature of hemolytic transfusion reaction and febrile non hemolytic transfusion reaction (fever without other signs of infection) [63]. ### Distribution of Topic Document Scores Incoherent topics had few or no documents with high topic document scores with most documents scoring at or close to zero (see example in Figure 5a). A coherent topic follows a similar shape in terms of distribution, but the range is much greater, as we can see in the x-axis of Figure 5b when compared to Figure 5a. The coherent topics got higher scores in many documents. ![Figure 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/01/06/2021.01.05.21249239/F5.medium.gif) [Figure 5.](http://medrxiv.org/content/early/2021/01/06/2021.01.05.21249239/F5) Figure 5. Distribution of document topics scores for two topics: a) topic 8, a noncoherent topic. b) topic 42, a coherent topic. ### Top Scoring Documents for each Topic Many topics were conditions that can be reasons for transfusion: * Anemia [64] * Heart attack [67] * Blood disease (including blood cancers, chemotherapy, bone marrow transplant, neutropenia, thrombocytopenia, and pancytopenia) [64, 68] * Major surgery, vascular occlusion or repair, and GI problem or bleeding [69] TPA to lyse thrombus, because antithrombotic treatment can cause bleeding [70]. These topics could be consequences of the reasons for transfusion: * Tapped pericardial effusion, because pericardial effusions can result from cancers, heart disease, aortic dissection, and other conditions [71] that prompt transfusion [72] * Past sternotomy, a consequence of heart surgery [73], which in turn is often a reason for transfusion [74]. * Pneumomediastinum could be caused by surgery, or tearing of the esophagus or trachea [75], which in turn could be reasons to transfuse [69]. * Skin breakdown can be a consequence of long-term bed rest [72, 76], which is generally associated with critical illness and anemia [64], which in turn prompts transfusion [64]. Some could be alternate reasons for PTAE: * Advanced cancer [77] * Liver disease [78] * Infection [79]. Others could be PTAE or sequelae of PTAE: * Mechanical ventilation [80] * Pneumomediastinum could be caused by mechanical ventilation [75], which in turn are known is a consequence of TAE [80, 81]. * Tracheostomy tube placed when long term mechanical ventilation is anticipated [82] * ARDS, which shares features (noncardiogenic pulmonary edema, and hypoxia) with TRALI [80], is also known as acute lung injury, and is treated with non-invasive or invasive ventilation [83]. * Permanent hemodialysis, indicating permanent kidney injury [84], which can result from hemolytic transfusion reactions [85] and is associated with volume overload [86], which is part of TACO [62]. ### Documents with multiple topics The high number of topics per document reflects the complexity of CCU patients. Multiple topics covering illnesses and procedures were expected for critically ill patients and were the norm for the vast majority of documents. The documents with 13 and 14 significant topics described many complex clinical problems consistent with the need for critical care. ### Unsupervised methods for surveillance of AEs in EHRs We observed that there was much more adverse event data than explicit discussion in the notes, and in turn more in the notes than in the diagnosis and procedure codes. Our prior analysis of diagnosis codes [55] demonstrated that in transfused vs. not transfused patients, there were some explicit TAE and, also, more frequent diagnoses that were similar to TAE (TRALI vs. breathing difficulty, TACO vs. acute kidney failure, etc.). None of the documents we manually reviewed for this study bore any explicit TAE diagnosis code. Our prior and current analyses demonstrate that effective surveillance could benefit from using unstructured text as well as codes. Our method was successful despite limitations of this dataset. The extent of records for each admission grew during the time that the data were collected because of the hospital’s policy of gradually adding more types of records to EHRs [87]. There was variation in the presence of nursing and physician progress notes in the examined records; which would not be present in the EHRs in systems that have long since become completely electronic. The presence of different types of records would logically have influenced the generated topics; for example, topic 1 (x-ray confirmation of device placement) depends on the presence of radiology reports. ### Resources The data processing and topic modeling steps used relatively little programming time and computing resources. The manual review was very person-time and clinical-expertise intensive and would benefit from automation. ## CONCLUSIONS Topic analysis of statistically significant words in T documents found records that indicate PTAEs, even if the clinician did not explicitly write a link between transfusion and PTAE. This method succeeded despite the presence of a wide variety of vocabulary (discipline-specific, context dependence, misspellings, multiple word words, acronyms, personal abbreviations, etc.) and formats (sentences, phrases, free lists, formatted lists, etc.) used in the text. The Shakespeare Method would likely generalize to other her notes and possibly other types of medical texts. The computing resources and tools are accessible and openly available. Their application to EHRs broadens the number of types of entities that could independently conduct surveillance of AEs. It will be useful to adapt natural language processing methods to automate the abstraction of the notes; the tools will need to be tailored to the various formats used in the notes by different disciplines and individual clinicians. The expansion of vocabulary and acronym lists will also be useful. Automation tools will help to understand how PTAEs are distributed within and among topics. ## Data Availability We used the Medical Information Mart for Intensive Care III (MIMIC-III) data available from [https://mimic.physionet.org/about/mimic/](https://mimic.physionet.org/about/mimic/). The administrators of this website control all data releases. ## Conflict of interest and disclaimer The research was done with FDA support and under contract HHSF223201510027B between FDA and Booz Allen Hamilton Inc. None of the authors have other relevant financial interests. The opinions are those of the authors and do not represent official policy of either the FDA or Booz Allen Hamilton. ## ACKNOWLEDGEMENTS We are thankful for the enthusiastic support by our FDA and Booz Allen Hamilton supervisors, Department of Health and Human Services innovation programs (Ignite Accelerator and Data Science CoLab), and Alistair Johnson, DPhil, of the PhysioNet MIMIC-III program, Massachusetts Institute of Technology Laboratory for Computational Physiology. George Plopper, PhD, of Booz Allen Hamilton, provided project and consultation support. Many FDA colleagues offered ideas and feedback regarding the selection of the case of blood transfusion and the content of this paper. All authors had access to the data. All authors are responsible for the study topic, design, and interpretation. Dr. Bright, Dr. Rankin, Ms. Dowdy, and Dr. Blok are responsible for data processing and analysis. ## ABBREVIATIONS USED MORE THAN ONCE AE : adverse events ARDS : acute respiratory distress syndrome C : comparison group of admissions CABG : coronary artery bypass graft DIC : disseminated intravascular coagulation EHR : electronic healthcare record FDA : Food and Drug Administration GI : gastrointestinal MIMIC-III : Medical Information Mart for Intensive Care III MVA : motor vehicle accident NLP : natural language processing PTAE : potential transfusion adverse event T : group of admissions that received transfusion of blood components TACO : transfusion-associated circulatory overload TAE : transfusion adverse event TPA : tissue plasminogen activator TRALI : transfusion-related acute lung injury TTP : thrombotic thrombocytopenic purpura * Received January 5, 2021. * Revision received January 5, 2021. * Accepted January 6, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available for use under a CC0 license ## REFERENCES 1. 1.Brewer T, Colditz GA. Postmarketing surveillance and adverse drug reactions: current perspectives and future needs. JAMA. 1999 Mar 3;281(9):824–9. PMID:10071004. DOI: 10.1001/jama.281.9.824. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.281.9.824&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10071004&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000078804300033&link_type=ISI) 2. 2.Scott HD, Thacher-Renshaw A, Rosenbaum SE, et al. Physician reporting of adverse drug reactions: Results of the Rhode Island Adverse Drug Reaction Reporting Project. JAMA. 1990;263:1785–1788. PMID:2313850. DOI:10.1001/jama.1990.03440130073028. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.1990.03440130073028&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2313850&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1990CW22900024&link_type=ISI) 3. 3.Bright RA, Nelson RC. Automated support for pharmacovigilance: a proposed system. Pharmacoepidemiol Drug Saf. 2002; 11(2):121–125. PMID:11998536. DOI:10.1002/pds.684. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/pds.684&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11998536&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 4. 4.Samore MH, Evans RS, Lassen A, et al. Surveillance of medical device-related hazards and adverse events in hospitalized patients. JAMA. 2004; 291:325–34. PMID:14734595 DOI:10.1001/jama.291.3.325. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.291.3.325&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14734595&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188243100032&link_type=ISI) 5. 5.Bright RA. Strategy for surveillance of adverse drug events. Food Drug Law J. 2007; 62(3):605–615. PMID:17915403. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17915403&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 6. 6.Hoang T, Liu J, Pratt N, et al. Authenticity and credibility aware detection of adverse drug events from social media. Int J Med Inform. 2018 Dec;120:101–115. PMID:30409335. DOI:10.1016/j.ijmedinf.2018.09.002. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijmedinf.2018.09.002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30409335&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 7. 7.Classen D, Li M, Miller S, Ladner D. an electronic health record-based real-time analytics program for patient safety surveillance and improvement. Health Aff (Millwood). 2018 Nov;37(11):1805–1812. PMID:30395491. DOI:10.1377/hlthaff.2018.0728. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1377/hlthaff.2018.0728&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30395491&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 8. 8.Wang L, Rastegar-Mojarad M, Ji Z, et al. Detecting pharmacovigilance signals combining electronic medical records with spontaneous reports: A case study of conventional disease-modifying antirheumatic drugs for rheumatoid arthritis. Front Pharmacol. 2018 Aug 7;9:875. PMID:30131701. DOI:10.3389/fphar.2018.00875. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fphar.2018.00875&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30131701&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 9. 9.Alghamdi AA, Keers RN, Sutherland A, Ashcroft DM. Prevalence and nature of medication errors and preventable adverse drug events in paediatric and neonatal intensive care settings: A systematic review. Drug Saf. 2019 Dec;42(12):1423–1436. PMID:31410745. DOI:10.1007/s40264-019-00856-9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s40264-019-00856-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31410745&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 10. 10.Molina FJ, Rivera PT, Cardona A, et al. Adverse events in critical care: Search and active detection through the Trigger Tool. World J Crit Care Med. 2018 Feb 4;7(1):9–15. PMID:29430403. DOI:10.5492/wjccm.v7.i1.9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5492/wjccm.v7.i1.9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29430403&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 11. 11.Report to Congress: Update on the adoption of health information technology and related efforts to facilitate the electronic use and exchange of health information. Office of the National Coordinator for Health Information Technology, US Department of Health and Human Services. 2016 Feb. [https://www.healthit.gov/sites/default/files/Attachment\_1\_-\_2-26-16\_RTC\_Health\_IT\_Progress.pdf](https://www.healthit.gov/sites/default/files/Attachment\_1\_-\_2-26-16\_RTC_Health_IT_Progress.pdf). 12. 12.Taggart M, Chapman WW, Steinberg BA, et al. Comparison of 2 natural language processing methods for identification of bleeding among critically ill patients. JAMA Netw Open. 2018 Oct 5;1(6):e183451. PMID:30646240. DOI:10.1001/jamanetworkopen.2018.3451. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jamanetworkopen.2018.3451&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30646240&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 13. 13.Jin Y, Li F, Vimalananda VG, Yu H. Automatic detection of hypoglycemic events from the electronic health record notes of diabetes patients: empirical study. JMIR Med Inform. 2019 Nov 8;7(4):e14340. PMID:31702562. DOI:10.2196/14340. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/14340&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31702562&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 14. 14.Melton GB, Hripcsak G. Automated detection of adverse events using natural language processing of discharge summaries. J Am Med Inform Assoc. 2005; 12:448–457. PMID:15802475. DOI:10.1197/jamia.M1794. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1197/jamia.M1794&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15802475&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 15. 15.Patadia VK, Schuemie MJ, Coloma PM, et al. Can electronic health records databases complement spontaneous reporting system databases? a historical-reconstruction of the association of rofecoxib and acute myocardial infarction. Front Pharmacol. 2018 Jun 6;9:594. PMID:29928230. DOI:10.3389/fphar.2018.00594. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fphar.2018.00594&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29928230&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 16. 16.Shakespeare W. The Tragedy of Macbeth. CreateSpace Independent Publishing Platform. 2015 Jun. ISBN-10: 1514630583. ISBN-13: 978-1514630587. 17. 17.Hugh C, Kinney AF, eds. Shakespeare, Computers, and the Mystery of Authorship. Cambridge: Cambridge University Press, 2009. ISBN-13:978-0521516235. ISBN-10:0521516234. 18. 18.Young IJB, Luz S, Lone N. A systematic review of natural language processing for classification tasks in the field of incident reporting and adverse event analysis. Int J Med Inform. 2019; 132: 103971. PMID:31630063. DOI:10.1016/j.ijmedinf.2019.103971. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijmedinf.2019.103971&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31630063&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 19. 19.Fortenberry M, Odinet J, Shah P, et al. Development of an electronic trigger tool at a children’s hospital within an academic medical center. Am J Health Syst Pharm. 2019 Nov 13;76(Suppl_4):S107–S113. PMID:31724037. DOI:10.1093/ajhp/zxz222. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ajhp/zxz222&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31724037&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 20. 20.Zhou L, Siddiqui T, Seliger SL, et al. Text preprocessing for improving hypoglycemia detection from clinical notes - A case study of patients with diabetes. Int J Med Inform. 2019 Sep; 129:374–380. PMID:31445280. DOI:10.1016/j.ijmedinf.2019.06.020. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijmedinf.2019.06.020&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31445280&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 21. 21.Mesfin YM, Cheng A, Lawrie J, Buttery J. Use of routinely collected electronic healthcare data for postlicensure vaccine safety signal detection: a systematic review. BMJ Glob Health. 2019 Jul 8;4(4):e001065. PMID:31354969. DOI:10.1136/bmjgh-2018-001065. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NToiYm1qZ2giO3M6NToicmVzaWQiO3M6MTE6IjQvNC9lMDAxMDY1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDEvMDYvMjAyMS4wMS4wNS4yMTI0OTIzOS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 22. 22.Morel M, Bacry E, Gaiffas S, Guilloux A, Leroy F. ConvSCCS: convolutional self-controlled case series model for lagged adverse event detection. Biostatistics. 2019 Mar 8. pii: kxz003. PMID:30851046. DOI:10.1093/biostatistics/kxz003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biostatistics/kxz003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30851046&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 23. 23.Dandala B, Joopudi V, Devarakonda M. Adverse drug events detection in clinical notes by jointly modeling entities and relations using neural networks. Drug Saf. 2019 Jan;42(1):135–146. PMID:30649738. DOI:10.1007/s40264-018-0764-x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s40264-018-0764-x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30649738&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 24. 24.Wunnava S, Qin X, Kakar T, et al. Adverse drug event detection from electronic health records using hierarchical recurrent neural networks with dual-level embedding. Drug Saf. 2019 Jan;42(1):113–122. PMID:30649736. DOI:10.1007/s40264-018-0765-9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s40264-018-0765-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30649736&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 25. 25.Bagattini F, Karlsson I, Rebane J, Papapetrou P. A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records. BMC Med Inform Decis Mak. 2019 Jan 10;19(1):7. PMID:30630486. DOI:10.1186/s12911-018-0717-4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12911-018-0717-4&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30630486&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 26. 26.Rafter N, Finn R, Burns K, et al. Identifying hospital-acquired infections using retrospective record review from the Irish National Adverse Events Study (INAES) and European point prevalence survey case definitions. J Hosp Infect. 2019 Mar;101(3):313–319. PMID:30590090. DOI:10.1016/j.jhin.2018.12.011. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jhin.2018.12.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30590090&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 27. 27.Li F, Liu W, Yu H. Extraction of information related to adverse drug events from electronic health record notes: Design of an end-to-end model based on deep learning. JMIR Med Inform. 2018 Nov 26;6(4):e12159. PMID:30478023. DOI:10.2196/12159. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/12159&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30478023&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 28. 28.Jeong E, Park N, Choi Y, Park RW, Yoon D. Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals. PLoS One. 2018 Nov 21;13(11):e0207749. PMID:30462745. DOI:10.1371/journal.pone.0207749. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0207749&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30462745&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 29. 29.Santiso S, Perez A, Casillas A. Exploring joint AB-LSTM with embedded lemmas for adverse drug reaction discovery. IEEE J Biomed Health Inform. 2019 Sep;23(5):2148–2155. PMID:30403644. DOI:10.1109/JBHI.2018.2879744. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/JBHI.2018.2879744&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30403644&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 30. 30.Chu J, Dong W, He K, Duan H, Huang Z. Using neural attention networks to detect adverse medical events from electronic health records. J Biomed Inform. 2018 Nov; 87: 118–130. PMID:30336262. DOI:10.1016/j.jbi.2018.10.002. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jbi.2018.10.002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30336262&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 31. 31.Wang SV, Maro JC, Baro E, et al. Data mining for adverse drug events with a propensity score-matched tree-based scan statistic. Epidemiology. 2018 Nov;29(6):895–903. PMID:30074538. DOI:10.1097/EDE.0000000000000907. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/EDE.0000000000000907&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30074538&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 32. 32.Martins RR, Silva LT, Bessa GG, Lopes FM. Trigger tools are as effective as non-targeted chart review for adverse drug event detection in intensive care units. Saudi Pharm J. 2018 Dec;26(8):1155–1161. PMID:30532636. DOI:10.1016/j.jsps.2018.07.003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jsps.2018.07.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30532636&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 33. 33.Whalen E, Hauben M, Bate A. Time series disturbance detection for hypothesis-free signal detection in longitudinal observational databases. Drug Saf. 2018 Jun;41(6):565–577. PMID:29468602. DOI:10.1007/s40264-018-0640-8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s40264-018-0640-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29468602&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 34. 34.Zhou X, Douglas IJ, Shen R, Bate A. Signal detection for recently approved products: Adapting and evaluating selfcontrolled case series method using a US claims and UK electronic medical records database. Drug Saf. 2018 May;41(5):523–536. PMID:29327136. DOI:10.1007/s40264-017-0626-y. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s40264-017-0626-y&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29327136&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 35. 35.Nydert P, Unbeck M, Härenstam KP, et al. Drug use and type of adverse drug events-identified by a trigger tool in different units in a swedish pediatric hospital. Drug Healthc Patient Saf. 2020 Jan 31; 12: 31–40. eCollection 2020. PMID:32099481. DOI:10.2147/DHPS.S232604. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/DHPS.S232604&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32099481&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 36. 36.Chen L, Gu Y, Ji X, e al. Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning. J Am Med Inform Assoc. 2020 Jan 1; 27(1): 56–64. PMID:31591641. DOI:10.1093/jamia/ocz141. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/jamia/ocz141&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31591641&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 37. 37.Ju M, Nguyen NTH, Miwa M, Ananiadou S. An ensemble of neural models for nested adverse drug events and medication extraction with subwords. J Am Med Inform Assoc. 2020 Jan 1; 27(1): 22–30. PMID:31197355. DOI:10.1093/jamia/ocz075. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/jamia/ocz075&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31197355&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 38. 38.Griffey RT, Schneider RM, Todorov AA. Adverse events present on arrival to the emergency department: The ED as a dual safety net. Jt Comm J Qual Patient Saf. 2020 Apr; 46(4): 192–198. PMID:32007399. DOI:10.1016/j.jcjq.2019.12.003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jcjq.2019.12.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32007399&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 39. 39.Pandya AD, Patel K, Rana D, et al. Global Trigger Tool: Proficient adverse drug reaction autodetection method in critical care patient units. Indian J Crit Care Med. 2020 Mar; 24(3): 172–178. PMID:32435095. DOI:10.5005/jp-journals-10071-23367. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5005/jp-journals-10071-23367&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32435095&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 40. 40.McIsaac DI, Hamilton GM, Abdulla K, et al. Validation of new ICD-10-based patient safety indicators for identification of in-hospital complications in surgical patients: A study of diagnostic accuracy. BMJ Qual Saf. 2020 Mar; 29(3):209–216. PMID:31439760. DOI:10.1136/bmjqs-2018-008852. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoicWhjIjtzOjU6InJlc2lkIjtzOjg6IjI5LzMvMjA5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDEvMDYvMjAyMS4wMS4wNS4yMTI0OTIzOS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 41. 41.de Vos MS, Hamming JF, Chua-Hendriks JJC, Marang-van de Mheen PJ. Connecting perspectives on quality and safety: patient-level linkage of incident, adverse event and complaint data. BMJ Qual Saf. 2019 Mar; 28(3): 180–189. PMID:30032125. DOI:10.1136/bmjqs-2017-007457. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoicWhjIjtzOjU6InJlc2lkIjtzOjg6IjI4LzMvMTgwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDEvMDYvMjAyMS4wMS4wNS4yMTI0OTIzOS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 42. 42.Johnson AEW, Pollard TJ, Shen L, et al. MIMIC-III, a freely accessible critical care database. Sci Data. 2016; 3: 160035. PMID:27219127. DOI:10.1038/sdata.2016.35. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/sdata.2016.35&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27219127&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 43. 43.MIMIC-III Critical Care Database. [https://mimic.physionet.org/about/mimic/](https://mimic.physionet.org/about/mimic/). 44. 44.Bates DW, Cullen DJ, Laird N, et al. Incidence of adverse drug events and potential adverse drug events implications for prevention. JAMA. 1995; 274: 29–34. PMID:7791255. DOI:10.1001/jama.1995.03530010043033. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.1995.03530010043033&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7791255&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1995RG23300027&link_type=ISI) 45. 45.Kane-Gill SL, Kirisci L, Verrico MM, Rothschild JM. Analysis of risk factors for adverse drug events in critically ill patients. Crit Care Med. 2012; 40(3): 823–828. PMID:22036859. DOI:10.1097/CCM.0b013e318236f473. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/CCM.0b013e318236f473&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22036859&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000300532800015&link_type=ISI) 46. 46.Dejam A, Malley BE, Feng M, et al. The effect of age and clinical circumstances on the outcome of red blood cell transfusion in critically ill patients. Crit Care. 2014; 18: 487. PMID:25175389. DOI:10.1186/s13054-014-0487-z. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13054-014-0487-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25175389&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 47. 47.Carson JL, Grossman BJ, Kleinman, S, et al. Red blood cell transfusion: a clinical practice guideline from the AABB. Ann Intern Med. 2012; 157: 49–58. PMID:22751760. DOI:10.7326/0003-4819-157-1-201206190-00429. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/0003-4819-157-1-201206190-00429&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22751760&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000307784900005&link_type=ISI) 48. 48.Holness L, Knippen MA, Simmons L, Lachenbruch PA. Fatalities caused by TRALI. Transfus Med Rev. 2004; 18(3):184–188. PMID:15248168. DOI:10.1016/j.tmrv.2004.03.004. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.tmrv.2004.03.004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15248168&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000222864600004&link_type=ISI) 49. 49.Menis M, Anderson SA, Forshee RA, et al. Transfusion-associated circulatory overload (TACO) and potential risk factors among the inpatient US elderly as recorded in Medicare administrative databases during 2011. Vox Sang. 2014 Feb; 106(2):144–52. PMID:23848234. DOI:10.1111/vox.12070. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/vox.12070&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23848234&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 50. 50.Menis M, Anderson SA, Forshee RA, et al. Transfusion-related acute lung injury and potential risk factors among the inpatient US elderly as recorded in Medicare claims data, during 2007 through 2011. Transfusion. 2014 Sep; 54(9): 2182–93. PMID:24673344. DOI:10.1111/trf.12626. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/trf.12626&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24673344&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 51. 51.Code of Federal Regulations Title 45 Part 46 Protection of Human Subjects, Subpart A—Basic HHS Policy for Protection of Human Research Subjects, §46.101 (b) (4). 2000 Oct 1. [https://www.govinfo.gov/content/pkg/CFR-2000-title45-vol1/pdf/CFR-2000-title45-vol1-part46.pdf](https://www.govinfo.gov/content/pkg/CFR-2000-title45-vol1/pdf/CFR-2000-title45-vol1-part46.pdf). 52. 52.Rankin SK, Bright R, Dowdy K. Bloatectomy (Version v0.0.12). Zenodo. 2020, June 26. DOI:10.5281/zenodo.3909030. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5281/zenodo.3909030&link_type=DOI) 53. 53.Mikolov T, Sutskever I, Chen K, et al. Distributed representations of words and phrases and their compositionality. ArXiv. 2013; 1310.4546. [https://arxiv.org/abs/1310.4546](https://arxiv.org/abs/1310.4546) and [https://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf](https://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf). 54. 54.Rehurek R, Sojka P. Software framework for topic modelling with large corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. 2010; 45–50. [http://is.muni.cz/publication/884893/en](http://is.muni.cz/publication/884893/en). 55. 55.Bright RA, Bright-Ponte SJ, Palmer LA, Blok S. Using diagnosis codes to identify blood transfusion adverse events in electronic health records. MedRxiv 2020 Dec 30:20218610. DOI:10.1101/2020.12.30.20218610. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1101/2020.12.30.20218610&link_type=DOI) 56. 56.Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3:1157–1182. [https://www.jmlr.org/papers/volume3/guyon03a/guyon03a.pdf](https://www.jmlr.org/papers/volume3/guyon03a/guyon03a.pdf). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1162/153244303322753616&link_type=DOI) 57. 57.Witten IH, Frank E, Hall MA, Pal CJ. Data Mining: Practical Machine Learning Tools and Techniques, 4th ed. Elsevier. 2016. ISBN-13:978-0128042915. ISBN-10:0128042915. 58. 58.Tang B, Kay S, He H. Toward optimal feature selection in naive Bayes for text categorization. ArXiv. 2016;1602:02850. [https://arxiv.org/abs/1602.02850](https://arxiv.org/abs/1602.02850). 59. 59.Marafino BJ, Boscardin WJ, Dudley RA. Efficient and sparse feature selection for biomedical text classification via the elastic net: Application to ICU risk stratification from nursing notes. J Biomed Inform. 2015;54:114–120. PMID:25700665. DOI:10.1016/j.jbi.2015.02.003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jbi.2015.02.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25700665&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 60. 60.Liu Y, Ju S, Wang J, Su C. A new feature selection method for text classification based on independent feature space search. Math Probl Eng. 2020; 2020:6076272. DOI:10.1155/2020/6076272. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1155/2020/6076272&link_type=DOI) 61. 61.Blei D, Ng A Jordan M. Latent dirichlet allocation. J Mach Learn Res. 2003;3:993–1022. [https://www.jmlr.org/papers/volume3/blei03a/blei03a.pdf](https://www.jmlr.org/papers/volume3/blei03a/blei03a.pdf). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1162/jmlr.2003.3.4-5.993&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000184926200015&link_type=ISI) 62. 62.International Society of Blood Transfusion Working Party on Haemovigilance. Proposed standard definitions for surveillance of non infectious adverse transfusion reactions: Incorporating correction to TRALI definition (as adopted June 2013). International Haemovigilance Network. 2011, July. [http://www.isbtweb.org/fileadmin/user_upload/files-2015/haemovigilance/definitions/Proposed%20definitions%202011%20surveillance%20non%20infectious%20adverse%20reactions%20haemovigilance%20incl%20TRALI%20correction%202013.pdf](http://www.isbtweb.org/fileadmin/user_upload/files-2015/haemovigilance/definitions/Proposed%20definitions%202011%20surveillance%20non%20infectious%20adverse%20reactions%20haemovigilance%20incl%20TRALI%20correction%202013.pdf). 63. 63.Sahu S, Hemlata, Verma A. Adverse events related to blood transfusion. Indian J Anaesth. 2014; 58(5): 543–551. PMID:25535415. DOI:10.4103/0019-5049.144650. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4103/0019-5049.144650&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25535415&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 64. 64.1. Juffermans NP, 2. Walsh TS Juffermans NP, Walsh TS. Introduction. IN: Juffermans NP, Walsh TS, eds. Transfusion in the Intensive Care Unit. Springer, New York. 2015; 1–4. ISBN:978-3-319-08735-1. DOI:10.1007/978-3-319-08735-1_1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-319-08735-1_1&link_type=DOI) 65. 65.Preacher, KJ, Briggs, NE. Calculation for Fisher’s exact test: An interactive calculation tool for Fisher’s exact probability test for 2 x 2 tables. Quantpsy. 2001, May. [http://quantpsy.org/fisher/fisher.htm](http://quantpsy.org/fisher/fisher.htm). 66. 66.1. Varon J. Varon J. Pulmonary disorders. In: Varon J. Handbook of Critical and Intensive Care Medicine, 3rd ed. Springer, Cham; 2016. ISBN:978-3-319-31605-5. DOI:10.1007/978-3-319-31605-5_13. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-319-31605-5_13&link_type=DOI) 67. 67.1. Juffermans NP, 2. Walsh TS, editors Krishnamoorthy P, Mukherjee D, Chatterjee S. Red blood cell transfusion trigger in cardiac disease. In: Juffermans NP, Walsh TS, editors. Transfusion in the Intensive Care Unit. New York: Springer; 2015;25–34. ISBN:978-3-319-08735-1. DOI:10.1007/978-3-319-08735-1_4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-319-08735-1_4&link_type=DOI) 68. 68.1. Juffermans NP, 2. Walsh TS, editors Ortega D, Sakr Y. Causes of anemia in critically ill patients. In: Juffermans NP, Walsh TS, editors. Transfusion in the Intensive Care Unit. New York: Springer; 2015;5–12. ISBN:978-3-319-08735-1. DOI:10.1007/978-3-319-08735-1_2. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-319-08735-1_2&link_type=DOI) 69. 69.1. Simon TL, 2. Snyder EL, 3. Solheim BG, et al., editors Carson JL, Hébert P. Anemia and red blood cell transfusion. In: Simon TL, Snyder EL, Solheim BG, et al., editors. Rossi’s Principles of Transfusion Medicine, 4th ed. Blackwell Publishing Ltd; 2009;131–148. ISBN-10:1405175885. ISBN-13:978-1405175883. 70. 70.1. Simon TL, 2. Snyder EL, 3. Solheim BG, et al., editors Raife TJ, Rose JS, Lentz SR. Bleeding from acquired coagulation defects and antithrombotic therapy. In: Simon TL, Snyder EL, Solheim BG, et al., editors. Rossi’s Principles of Transfusion Medicine, 4th ed. Blackwell Publishing Ltd; 2009;376–390. ISBN-10:1405175885. ISBN-13:978-1405175883. 71. 71.Snyder MJ, Bepko J, White M. Acute pericarditis: diagnosis and management. Am Fam Physician. 2014 Apr 1;89(7):553–560. PMID:24695601. [https://www.aafp.org/afp/2014/0401/p553.html](https://www.aafp.org/afp/2014/0401/p553.html). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24695601&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 72. 72.1. Harmening, Denise, editor Kennedy MS, Wu HM. Transfusion therapy and transfusion in transplantation. In: Harmening, Denise, editor. Modern Blood Banking and Transfusion Practices, 5th ed. Philadelphia: F.A. Davis Company; 2005;303–321. ISBN:9780803612488. 73. 73.Reser D, Caliskan E, Tolboom H, et al. Median sternotomy. In: Multimedia Manual of Cardio-Thoracic Surgery. 2015. [https://mmcts.org/tutorial/80](https://mmcts.org/tutorial/80). Online ISSN:1813-9175. DOI:10.1093/mmcts/mmv017. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/mmcts/mmv017&link_type=DOI) 74. 74.1. Juffermans NP, 2. Walsh TS, editors Murphy GJ, Patel NN, Sterne JAC. Red blood cell transfusion trigger in cardiac surgery. In: Juffermans NP, Walsh TS, editors. Transfusion in the Intensive Care Unit. New York: Springer; 2015;35–44. ISBN:978-3-319-08735-1. DOI:10.1007/978-3-319-08735-1_5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-319-08735-1_5&link_type=DOI) 75. 75.Banki F, Estrera AL, Harrison RG, et al. Pneumomediastinum: etiology and a guide to diagnosis and treatment. Am J Surg. 2013;206(6):1001–1006. PMID:24296102. DOI:10.1016/j.amjsurg.2013.08.009. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.amjsurg.2013.08.009&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24296102&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 76. 76.Mervis JS, Phillips TJ. Pressure ulcers: Pathophysiology, epidemiology, risk factors, and presentation. J Am Acad Dermatol. 2019 Oct;81(4):881–890. PMID:30664905. DOI:10.1016/j.jaad.2018.12.069. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jaad.2018.12.069&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30664905&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 77. 77.Varon J. Critical care oncology. In: Handbook of Critical and Intensive Care Medicine. Springer, Cham. 2016;243–262. ISBN:978-3-319-31605-5. DOI:10.1007/978-3-319-31605-5_11. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-319-31605-5_11&link_type=DOI) 78. 78.Goldberg DS, Fallon MB. The art and science of diagnosing and treating lung and heart disease secondary to liver disease. Clin Gastroenterol Hepatol. 2015;13(12):2118–2127. PMID:25934564 DOI:10.1016/j.cgh.2015.04.024. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.cgh.2015.04.024&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25934564&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 79. 79.Morris A. Heart-lung interaction via infection. Ann Am Thorac Soc. 2014 Jan;11(Suppl 1):S52–S56. PMID:24437407. DOI:10.1513/AnnalsATS.201306-157MG. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1513/AnnalsATS.201306-157MG&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24437407&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 80. 80.1. Simon TL, 2. Snyder EL, 3. Solheim BG, et al., editors Wallis JP, Sachs UJH. Transfusion-related acute lung injury. In: Simon TL, Snyder EL, Solheim BG, et al., editors. Rossi’s Principles of Transfusion Medicine, 4th ed. Blackwell Publishing Ltd; 2009;870–884. ISBN-10:1405175885. ISBN-13:978-1405175883. 81. 81.1. Simon TL, 2. Snyder EL, 3. Solheim BG, et al., editors Park YA, Brecher ME. Bacterial contamination of blood products. In: Simon TL, Snyder EL, Solheim BG, et al., editors. Rossi’s Principles of Transfusion Medicine, 4th ed. Blackwell Publishing Ltd; 2009;771–790. ISBN-10:1405175885. ISBN-13:978-1405175883. 82. 82.Vincent J-L. Topic 2: Resuscitation and acute respiratory failure. In: Critical Care Medicine: Churchill’s Ready Reference. Churchill Livingstone; 2009;5–34. ISBN-10:0080451365. DOI:10.1016/B978-0-08-045136-7.00002-9.83. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/B978-0-08-045136-7.00002-9.83&link_type=DOI) 83. 83.Krueger W, Ludman AJ. Acute respiratory distress syndrome. In: Core Knowledge in Critical Care Medicine. Berlin, Heidelberg: Springer; 2014;99–158. ISBN:978-3-642-54971-7. DOI:10.1007/978-3-642-54971-7_2. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/978-3-642-54971-7_2&link_type=DOI) 84. 84.Zhong J, Yang H-C, Fogo AB. A perspective on chronic kidney disease progression. Am J Physiol Renal Physiol. 2017 Mar 1;312(3):F375–F384. PMID:27974318. DOI:10.1152/ajprenal.00266.2016. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1152/ajprenal.00266.2016&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27974318&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 85. 85.1. Simon TL, 2. Snyder EL, 3. Solheim BG, et al., editors Davenport RD. Hemolytic transfusion reactions. In: Simon TL, Snyder EL, Solheim BG, et al., editors. Rossi’s Principles of Transfusion Medicine, 4th ed. Blackwell Publishing Ltd; 2009;811-825. ISBN-10:1405175885. ISBN-13:978-1405175883. 86. 86.Butcher BW, Liu KD. Fluid overload in AKI - Epiphenomenon or putative effect on mortality? Curr Opin Crit Care. 2012 Dec;18(6):593–598. PMID:23037878. DOI:10.1097/MCC.0b013e32835a1c44. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/MCC.0b013e32835a1c44&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23037878&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F01%2F06%2F2021.01.05.21249239.atom) 87. 87.Halamka J. Dispatch from the digital health frontier - November 2007. 2007 Nov 1, 2, 19, 20. [http://geekdoctor.blogspot.com/2007/11](http://geekdoctor.blogspot.com/2007/11).