Using Artificial Intelligence To Label Free-Text Operative And Ultrasound Reports For Grading Pediatric Appendicitis

Waseem Abu-Ashour; Sherif Emil; Dan Poenaru

doi:10.1101/2023.08.30.23294850

Abstract

Purpose Data science approaches personalizing pediatric appendicitis management are hampered by small datasets and unstructured electronic medical records (EMR). Artificial intelligence (AI) chatbots based on large language models (LLMs) can structure free-text EMR data. Here we compare data extraction quality between ChatGPT-4 and human data collectors.

Methods To train AI models to grade pediatric appendicitis preoperatively, several data collectors (medical students and research assistants) extracted detailed preoperative and operative data from 2100 children operated for acute appendicitis between 2014-2021. Collectors were trained and certified for the task based on satisfactory Kappa scores. ChatGPT-4 was prompted to structure free text from 103 random anonymized ultrasound and operative records in the dataset using the set variables and coding options, and to estimate the Pediatric Appendicitis Grade (PAG) from the operative report. A pediatric surgeon then adjudicated all data, identifying errors in each method.

Results Within the 44 ultrasound (42.7%) and 32 operative reports (31.1%) discordant in at least one field, 98% of the errors were found in the manual data extraction. The PAG was erroneously assigned manually in 29 patients (28.2%), and by ChatGPT-4 in 3 (2.9%). Across datasets, the use of the AI chatbot was able to avoid misclassification in 59.2% of the records including both reports and extracted data approx. 100x faster than manually.

Conclusion An AI chatbot significantly outperformed manual data extraction in accuracy for ultrasound and operative reports, and correctly assigned the PAG score. While wider validation is required and data safety concerns must be fully addressed, these novel AI tools show significant promise in improving the accuracy and efficiency of research data collection.

Highlights 1. What is known about this topic?

AI chatbots have several benefits and implications including healthcare uses.

2. What new information is contained in this article?

AI chatbot was proven to be more accurate when compared to human data extraction.

Introduction

Perforated appendicitis comprises approximately 25%–30% of appendicitis cases and has historically been considered a single disease entity [1]. Current clinical scoring systems for appendicitis, such as the Pediatric Appendicitis Score and the Alvarado score, focus on diagnosis of the disease, rather than its severity [2]. Attempts have also been made to achieve a standard definition for perforated appendicitis [3]. However, a recent review of the outcomes of perforated appendicitis revealed persistent significant variability in the outcomes of perforated appendicitis owing to lack of utilization of an evidence-based definition [4]. Previous work within our own team has protocolized the care of children with perforated appendicitis by developing a complication grade [5] and has shown that postoperative outcomes and resource utilization strongly correlate with increasing grade of perforated appendicitis [3].

Artificial intelligence (AI) and its subset of machine learning (ML) have demonstrated significant potential in pediatrics and in surgery [6]. Specifically for acute appendicitis, various AI methods have been explored for improving the accuracy of acute appendicitis diagnosis and predicting the need for surgery [7,8]. Clinical prediction tools (CPTs), using both patient data and details from patient encounters, have become an integral part of diagnosis and treatment processes, especially with ML approaches [9]. ML-based CPTs have shown to be superior in silico, such as in diagnosing suspected appendicitis in pediatric populations [10], due to their capability to handle complex data like electronic medical records (EMRs). However, their clinical applicability and validation in clinical settings remain underexplored [10]. The considerable increase in health data digitization over the past decade has seen a rise in multimodal data repositories, including electronic health text data, medical imaging, multiomics, and environmental data [11]. However, deriving robust real-world evidence from these data requires overcoming barriers like data recency, clinical depth, provenance, completeness, representativeness, and usability [12]. Conventionally, extracting clinical variables and outcomes from routinely collected EMRs necessitates significant pre-processing, laborious curation, and intensive manual patient chart reviews, posing a time and resource challenge [13].

Lately, the field of AI has seen an expansion of deep learning, an emergent AI branch, into numerous network architectures. While ML typically involves the use of statistical methods to learn from data, deep learning uses neural networks to learn from large datasets [14]. Several healthcare applications of deep learning are frequent today, such as computer vision for radiology and pathology, speech recognition, and natural language processing (NLP) [15]. AI’s transformative power in healthcare is demonstrated in its ability to analyze large volumes of data, informing treatment decisions, enhancing medical research, and facilitating tasks like early disease diagnosis, outcome predictions, and automation of routine tasks [16]. Chatbots represent AI-driven applications that mimic human dialogue, offering automated responses to user queries through the use of NLP [17]. They serve various roles including education, health assistance, and financial management [18]. Well-known examples of chatbots include Siri, Alexa and Google Assistant [19]. ChatGPT, recently appeared, has proven to be a useful chatbot. This AI system utilizes ML algorithms and NLP to generate human-like responses, functioning as a sophisticated chatbot [20,21]. Despite not being originally intended for healthcare applications, it exhibits potential in supporting patient queries and easing healthcare operations [22]. The automation of tasks like reliable and swift transcription of patient medical records allows medical professionals to dedicate more time to patient interactions [23]. In making medical reports, clinical trial documents, and other related documents more comprehensible for both patients and healthcare professionals, ChatGPT can serve as a summarizing tool which can potentially reduce the risk of errors in medical records [24,25]. ChatGPT’s ability to translate medical text between languages also contributes to improved communication and understanding among patients and healthcare providers [24,25].

In this pilot study, we compared clinical data extraction from free-text medical reports between an AI chatbot, ChatGPT-4, and trained human data extractors.

Methods

Study Design

This is a fully paired comparative accuracy study [26] to compare the accuracy of ChatGPT-4 and human data collectors.

Patient Characteristics

Our dataset included pediatric patients from the Montreal Children Hospital (MCH) between January 2014 - December 2021. Patient criteria for inclusion were children ≤ 18 years of age who underwent surgery for acute appendicitis and were confirmed to have acute appendicitis. Patients were excluded if they were <1 or = 18 years old at the time of operation, had incomplete medical records, or if no conclusive diagnosis or grade of perforation was possible from the operative report.

Instruments and Data Collection

A standard case report form was created in the secure web-based Research Electronic Data Capture (REDCap) software. Patient demographics were collected via the Outcome and Assessment Information Set (Oacis) platform, the McGill University Health Centre (MUHC) electronic health record platform. To develop an appendicitis severity prediction model, we extracted data from two report types: ultrasound (US) and operative reports.

Perforated Appendicitis Grade (PAG)

The PAG (Table 1) has been developed in the H.E. Beardmore Division of Pediatric Surgery based on an earlier validated appendicitis score [5]. It was used directly to assign a score of 0-5 in a selection of operative reports.

View this table:

Table 1: Perforated Appendicitis Grade (PAG) Definition And ChatGPT-4 Prompt Definitions

Determination of perforation and severity of appendicitis (OR report)

The operative reports served as the reference standard for determining appendiceal perforation and severity of appendicitis. If there was a contradiction between the operative and pathology reports regarding the presence of perforation, the pathology report was considered definitive.

Procedures

To train AI models to grade pediatric appendicitis preoperatively, several data collectors (medical students and research assistants) extracted detailed preoperative and operative data from 2100 children operated for acute appendicitis between January 2014 and December 2021. Human data collectors were trained to extract the necessary clinical data into our dataset and derive the PAG. For the current study, a random selection of 103 ultrasound and operative reports were tested as prompts for ChatGPT-4, in conjunction with the PAG. For this purpose, the PAG categories were expanded using an expert-derived vocabulary of pertinent medical terms and expressions extracted from actual operative reports. ChatGPT was prompted to structure the free text of both report types using set parameters representing the actual variables and codes in the dataset, and to estimate the PAG from the operative report (Supplemental File [A]). A pediatric surgeon then compared the human and AI-generated data, identifying all misclassified data within each group.

Variables and statistical analysis

The patient data used for this report was part of a larger dataset on pediatric patients with appendicitis. For this study we limited information to be collected to the US and the operative report. The data collected from the US report included: ultrasound diagnosis, appendix identification, appendiceal maximum diameter, appendix location, probe tenderness over appendix, appendix compressible, mesenteric fat stranding, fluid around appendix, fluid in pelvis, phlegmon or inflammatory mass, bowel thickening, presence of fecalith, intra-abdominal abscess, as well as the full US report text. The data collected from the operative report included: appendiceal necrosis, fecalith, intra-abdominal abscess, intraperitoneal fluid, peritonitis, presence of perforation, site of perforation (if mentioned), PAG recorded in report, intraoperative PAG mentioned in the report, an automatically calculated PAG within REDCap, and the full operative report text.

All de-identified information was exported into an Excel sheet. We calculated percentages and rates of discrepancy between human data extractors and ChatGPT-4. Data was organized and preliminary analyses were conducted using Microsoft Excel (version 16.74, 2023, Microsoft Corp.). Raw data was entered into Excel spreadsheets, and data cleaning was performed to check for discrepancies between human data extractors and ChatGPT-4. We calculated percentages and rate of errors between human data extractor information and ChatGPT-4 output.

Results

We compared the data between human data extractors and ChatGPT-4 in 103 records. The data collected included 13 variable fields from the US report and 9 fields from the operative report. The included 103 records were a subset of a larger multimodal dataset gathered for a ML appendicitis classification project. The entire dataset will contain 2300 records. Within this large dataset, missingness of information related to the US and operative reports varied widely. We ensured however that the 103 records selected for this analysis did not contain any missing information. The actual PAG distribution in these 103 records was: grade I - 24%, II - 12.5%, III - 11.5%, grade IV - 10%, and grade V - 42%. The correction made by ChatGPT-4 to human extractors across all records encompassing both types of reports was found in 61 records (59.2%).

US Reports

We compared 13 fields extracted from the US reports by human and AI extractors (Table 2). Figure 1 shows the US report human extractors read and the structured ChatGPT-4 output after using the prompt. The discordance between human data extractors and ChatGPT-4 in the US report features ranged from 0% - 12.6%. The highest discrepancies were in the final US diagnosis (12.6%), followed by presence of intra-abdominal abscess (11.6%), mesenteric fat stranding (7.8%) and fluid in the pelvis (7.8%). There were no discrepancies encountered in appendiceal compression. Based on the ground truth assessment by the expert, ChatGPT-4 was found to have successfully corrected the human data extractors in 65 individual fields within 44 records, yielding an overall 42.7% correction rate. The misclassification rate by Cahatgpt-4 was in only 9 individual fields within (8.7%) of the records.

View this table:

Table 2: Accuracy And Discrepancies In US Report Field By Data Extraction Method

Fig. 1: US Report & ChatGPT-4 Structured Data After Prompt

Operative Reports

Within the operative reports we compared 10 distinct fields (Table 3). Figure 2 shows the operative report human extractors read and the structured ChatGPT-4 output after using the prompt. The discrepancy between human data extractors and the ChatGPT-4 the operative report ranged from 0% - 41.7%. The highest discrepancy was in the final PAG (31%), followed by presence of peritonitis (26.2%), intra-operative PAG (20.4%) and intra-abdominal abscess (18.4%). Following validation by the expert surgeon, ChatGPT-4 corrected human data extractors in 133 individual fields, yielding a 31.1% overall correction rate. Importantly, the PAG was misclassified by the human collectors in 28% of the records, and only in 3% by ChatGPT-4. Figure 3 shows human vs ChatGPT-4 errors in each report.

View this table:

Table 3: Accuracy And Discrepancies In Operative Report Field By Data Extraction Method

Fig. 2: Operative Report & ChatGPT-4 Structured Data After Prompt

Fig. 3: Human And ChatGPT-4 Errors In Data Extraction

Time required

We compared the time saved using the AI chatbot for data extraction vs human data extractors. The latter required an average of 20 minutes to extract data from patient records into the secure web-based software (REDCap). For the 103 records, this totaled 33.3 hours. By comparison, when using ChatGT-4, taking into account copying and pasting the required information from the reports, alongside the prompts, into the chatbox, and generating the required output from ChatGPT-4, task completion for one record required on average 30 seconds - totalling therefore 50 minutes for 103 records. The AI chatbot was therefore almost 40 times faster than the human extractors in this extraction/summarization task. It is however worth noting that currently ChatGPT-4 is limited to only allowing 25 entries every 3 hours. One other limitation is the occasional need to refresh the prompt thread, as the chatbot seemed to get “fatigued” and not respect exactly the constraints listed in the prompt after approx. ten records.

Discussion

In this study we compared accuracy of data extracted by human data extractors, on detailed preoperative and operative data from the EMR of children operated for acute appendicitis, to ChatGPT-4 extraction data after specific prompts. The data was extracted from US and operative reports to estimate the PAG. After comparison between human data extractors and ChatGPT-4t to identify discrepancies, the latter was found to be more accurate in extracting data from both US and operative reports by 41.7% and 47.6%, respectively. Not only did it provide more accurate data extraction, but it made the process of data extraction 40 times faster.

To our knowledge, there are no other studies that directly compared free-text data extraction from the EMR between AI chatbots and human data extractors. Adamson et al [27] applied NLP to train, validate, and test the extraction of information from unstructured documents (e.g., clinician notes, radiology reports, lab reports, etc.) to output a set of structured variables required for real-world data analysis. The authors concluded that NLP enabled the extraction of retrospective clinical data from the EMR faster and more efficiently [27]. The authors did not however directly compare their model to human data extractors - their purpose was to show success in building an AI model that extracts accurate information from the EMR.

There are, however, several studies successfully documenting the use of AI chatbots in clinical settings to extract EMR information. Clinical text differs significantly from typical text used in general NLP, both in syntax and vocabulary [28]. As a result, the clinical NLP community often trains domain-specific models on clinical corpora, using language modeling strategies from the broader NLP community. However, in several such applications the performance gains were marginal compared to classical methods such as logistic regression [29,30]. Early studies using LLMs such as GPT-3 failed to show competitive results on biomedical NLP tasks [31,32]. One study developed a generic predictive model that covers observed medical conditions and medication uses [33]. This temporal model using recurrent neural networks was developed and applied to longitudinal time stamped EMR data. Encounter records (e.g. diagnosis codes, medication codes or procedure codes) were input into recurrent neural networks to predict diagnosis and medication categories for subsequent visits, showing improved accuracy compared to several baselines that are based on experts’ intuition about the dynamics of events in clinical settings [33]. Other groups have been able to predict the risk of 30-day readmission [34] and diagnose rare diseases [35] based on structured EMR data.

LLMs can also be used to generate accurate, relevant, comprehensive, and coherent answers to clinical questions based on hospital admission notes. One study demonstrated the accuracy, relevance, comprehensiveness, and coherence of the answers generated by AI chatbots (ChatGPT 3.5 and Claude) on a set of patient-specific questions [36]. Their results suggest that LLMs are a promising tool for patient-specific inquiries from clinical notes [36].

ChatGPT’s versatile nature and advanced NLP capabilities have made it a valuable tool across various domains, specifically in healthcare, education and scientific research. Some of the potential for ChatGPT use in healthcare include: (i) chatbots that can assist with patient triage, helping healthcare providers determine the urgency of a patient’s condition and the appropriate course of action [37]; (ii) medical diagnosis and treatment recommendations by analyzing patient data and symptoms [38], (iii) patient engagement and adherence - by providing personalized recommendations and reminders, helping patients stay on track with their treatment [39], (iv) clinical research and development through analysis of large amounts of clinical data, identifying patterns and trends that can be used to develop new treatments and interventions [40]. In the medical education and training sector: (i) AI has been beneficial in personalized learning by providing tailored recommendations in medical imaging to aid in diagnosing diseases such as surgical training and tutoring [41,42]. (ii) ChatGPT has the potential to assist with medical education, and potentially, clinical decision-making and knowledge development [43,44]. The application of ChatGPT in scientific research is multifaceted: (i) ChatGPT has been instrumental in transforming the way researchers interact with and interpret data [45]. (ii) ChatGPT is a powerful tool for hypothesis generation and testing, aiding researchers in conceiving new research questions and hypotheses[46]. Moreover, substantial progress has been achieved in discerning the extensive implications of ChatGPT in areas such as healthcare and education research, although these domains do present their unique challenges[38,47].

Limitations

There are several limitations to the current study. Our data is specific to pediatric appendicitis, therefore the findings may not be generalizable to other types of medical records or patient populations - the performance of both the AI chatbot and the human extractors might differ when dealing with other types of data. However, with the proper prompts the chatbot should be able to successfully extract structured data from other datasets and populations. Another limitation was the natural presence of missing and/or incorrect data in the EMR, which could potentially impact the accuracy of both extraction methods. Such data input gaps and errors would however impact both extraction methods similarly. Moreover, the data set used in this study was relatively small. It is possible that the results would be different if a larger data set were used. Furthermore, it’s important to note that the accuracy evaluation was conducted by a single individual - a senior pediatric surgeon. While this could potentially introduce a higher risk of bias, the evaluator’s expertise in the field helps mitigate this concern.

AI chatbot performance is naturally very dependent on the LLMs and the content they have been trained on, in particular the size of the medical corpus. The other key limitation is the quality of the prompts used, which is related to the rapidly expanding expertise in the new field dubbed “prompt engineering” [48].

Both human and AI performance could have been affected by the quality and specificity of the pre-defined extraction criteria. The large number of errors identified after the manual data extraction was particularly concerning, raising doubts on the adequacy of the on-task content training and certification, despite satisfactory Kappa scores.

Finally, it’s worth noting that the use of any AI, such as AI chatbots, in analyzing and extracting patient data raises important ethical and privacy concerns. To address these concerns, only de-identified data was provided to ChatGPT. Despite these limitations, the results of this study suggest that AI chatbots have the potential to be a valuable tool in a healthcare research setting.

In spite of the valuable contributions made by AI in healthcare, it is important to acknowledge and address the challenges and ethical implications arising from its application. This includes a multitude of factors, such as: (i) model reliability and precision, wherein the AI could generate erroneous or deceptive information; (ii) Inherent bias in AI models can lead to biased and inequitable information and treatment suggestions, and the efficiency of ChatGPT may also be affected by dataset bias, where the quality and diversity of the data used for training could influence the model’s performance; (iii) overdependence on AI, consequently diminishing critical thinking and independent problem-solving skills; (iv) privacy issues due to chatbots’ access to extensive user data, triggering concerns over privacy and data protection; and (v) factual integrity, since LLMs can “hallucinate”, generating inconsistent or erroneous texts. It is thus crucial to verify the factual correctness of the generated content [47]. AI models also grapple with issues related to understanding context, ethical reasoning, conversational context, and generation of visual content. ChatGPT, while impressive, grapples with several challenges such as handling inappropriate requests, adjusting to user expertise levels, delivering personalized feedback, and dealing with multilingual queries and non-literal language. There are also ethical quandaries related to data privacy and security, intellectual property rights, transparency, accountability, and susceptibility to adversarial attacks. The potential influence of ChatGPT on human behavior prompts questions about personal autonomy. Bias and discrimination issues are also prevalent, as AI language models like ChatGPT are trained on extensive datasets which could inadvertently include biases, stereotypes, and prejudiced language. This could result in the model unknowingly generating offensive or harmful responses, thereby perpetuating these biases. Mitigating this problem necessitates refining the training data, enhancing the model’s structure, and implementing guidelines to ensure fairness and unbiased results[25,49,50]. Approaching these ethical considerations and challenges requires a forward-thinking attitude from developers, researchers, and the larger AI community. Collaborative efforts to recognize, comprehend, and resolve potential issues will ensure that AI language models like ChatGPT are developed and utilized in a responsible manner, maximizing their advantages while minimizing potential risks.

Conclusion

Our study showed that ChatGPT outperformed manual data extraction for ultrasound and operative reports of acute appendicitis in accuracy, and correctly assigned the PAG score. Despite significant limitations in the clinical use of similar AI chatbots, these novel AI tools already show promise in rendering research tasks more accurate and significantly faster.

Data Availability

All data produced in the present study are available upon reasonable request to the authors

Footnotes

Financial Support Statement: This publication is based on data from a larger ongoing project (“Augmented Intelligence to Support Shared Decision-Making in the Care of Children with Appendicitis”), which is supported through a project grant of the Canadian Institutes of Health Research (CIHR).

References

[1].↵
Cheong LHA, Emil S. Outcomes of pediatric appendicitis: an international comparison of the United States and Canada. JAMA Surg 2014;149:50–5. doi:10.1001/jamasurg.2013.2517.
OpenUrl CrossRef PubMed
[2].↵
Samuel M. Pediatric appendicitis score. J Pediatr Surg 2002;37:877–81. doi:10.1053/jpsu.2002.32893.
OpenUrl CrossRef PubMed Web of Science
[3].↵
St. Peter SD, Sharp SW, Holcomb GW, Ostlie DJ. An evidence-based definition for perforated appendicitis derived from a prospective randomized trial. J Pediatr Surg 2008;43:2242–5. doi:10.1016/j.jpedsurg.2008.08.051.
OpenUrl CrossRef PubMed Web of Science
[4].↵
Rogers AP, Zens TJ, Leys CM, Nichol PF, Ostlie DJ. A call for a standardized definition of perforated appendicitis. J Pediatr Surg 2017;52:89–92. doi:10.1016/j.jpedsurg.2016.10.026.
OpenUrl CrossRef
[5].↵
Yousef Y, Youssef F, Homsy M, Dinh T, Pandya K, Stagg H, et al. Standardization of care for pediatric perforated appendicitis improves outcomes. J Pediatr Surg 2017;52:1916–20. doi:10.1016/j.jpedsurg.2017.08.054.
OpenUrl CrossRef
[6].↵
Lonsdale H, Jalali A, Ahumada L, Matava C. Machine Learning and Artificial Intelligence in Pediatric Research: Current State, Future Prospects, and Examples in Perioperative and Critical Care. J Pediatr 2020;221S:S3–10. doi:10.1016/j.jpeds.2020.02.039.
OpenUrl CrossRef
[7].↵
Reismann J, Romualdi A, Kiss N, Minderjahn MI, Kallarackal J, Schad M, et al. Diagnosis and classification of pediatric acute appendicitis by artificial intelligence methods: An investigator-independent approach. PLoS One 2019;14:e0222030. doi:10.1371/journal.pone.0222030.
OpenUrl CrossRef
[8].↵
Akmese OF, Dogan G, Kor H, Erbay H, Demir E. The Use of Machine Learning Approaches for the Diagnosis of Acute Appendicitis. Emerg Med Int 2020;2020:7306435. doi:10.1155/2020/7306435.
OpenUrl CrossRef
[9].↵
Adams ST, Leveson SH. Clinical prediction rules. BMJ 2012;344:d8312. doi:10.1136/bmj.d8312.
OpenUrl FREE Full Text
[10].↵
Marcinkevics R, Reis Wolfertstetter P, Wellmann S, Knorr C, Vogt JE. Using Machine Learning to Predict the Diagnosis, Management and Severity of Pediatric Appendicitis. Front Pediatr 2021;9:662183. doi:10.3389/fped.2021.662183.
OpenUrl CrossRef
[11].↵
Murdoch TB, Detsky AS. The inevitable application of big data to health care. JAMA 2013;309:1351–2. doi:10.1001/jama.2013.393.
OpenUrl CrossRef PubMed Web of Science
[12].↵
Shore C, Gee AW, Kahn B, Forstag EH, National Academies of Sciences Engineering, Medicine, et al. Barriers and Disincentives to the Use of Real-World Evidence and Real-World Data. Examining the Impact of Real-World Evidence on Medical Product Development: Proceedings of a Workshop Series, National Academies Press (US); 2019.
[13].↵
Birnbaum B, Nussbaum N, Seidl-Rathkopf K, Agrawal M, Estevez M, Estola E, et al. Model-assisted cohort selection with bias analysis for generating large-scale cohorts from the EHR for oncology research. arXiv [csCY] 2020.
[14].↵
Sharifani K, Amini M. Machine Learning and Deep Learning: A Review of Methods and Applications 2023.
[15].↵
Chai J, Zeng H, Li A, Ngai EWT. Deep learning in computer vision: A critical review of emerging techniques and application scenarios. Machine Learning with Applications 2021;6:100134. doi:10.1016/j.mlwa.2021.100134.
OpenUrl CrossRef
[16].↵
Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future Healthc J 2019;6:94–8. doi:10.7861/futurehosp.6-2-94.
OpenUrl Abstract/FREE Full Text
[17].↵
Dwivedi YK, Hughes L, Ismagilova E, Aarts G, Coombs C, Crick T, et al. Artificial Intelligence (AI): Multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy. Int J Inf Manage 2021;57:101994. doi:10.1016/j.ijinfomgt.2019.08.002.
OpenUrl CrossRef
[18].↵
Ashfaq M, Yun J, Yu S, Loureiro SMC. I, Chatbot: Modeling the determinants of users’ satisfaction and continuance intention of AI-powered service agents. Telemat Inform 2020;54:101473. doi:10.1016/j.tele.2020.101473.
OpenUrl CrossRef
[19].↵
Adamopoulou E, Moussiades L. Chatbots: History, technology, and applications. Machine Learning with Applications 2020;2:100006. doi:10.1016/j.mlwa.2020.100006.
OpenUrl CrossRef
[20].↵
Brown T, Mann B, Ryder N, Subbiah M, Kaplan JD, Dhariwal P, et al. Language models are few-shot learners. Adv Neural Inf Process Syst 2020;33:1877–901.
OpenUrl
[21].↵
Ventayen RJM. OpenAI ChatGPT Generated Results: Similarity Index of Artificial Intelligence-Based Contents 2023. doi:10.2139/ssrn.4332664.
OpenUrl CrossRef
[22].↵
Devlin J, Chang M-W, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv [csCL] 2018.
[23].↵
Haluza D, Jungwirth D. Artificial Intelligence and Ten Societal Megatrends: An Exploratory Study Using GPT-3. Systems 2023;11:120. doi:10.3390/systems11030120.
OpenUrl CrossRef
[24].↵
Gordijn B, Have HT. ChatGPT: evolution or revolution? Med Health Care Philos 2023;26:1–2. doi:10.1007/s11019-023-10136-0.
OpenUrl CrossRef
[25].↵
Ali MJ, Djalilian A. Readership awareness series - paper 4: Chatbots and ChatGPT - ethical considerations in scientific publications. Semin Ophthalmol 2023;38:403–4. doi:10.1080/08820538.2023.2193444.
OpenUrl CrossRef
[26].↵
Yang B, Olsen M, Vali Y, Langendam MW, Takwoingi Y, Hyde CJ, et al. Study designs for comparative diagnostic test accuracy: A methodological review and classification scheme. J Clin Epidemiol 2021;138:128–38. doi:10.1016/j.jclinepi.2021.04.013.
OpenUrl CrossRef
[27].↵
Adamson B, Waskom M, Blarre A, Kelly J, Krismer K, Nemeth S, et al. Approach to machine learning for extraction of real-world data variables from electronic health records. bioRxiv 2023. doi:10.1101/2023.03.02.23286522.
OpenUrl CrossRef
[28].↵
Wu S, Roberts K, Datta S, Du J, Ji Z, Si Y, et al. Deep learning in clinical natural language processing: a methodical review. J Am Med Inform Assoc 2020;27:457–70. doi:10.1093/jamia/ocz200.
OpenUrl CrossRef PubMed
[29].↵
Chen IY, Alsentzer E, Park H, Thomas R, Gosangi B, Gujrathi R, et al. Intimate Partner Violence and Injury Prediction From Radiology Reports. Pac Symp Biocomput 2021;26:55–66.
OpenUrl
[30].↵
1. Shaban-Nejad A,
2. Michalowski M,
3. Buckeridge DL
Krishna K, Pavel A, Schloss B, Bigham JP, Lipton ZC. Extracting Structured Data from Physician-Patient Conversations by Predicting Noteworthy Utterances. In: Shaban-Nejad A, Michalowski M, Buckeridge DL, editors. Explainable AI in Healthcare and Medicine: Building a Culture of Transparency and Accountability, Cham: Springer International Publishing; 2021, p. 155–69. doi:10.1007/978-3-030-53352-6_14.
OpenUrl CrossRef
[31].↵
Moradi M, Blagec K, Haberl F, Samwald M. GPT-3 Models are Poor Few-Shot Learners in the Biomedical Domain. arXiv [csCL] 2021.
[32].↵
Gutiérrez BJ, McNeal N, Washington C, Chen Y, Li L, Sun H, et al. Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again. arXiv [csCL] 2022.
[33].↵
Choi E, Bahadori MT, Schuetz A, Stewart WF, Sun J. Doctor AI: Predicting Clinical Events via Recurrent Neural Networks. JMLR Workshop Conf Proc 2016;56:301–18.
OpenUrl
[34].↵
Pang C, Jiang X, Kalluri KS, Spotnitz M, Chen R, Perotte A, et al. CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks. In: Roy S, Pfohl S, Rocheteau E, Tadesse GA, Oala L, Falck F, et al., editors. Proceedings of Machine Learning for Health, vol. 158, PMLR; 2021, p. 239–60.
OpenUrl
[35].↵
Prakash PKS, Chilukuri S, Ranade N, Viswanathan S. RareBERT: Transformer Architecture for Rare Disease Patient Identification using Administrative Claims. AAAI 2021;35:453–60. doi:10.1609/aaai.v35i1.16122.
OpenUrl CrossRef
[36].↵
Hamidi A, Roberts K. Evaluation of AI Chatbots for Patient-Specific EHR Questions. arXiv [csCL] 2023.
[37].↵
Ali SR, Dobbs TD, Hutchings HA, Whitaker IS. Using ChatGPT to write patient clinic letters. Lancet Digit Health 2023;5:e179–81. doi:10.1016/S2589-7500(23)00048-1.
OpenUrl CrossRef
[38].↵
Sallam M. ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns. Healthcare (Basel) 2023;11. doi:10.3390/healthcare11060887.
OpenUrl CrossRef PubMed
[39].↵
Hopkins AM, Logan JM, Kichenadasse G. Artificial intelligence chatbots will revolutionize how cancer patients access information: ChatGPT represents a paradigm-shift. JNCI Cancer Spectrum 2023;7. doi:10.1093/jncics/pkad010.
OpenUrl CrossRef
[40].↵
Khan RA, Jawaid M, Khan AR, Sajjad M. ChatGPT - Reshaping medical education and clinical management. Pak J Med Sci Q 2023;39:605–7. doi:10.12669/pjms.39.2.7653.
OpenUrl CrossRef
[41].↵
Fazlollahi AM, Bakhaidar M, Alsayegh A, Yilmaz R, Winkler-Schwartz A, Mirchi N, et al. Effect of Artificial Intelligence Tutoring vs Expert Instruction on Learning Simulated Surgical Skills Among Medical Students: A Randomized Clinical Trial. JAMA Netw Open 2022;5:e2149008. doi:10.1001/jamanetworkopen.2021.49008.
OpenUrl CrossRef
[42].↵
McKinney SM, Sieniek M, Godbole V, Godwin J, Antropova N, Ashrafian H, et al. International evaluation of an AI system for breast cancer screening. Nature 2020;577:89–94. doi:10.1038/s41586-019-1799-6.
OpenUrl CrossRef PubMed
[43].↵
Tsang R. Practical Applications of ChatGPT in Undergraduate Medical Education. J Med Educ Curric Dev 2023;10:23821205231178449. doi:10.1177/23821205231178449.
OpenUrl CrossRef
[44].↵
Kung TH, Cheatham M, Medenilla A, Sillos C, De Leon L, Elepaño C, et al. Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models. PLOS Digit Health 2023;2:e0000198. doi:10.1371/journal.pdig.0000198.
OpenUrl CrossRef PubMed
[45].↵
Salvagno M, Taccone FS, Gerli AG. Can artificial intelligence help for scientific writing? Crit Care 2023;27:75. doi:10.1186/s13054-023-04380-2.
OpenUrl CrossRef
[46].↵
Salah M, Alhalbusi H, Ismail MM, Abdelfattah F. Chatting with ChatGPT: Decoding the Mind of Chatbot Users and Unveiling the Intricate Connections between User Perception, Trust and Stereotype Perception on Self-Esteem and Psychological Well-being 2023. doi:10.21203/rs.3.rs-2610655/v2.
OpenUrl CrossRef
[47].↵
Ray PP. ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems 2023;3:121–54. doi:10.1016/j.iotcps.2023.04.003.
OpenUrl CrossRef
[48].↵
White J, Fu Q, Hays S, Sandborn M, Olea C, Gilbert H, et al. A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT. arXiv [csSE] 2023.
[49].↵
Liebrenz M, Schleifer R, Buadze A, Bhugra D, Smith A. Generating scholarly content with ChatGPT: ethical challenges for medical publishing. The Lancet Digital Health 2023;5:E105–6. doi:10.1016/S2589-7500(23)00019-5.
OpenUrl CrossRef PubMed
[50].↵
Crawford J, Cowling M, Allen K-A. Leadership is needed for ethical ChatGPT: character, assessment, and learning using artificial intelligence (AI),. Teaching & Learning Practice 2023;20. doi:10.53761/1.20.3.02.
OpenUrl CrossRef

View the discussion thread.

Posted September 01, 2023.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Health Informatics

Subject Areas

All Articles

Addiction Medicine (319)
Allergy and Immunology (622)
Anesthesia (162)
Cardiovascular Medicine (2334)
Dentistry and Oral Medicine (283)
Dermatology (205)
Emergency Medicine (373)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (828)
Epidemiology (11685)
Forensic Medicine (10)
Gastroenterology (691)
Genetic and Genomic Medicine (3669)
Geriatric Medicine (345)
Health Economics (629)
Health Informatics (2357)
Health Policy (924)
Health Systems and Quality Improvement (881)
Hematology (339)
HIV/AIDS (769)
Infectious Diseases (except HIV/AIDS) (13247)
Intensive Care and Critical Care Medicine (764)
Medical Education (363)
Medical Ethics (104)
Nephrology (395)
Neurology (3425)
Nursing (193)
Nutrition (518)
Obstetrics and Gynecology (665)
Occupational and Environmental Health (659)
Oncology (1798)
Ophthalmology (531)
Orthopedics (214)
Otolaryngology (285)
Pain Medicine (229)
Palliative Medicine (66)
Pathology (444)
Pediatrics (1018)
Pharmacology and Therapeutics (424)
Primary Care Research (413)
Psychiatry and Clinical Psychology (3137)
Public and Global Health (6077)
Radiology and Imaging (1251)
Rehabilitation Medicine and Physical Therapy (727)
Respiratory Medicine (818)
Rheumatology (375)
Sexual and Reproductive Health (365)
Sports Medicine (320)
Surgery (396)
Toxicology (50)
Transplantation (171)
Urology (144)

[1] [1].↵
Cheong LHA, Emil S. Outcomes of pediatric appendicitis: an international comparison of the United States and Canada. JAMA Surg 2014;149:50–5. doi:10.1001/jamasurg.2013.2517.
OpenUrl CrossRef PubMed

[2] [2].↵
Samuel M. Pediatric appendicitis score. J Pediatr Surg 2002;37:877–81. doi:10.1053/jpsu.2002.32893.
OpenUrl CrossRef PubMed Web of Science

[3] [3].↵
St. Peter SD, Sharp SW, Holcomb GW, Ostlie DJ. An evidence-based definition for perforated appendicitis derived from a prospective randomized trial. J Pediatr Surg 2008;43:2242–5. doi:10.1016/j.jpedsurg.2008.08.051.
OpenUrl CrossRef PubMed Web of Science

[4] [4].↵
Rogers AP, Zens TJ, Leys CM, Nichol PF, Ostlie DJ. A call for a standardized definition of perforated appendicitis. J Pediatr Surg 2017;52:89–92. doi:10.1016/j.jpedsurg.2016.10.026.
OpenUrl CrossRef

[5] [5].↵
Yousef Y, Youssef F, Homsy M, Dinh T, Pandya K, Stagg H, et al. Standardization of care for pediatric perforated appendicitis improves outcomes. J Pediatr Surg 2017;52:1916–20. doi:10.1016/j.jpedsurg.2017.08.054.
OpenUrl CrossRef

[6] [6].↵
Lonsdale H, Jalali A, Ahumada L, Matava C. Machine Learning and Artificial Intelligence in Pediatric Research: Current State, Future Prospects, and Examples in Perioperative and Critical Care. J Pediatr 2020;221S:S3–10. doi:10.1016/j.jpeds.2020.02.039.
OpenUrl CrossRef

[7] [7].↵
Reismann J, Romualdi A, Kiss N, Minderjahn MI, Kallarackal J, Schad M, et al. Diagnosis and classification of pediatric acute appendicitis by artificial intelligence methods: An investigator-independent approach. PLoS One 2019;14:e0222030. doi:10.1371/journal.pone.0222030.
OpenUrl CrossRef

[8] [8].↵
Akmese OF, Dogan G, Kor H, Erbay H, Demir E. The Use of Machine Learning Approaches for the Diagnosis of Acute Appendicitis. Emerg Med Int 2020;2020:7306435. doi:10.1155/2020/7306435.
OpenUrl CrossRef

[9] [9].↵
Adams ST, Leveson SH. Clinical prediction rules. BMJ 2012;344:d8312. doi:10.1136/bmj.d8312.
OpenUrl FREE Full Text

[10] [10].↵
Marcinkevics R, Reis Wolfertstetter P, Wellmann S, Knorr C, Vogt JE. Using Machine Learning to Predict the Diagnosis, Management and Severity of Pediatric Appendicitis. Front Pediatr 2021;9:662183. doi:10.3389/fped.2021.662183.
OpenUrl CrossRef

[11] [11].↵
Murdoch TB, Detsky AS. The inevitable application of big data to health care. JAMA 2013;309:1351–2. doi:10.1001/jama.2013.393.
OpenUrl CrossRef PubMed Web of Science

[12] [12].↵
Shore C, Gee AW, Kahn B, Forstag EH, National Academies of Sciences Engineering, Medicine, et al. Barriers and Disincentives to the Use of Real-World Evidence and Real-World Data. Examining the Impact of Real-World Evidence on Medical Product Development: Proceedings of a Workshop Series, National Academies Press (US); 2019.

[13] [13].↵
Birnbaum B, Nussbaum N, Seidl-Rathkopf K, Agrawal M, Estevez M, Estola E, et al. Model-assisted cohort selection with bias analysis for generating large-scale cohorts from the EHR for oncology research. arXiv [csCY] 2020.

[14] [14].↵
Sharifani K, Amini M. Machine Learning and Deep Learning: A Review of Methods and Applications 2023.

[15] [15].↵
Chai J, Zeng H, Li A, Ngai EWT. Deep learning in computer vision: A critical review of emerging techniques and application scenarios. Machine Learning with Applications 2021;6:100134. doi:10.1016/j.mlwa.2021.100134.
OpenUrl CrossRef

[16] [16].↵
Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future Healthc J 2019;6:94–8. doi:10.7861/futurehosp.6-2-94.
OpenUrl Abstract/FREE Full Text

[17] [17].↵
Dwivedi YK, Hughes L, Ismagilova E, Aarts G, Coombs C, Crick T, et al. Artificial Intelligence (AI): Multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy. Int J Inf Manage 2021;57:101994. doi:10.1016/j.ijinfomgt.2019.08.002.
OpenUrl CrossRef

[18] [18].↵
Ashfaq M, Yun J, Yu S, Loureiro SMC. I, Chatbot: Modeling the determinants of users’ satisfaction and continuance intention of AI-powered service agents. Telemat Inform 2020;54:101473. doi:10.1016/j.tele.2020.101473.
OpenUrl CrossRef

[19] [19].↵
Adamopoulou E, Moussiades L. Chatbots: History, technology, and applications. Machine Learning with Applications 2020;2:100006. doi:10.1016/j.mlwa.2020.100006.
OpenUrl CrossRef

[20] [20].↵
Brown T, Mann B, Ryder N, Subbiah M, Kaplan JD, Dhariwal P, et al. Language models are few-shot learners. Adv Neural Inf Process Syst 2020;33:1877–901.
OpenUrl

[21] [21].↵
Ventayen RJM. OpenAI ChatGPT Generated Results: Similarity Index of Artificial Intelligence-Based Contents 2023. doi:10.2139/ssrn.4332664.
OpenUrl CrossRef

[22] [22].↵
Devlin J, Chang M-W, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv [csCL] 2018.

[23] [23].↵
Haluza D, Jungwirth D. Artificial Intelligence and Ten Societal Megatrends: An Exploratory Study Using GPT-3. Systems 2023;11:120. doi:10.3390/systems11030120.
OpenUrl CrossRef

[24] [24].↵
Gordijn B, Have HT. ChatGPT: evolution or revolution? Med Health Care Philos 2023;26:1–2. doi:10.1007/s11019-023-10136-0.
OpenUrl CrossRef

[25] [25].↵
Ali MJ, Djalilian A. Readership awareness series - paper 4: Chatbots and ChatGPT - ethical considerations in scientific publications. Semin Ophthalmol 2023;38:403–4. doi:10.1080/08820538.2023.2193444.
OpenUrl CrossRef

[26] [26].↵
Yang B, Olsen M, Vali Y, Langendam MW, Takwoingi Y, Hyde CJ, et al. Study designs for comparative diagnostic test accuracy: A methodological review and classification scheme. J Clin Epidemiol 2021;138:128–38. doi:10.1016/j.jclinepi.2021.04.013.
OpenUrl CrossRef

[27] [27].↵
Adamson B, Waskom M, Blarre A, Kelly J, Krismer K, Nemeth S, et al. Approach to machine learning for extraction of real-world data variables from electronic health records. bioRxiv 2023. doi:10.1101/2023.03.02.23286522.
OpenUrl CrossRef

[28] [28].↵
Wu S, Roberts K, Datta S, Du J, Ji Z, Si Y, et al. Deep learning in clinical natural language processing: a methodical review. J Am Med Inform Assoc 2020;27:457–70. doi:10.1093/jamia/ocz200.
OpenUrl CrossRef PubMed

[29] [29].↵
Chen IY, Alsentzer E, Park H, Thomas R, Gosangi B, Gujrathi R, et al. Intimate Partner Violence and Injury Prediction From Radiology Reports. Pac Symp Biocomput 2021;26:55–66.
OpenUrl

[30] [30].↵
Shaban-Nejad A,
Michalowski M,
Buckeridge DL
Krishna K, Pavel A, Schloss B, Bigham JP, Lipton ZC. Extracting Structured Data from Physician-Patient Conversations by Predicting Noteworthy Utterances. In: Shaban-Nejad A, Michalowski M, Buckeridge DL, editors. Explainable AI in Healthcare and Medicine: Building a Culture of Transparency and Accountability, Cham: Springer International Publishing; 2021, p. 155–69. doi:10.1007/978-3-030-53352-6_14.
OpenUrl CrossRef

[31] Shaban-Nejad A,

[32] Michalowski M,

[33] Buckeridge DL

[34] [31].↵
Moradi M, Blagec K, Haberl F, Samwald M. GPT-3 Models are Poor Few-Shot Learners in the Biomedical Domain. arXiv [csCL] 2021.

[35] [32].↵
Gutiérrez BJ, McNeal N, Washington C, Chen Y, Li L, Sun H, et al. Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again. arXiv [csCL] 2022.

[36] [33].↵
Choi E, Bahadori MT, Schuetz A, Stewart WF, Sun J. Doctor AI: Predicting Clinical Events via Recurrent Neural Networks. JMLR Workshop Conf Proc 2016;56:301–18.
OpenUrl

[37] [34].↵
Pang C, Jiang X, Kalluri KS, Spotnitz M, Chen R, Perotte A, et al. CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks. In: Roy S, Pfohl S, Rocheteau E, Tadesse GA, Oala L, Falck F, et al., editors. Proceedings of Machine Learning for Health, vol. 158, PMLR; 2021, p. 239–60.
OpenUrl

[38] [35].↵
Prakash PKS, Chilukuri S, Ranade N, Viswanathan S. RareBERT: Transformer Architecture for Rare Disease Patient Identification using Administrative Claims. AAAI 2021;35:453–60. doi:10.1609/aaai.v35i1.16122.
OpenUrl CrossRef

[39] [36].↵
Hamidi A, Roberts K. Evaluation of AI Chatbots for Patient-Specific EHR Questions. arXiv [csCL] 2023.

[40] [37].↵
Ali SR, Dobbs TD, Hutchings HA, Whitaker IS. Using ChatGPT to write patient clinic letters. Lancet Digit Health 2023;5:e179–81. doi:10.1016/S2589-7500(23)00048-1.
OpenUrl CrossRef

[41] [38].↵
Sallam M. ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns. Healthcare (Basel) 2023;11. doi:10.3390/healthcare11060887.
OpenUrl CrossRef PubMed

[42] [39].↵
Hopkins AM, Logan JM, Kichenadasse G. Artificial intelligence chatbots will revolutionize how cancer patients access information: ChatGPT represents a paradigm-shift. JNCI Cancer Spectrum 2023;7. doi:10.1093/jncics/pkad010.
OpenUrl CrossRef

[43] [40].↵
Khan RA, Jawaid M, Khan AR, Sajjad M. ChatGPT - Reshaping medical education and clinical management. Pak J Med Sci Q 2023;39:605–7. doi:10.12669/pjms.39.2.7653.
OpenUrl CrossRef

[44] [41].↵
Fazlollahi AM, Bakhaidar M, Alsayegh A, Yilmaz R, Winkler-Schwartz A, Mirchi N, et al. Effect of Artificial Intelligence Tutoring vs Expert Instruction on Learning Simulated Surgical Skills Among Medical Students: A Randomized Clinical Trial. JAMA Netw Open 2022;5:e2149008. doi:10.1001/jamanetworkopen.2021.49008.
OpenUrl CrossRef

[45] [42].↵
McKinney SM, Sieniek M, Godbole V, Godwin J, Antropova N, Ashrafian H, et al. International evaluation of an AI system for breast cancer screening. Nature 2020;577:89–94. doi:10.1038/s41586-019-1799-6.
OpenUrl CrossRef PubMed

[46] [43].↵
Tsang R. Practical Applications of ChatGPT in Undergraduate Medical Education. J Med Educ Curric Dev 2023;10:23821205231178449. doi:10.1177/23821205231178449.
OpenUrl CrossRef

[47] [44].↵
Kung TH, Cheatham M, Medenilla A, Sillos C, De Leon L, Elepaño C, et al. Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models. PLOS Digit Health 2023;2:e0000198. doi:10.1371/journal.pdig.0000198.
OpenUrl CrossRef PubMed

[48] [45].↵
Salvagno M, Taccone FS, Gerli AG. Can artificial intelligence help for scientific writing? Crit Care 2023;27:75. doi:10.1186/s13054-023-04380-2.
OpenUrl CrossRef

[49] [46].↵
Salah M, Alhalbusi H, Ismail MM, Abdelfattah F. Chatting with ChatGPT: Decoding the Mind of Chatbot Users and Unveiling the Intricate Connections between User Perception, Trust and Stereotype Perception on Self-Esteem and Psychological Well-being 2023. doi:10.21203/rs.3.rs-2610655/v2.
OpenUrl CrossRef

[50] [47].↵
Ray PP. ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems 2023;3:121–54. doi:10.1016/j.iotcps.2023.04.003.
OpenUrl CrossRef

[51] [48].↵
White J, Fu Q, Hays S, Sandborn M, Olea C, Gilbert H, et al. A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT. arXiv [csSE] 2023.

[52] [49].↵
Liebrenz M, Schleifer R, Buadze A, Bhugra D, Smith A. Generating scholarly content with ChatGPT: ethical challenges for medical publishing. The Lancet Digital Health 2023;5:E105–6. doi:10.1016/S2589-7500(23)00019-5.
OpenUrl CrossRef PubMed

[53] [50].↵
Crawford J, Cowling M, Allen K-A. Leadership is needed for ethical ChatGPT: character, assessment, and learning using artificial intelligence (AI),. Teaching & Learning Practice 2023;20. doi:10.53761/1.20.3.02.
OpenUrl CrossRef

Using Artificial Intelligence To Label Free-Text Operative And Ultrasound Reports For Grading Pediatric Appendicitis

Abstract

Introduction

Methods

Study Design

Patient Characteristics

Instruments and Data Collection

Perforated Appendicitis Grade (PAG)

Determination of perforation and severity of appendicitis (OR report)

Procedures

Variables and statistical analysis

Results

US Reports

Operative Reports

Time required

Discussion

Limitations

Conclusion

Data Availability

Footnotes

References

Citation Manager Formats

Subject Area