Prognostic Models Predicting Clinical Outcomes in Patients Diagnosed with Visceral Leishmaniasis: A Systematic Review ===================================================================================================================== * James P Wilson * Forhad Chowdhury * Shermarke Hassan * Eli Harriss * Fabiana Alves * Ahmed Musa * Prabin Dahal * Kasia Stepniewska * Philippe J Guérin ## Abstract **Background** Visceral leishmaniasis (VL) is a neglected tropical disease prevalent in populations affected by poverty and poor nutrition. Without treatment, death is the norm. Prognostic models can steer important management decisions by identifying patients at high-risk of adverse outcomes. We therefore aim to identify, summarise, and appraise the available prognostic models predicting clinical outcomes in VL patients. **Methods** We reviewed all published studies that developed, validated, or updated models predicting clinical outcomes in VL patients. Five bibliographic databases were searched from database inception to March 1st 2023 with no language restriction. Screening, data extraction, and risk of bias assessment were performed in duplicate. Findings are presented with tables, figures, and a narrative review. **Results** Eight studies, published 2003-21, were identified describing 12 model developments and 19 external validations. All models predicted either in-hospital mortality (n=10 models) or registry-reported mortality (n=2), and were developed in either Brazilian or East African settings (n=9 and n=3 models respectively). Model discrimination (c-statistic) ranged from 0.62-0.92 when evaluated in new data (19 external validations, 10 models). Risk of bias was high for all model developments and validations: no studies presented calibration plots, 11 models were at high risk of overfitting due to small sample sizes, and six models presented risk scores that were inconsistent with reported regression coefficients. **Conclusion** With a high risk of bias identified for all models, caution must be exercised when interpreting model predictions and performance measures. Prior to model development or validation, we encourage investigators to review model reporting guidelines. No prognostic models were identified predicting treatment failure or relapse. Furthermore, despite South Asia representing the highest VL burden pre-2010, no models were developed in this population. In the context of the current South Asia elimination programme, these represent important evidence gaps where new model development should be prioritised. **Registration details** A protocol for this systematic review has been published (1) and registered (PROSPERO ID: CRD42023417226). **What is already known on this topic** * Visceral leishmaniasis (VL) is a neglected tropical disease associated with high mortality, and endemic to regions with constrained resources. * Identification of high-risk patients is important when prioritising the allocation of limited resources, including inpatient beds, certain VL treatments, and follow-up clinic capacity. * Risk stratification of VL patients can be performed using prognostic models, however, the range of models, and important model characteristics, have yet to be systematically evaluated. **What this study adds** * Following reporting guidelines for systematic reviews of prediction model studies, we present the first comprehensive review of prognostic models that predict clinical outcomes in VL patients. * We describe 12 prognostic models that all predict mortality in Brazil or East Africa. * All identified models, including model validations, are assessed at high risk of bias – model predictions and performance measures should be interpreted with caution. **How this study might affect research, practice or policy** * This review allows investigators to assess important evidence gaps in the VL prediction model landscape, and identify candidate models for validation or updating using their own patient data. * Models are identified, summarised, and appraised so that policymakers and healthcare providers can assess model applicability to their own patient population. * By highlighting limitations in the interpretation of model predictions and performance measures, and to address common sources of bias, we encourage investigators interested in prediction model research to review current guidelines in model reporting, including recently published tools for the calculation of sample sizes and model presentation. Keywords * Leishmaniasis * Visceral * Review * Systematic * Clinical Decision Rules * Prognosis * Prognostic Factors ## Introduction Visceral leishmaniasis (VL), a parasitic infection transmitted between mammalian hosts via the bite of an infected sand fly, disproportionately touches vulnerable people affected by poverty, malnutrition, and forced migration (2). Considered a neglected tropical disease by the World Health Organization (WHO), VL typically presents insidiously with fever, splenomegaly, and weight-loss, and is almost always fatal without effective treatment. The WHO estimates an incidence of 50,000 to 90,000 cases per year, although accurate estimates are obfuscated by incomplete country-level reporting (2,3). Despite progress over the last 20 years, successful treatment remains challenged by drug availability, prolonged treatment courses requiring hospitalisation, and frequent drug side effects (4). Patients with previous treatment failure or immunosuppressive comorbidities such as advanced human immunodeficiency virus (HIV) infection experience particularly high relapse and mortality rates (2,5). In endemic settings, the accurate identification of high-risk patients is paramount when prioritising the distribution of limited resources; including inpatient beds, treatment regimen, and follow-up intensity. Using prognostic models, typically developed using multivariable regression techniques, the likelihood of a future event occurring can be predicted at the individual patient level (6). Often presented as simplified risk scores, such models abound in the medical literature and support policymakers and healthcare providers in developing treatment guidelines and patient-centred management plans (7,8). Indeed, the number of published prognostic models has steadily surged across range of medical disciplines (8). In infectious diseases alone, recent systematic reviews have identified over 600 prognostic models for COVID-19 (9), 37 models for tuberculosis (10), and 27 models for malaria (11). Despite a burgeoning interest in prediction model research, concerns have been voiced on the conduct and reporting of new models. Biased models can lead to exaggerated estimates of model performance, misleading predictions, and, ultimately, suboptimal and potentially inequitable decision-making (7,8,12). Prognostic models have been developed and implemented for patients with VL (13,14). The Brazilian Ministry of Health introduced guidelines in 2011 advocating the use of four related risk scores, using a combination of clinical and/or laboratory factors to support whether to admit the patient and treat with liposomal amphotericin B (14–17). Similarly, Médecins Sans Frontières (MSF) Holland in South Sudan have, since 2003, used simple VL risk scores to complement clinical judgement in deciding whether to treat with liposomal amphotericin B, broad-spectrum antibiotics, blood transfusions, and nutritional support (13,18,19). Further models exist (20,21), although the range of models, including their characteristics, comparative performance, and inherent biases, have yet to be systematically described. We therefore present a systematic review of VL prognostic models, with the aim of identifying, summarising, and appraising prognostic models that predict future clinical outcomes in patients diagnosed with VL. Such a review will allow policymakers to critically assess the implementation of models in treatment guidelines, and healthcare providers to assess the applicability of a particular model to their individual patient, empowering the creation of a patient-centred management plan. Additionally, investigators interested in prediction model research can use this review to identify evidence gaps and assess whether their patient data could be used to develop a new model, or preferably, validate and/or update a pre-existing model (22). A glossary of key terms relating to model development and evaluation is presented in Box 1. View this table: [Box 1:](http://medrxiv.org/content/early/2024/03/21/2024.03.20.24304622/T1) Box 1: Glossary of key terms relating to prediction model development and evaluation. Adapted from Collins et al 2024 (1) and Moons et al 2015 (2). Collins GS, Dhiman P, Ma J, Schlussel MM, Archer L, Van Calster B, et al. Evaluation of clinical prediction models (part 1): from development to external validation. BMJ. 2024 Jan 8;e074819. Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration. Ann Intern Med. 2015 Jan 6;162(1):W1–73. ## Methodology ### Protocol and registration A protocol for this systematic review has previously been published (1) and registered (PROSPERO registration number: CRD42023417226). We adhere to Transparent Reporting of Multivariable Prediction Models for Individual Prognosis or Diagnosis: Checklist for Systematic Reviews and Meta-analysis (TRIPOD-SRMA) when reporting this systematic review (23). Data extraction is guided by the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS) (24) and the Prediction Model Risk of Bias Assessment Tool (PROBAST) (7,25). Risk of bias assessment is performed with PROBAST (7,25). ### Eligibility criteria We follow a Population, Index model, Comparator model, Outcomes, Timing, Setting (PICOTS) approach to frame our review question and define our eligibility criteria (Table 1) (24,26). Studies reporting model development, external validation, updating, or any combination thereof, are considered for inclusion. View this table: [Table 1:](http://medrxiv.org/content/early/2024/03/21/2024.03.20.24304622/T2) Table 1: PICOTS approach to frame the research question and define the inclusion criteria. VL: visceral leishmaniasis. To provide a broad overview of the available prognostic models, we consider all clinical outcomes in patients with a diagnosis of VL and impose no limitation on model setting or prediction horizon (elapsed time period between the intended time of model use and the outcome being predicted). In accordance with best practice in prediction modelling research (12,25,27), we define a prognostic model as a multivariable model (including two or more predictors) developed with the intention of predicting future outcomes at the individual patient level. Prognostic model studies are distinguished from predictor finding or prognostic factor studies, where the aim is to investigate the effect of a single or group of factors on an outcome of interest (28). We also exclude unpublished studies (including conference abstracts, educational theses), studies that only report diagnostic prediction models, and animal studies. In addition to studies describing prognostic models, we also identify systematic reviews, meta-analyses, and impact studies of eligible prognostic model and prognostic factor studies. These studies contribute to a narrative review and were not subject to formal data extraction. ### Information sources and search strategy An information specialist (EH) created the search strategies to retrieve relevant records from the following databases: Ovid Embase; Ovid MEDLINE; the Web of Science Core Collection, SciELO and LILACS. The databases were searched on 1st March 2023 from database inception. The search strategy used text words and relevant indexing terms to retrieve studies describing eligible prognostic models. The Ingui search filter was augmented with an additional search string as described by Geersing et al (29,30), and combined with VL-specific keywords. Google scholar was used to identify any complementary grey literature (full search strategy presented in Supplemental Text 1). ### Study selection Deduplication and screening of references were performed in Covidence (31). Screening was performed independently by two reviewers (JW, FC); initially at title and abstract level, and subsequently at full-text level. Where discordance existed, a third reviewer (PD), an experienced statistician, was consulted to make the final judgement. Subsequent forward and backward citation searching were performed to identify records missed by the initial search. ### Data collection process Study information was captured using a REDCap server hosted at the University of Oxford (32). A data extraction form was created and piloted as per the CHARMS checklist and PROBAST (Supplemental Table 1) (7,24,25). Two reviewers (JW, SH) independently extracted the study information. Where discordance remained after discussion, a final decision was made by a third expert reviewer (PD). Study authors were not contacted in the event of unclear or missing information. ### Risk of bias assessment Risk of bias was assessed using PROBAST (7,25). Two reviewers (JW, SH) independently assessed each model development (including updating) and external validation by answering 20 signalling questions across four domains (participants, predictors, outcome, and analysis). Responses were used to judge the overall risk of bias as either ‘low’, ‘high’ or ‘unclear’. Any discordance was resolved through discussion. ### Synthesis of results Information extracted from the identified studies is summarised with a narrative review. Tables and figures facilitate the presentation of important model characteristics. Given our aim is not to compare multiple validations of a single model, no meta-analysis is performed. ### Role of the funding source The funders had no role in the conception, design, or presentation of this systematic review. ### Patient and public involvement No patients nor members of the public were involved in this systematic review. ### Data sharing and availability Extracted information is presented in full either in this article and the supplemental material. Our findings will be disseminated through our research group’s website ([www.iddo.org/research-themes/visceral-leishmaniasis](http://www.iddo.org/research-themes/visceral-leishmaniasis)) and social media channels. ## Results ### Study selection After deduplication, we identified 2,661 records from the literature search (flow diagram presented in Figure 1). Title and abstract screening identified 61 records for full-text review, of which 11 records were included: eight prognostic model studies (13,14,20,21,33–36) and three systematic reviews (37–39). No impact studies were identified. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/21/2024.03.20.24304622/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2024/03/21/2024.03.20.24304622/F1) Figure 1: Flow diagram of literature search performed on March 1st 2023, and subsequent record screening. Of the eight prognostic model studies, four described the development of prognostic models without external validation (20,21,33,34), three described the development of prognostic models with external validation (13,14,35), and one study validated a pre-existing model with subsequent updating (recalibration) (36). A narrative review of the identified systematic reviews is available in Supplemental Text 2. ### Model characteristics Important characteristics of the identified models are summarised in Table 2. Each study reported at least one prognostic model, with one study reporting two models (35) and a further study reporting four models (14). Including the updated model (36), a total of 12 models are described, predicting two different outcomes: in-hospital mortality (n=10) (13,14,21,34–36) and registry-reported mortality (n=2) (20,33). View this table: [Table 2:](http://medrxiv.org/content/early/2024/03/21/2024.03.20.24304622/T3) Table 2: Key characteristics on the 12 prognostic model developments, ordered by outcome and year published. Each row corresponds to a different model. *Including patients with missing predictor information, excluding patients with missing/excluded outcomes (unless otherwise stated). †Number of predictor parameters (degrees of freedom); for example, a binary or linear predictor is described with 1 parameter; a predictor with 4 categories is described with 3 parameters. Candidate predictors presented in brackets are estimated from incomplete reporting. ‡Number of candidate predictors unclear. Numbers presented are inferred from the study description of extracted information and baseline characteristics. §Number of events not disaggregated by model. ¶Age distribution tabulated by group (not reproduced here). **Sample size excludes both participants with missing predictors and missing/excluded outcomes -, not reported; clin., clinical; conf., confirmed; EPP, events per predictor parameter; excl., excluded; HIV, human immunodeficiency virus; IQR, interquartile range; lab., laboratory; susp., suspected; yrs, years. Most commonly, studies employed a retrospective cohort design using either hospital records (4 studies, n=5 models) (13,34–36) or national registry data (2 studies, n=2 models) (20,33). One study used a prospective cohort design (1 study, n=4 models) (14) and one study used a case-control design from hospital records (n=1 model) (21). The median number of patients per model development was 542 (range 90–12,333). Over half of all model patients contributed to the largest model from a national registry of confirmed VL cases in Brazil (52.4%, n=12,333) (20). Most models were developed in Brazil (6 studies, n=9 models) (14,20,21,33,34,36), with the remainder developed in East Africa (one study presenting two models with South Sudanese patients (35), and one study presenting one model with Ethiopian patients (13)). No models were developed in South Asia or the Mediterranean region. Participant age formed the inclusion criteria of eight models (14,34–36), of which five models only included adolescents and younger (14,34–36). No model excluded participants based on sex. Where reported, the median proportion of male participants was 56.2% (range 48.7–95.9%, n=7). Patients living with HIV were either excluded (n=2) (35), not reported (n=8) (14,33,34,36), and where reported, ranged from 7.0–19.3% (n=2) (13,20). Figure 2 presents the final and candidate predictors identified across model developments, although reporting was inconsistent. The four most common candidate predictors were jaundice (n=12 models), age (n=11), sex (n=10) and bleeding (n=10). Initial VL treatment was included as a candidate predictor in two models, although not included in the final models. Predictors most frequently included in the final models were jaundice (n=11), bleeding (n=8), and age (n=7). No models included sex as a final predictor. Excluding HIV testing, four models did not consider laboratory tests as a predictor (14,20,33). The remaining models (n=8) (13,14,21,34–36) considered and included laboratory tests as predictors in the final model. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2024/03/21/2024.03.20.24304622/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2024/03/21/2024.03.20.24304622/F2) Figure 2: Illustration of the variation of predictors, both considered and included in prognostic model developments. The bar chart presents the number of models incorporating each predictor. To aid comparison, similar predictors have been grouped and renamed. See Supplementary Material for full predictor definitions and groupings. ALT, alanine transaminase; AST, aspartate transaminase; clin., clinical; HIV, human immunodeficiency virus; lab., laboratory; TB, tuberculosis. *Described by author, year, and model name where study presents more than one model †Cough and diarrhoea are combined as a single predictor ‡Oedema and ascites are assessed as a single predictor by [Kämink 2017]. §Considered predictors were assessed, but not included in the final model ¶Predictors in the updated model matched the final predictors in the model being updated [Sampaio 2010] **Predictors assessed in ≤ 2 models and which are not included in the final model: [Werneck 2003]: ‘abdominal distension’, episodes of blood transfusion; [de Araújo 2012]: fever, hepatomegaly, “other clinical manifestations”, initial VL treatment regimen, other VL drugs following initial VL treatment regimen, duration of treatment with antimonials; [Coura-Vital 2014]: fever, hepatomegaly, “other clinical manifestations”, race, education; [Abongomera 2017]: initial VL treatment regimen; [Kämink 2017, both models]: lymphadenopathy. All 12 models were presented as simplified risk scores, with cut-off thresholds suggested to facilitate clinical decision making. Outcome (i.e. mortality) probabilities corresponding to the risk scores were presented either in tabular format (n=4) (13,20,35), graphically and through a web application (n=4) (14), or not presented (n=4) (21,33,34,36). The full model equation, including model intercept, was reproducible for two models in the development study (20,36), and for a third model in the model updating study (34,36). Ten of the 12 models underwent one or more external validations; presented either in the development study (n=7 models, 15 validations) (13,14,35,36), or evaluated in separate studies (n=3, 4 validations) (14,36). Two Brazilian studies reported the validation of six models with data collected from the same hospital as model development (developed and validated in either Teresina, state of Piauí (n=5 models) (14,21) or Recife, state of Pernambuco (n=1 model) (36)). Prospectively collected data from Teresina were also used to validate the model developed in Recife (34), and a further model developed using Brazilian national registry data (20). A model developed in Ethiopia (Abdurafi health centre, Abdurafi, Amhara region) was validated using data from a nearby treatment centre (Leishmaniasis Research and Treatment Centre, Gondar, Gondar, Amhara region) (13), and two models developed in South Sudan were validated using retrospectively collected data from both the same treatment centre (Lankien hospital, Jonglei state) and a treatment centre from a neighbouring state in South Sudan (Malakal hospital, Upper Nile state) (35). Performance measures of discrimination (c-statistic, see Box 1) were presented for all model risk scores. For external validations, the median risk score c-statistic was 0.78 (range 0.62–0.92, n=10 models corresponding to 19 external validations). All studies reported the apparent discriminative performance, with a median risk score c-statistic of 0.86 (range 0.56 – 0.93, n=12 models). A c-statistic was presented for one full model equation (36) and for two model risk scores after accounting for overfitting (internal validation) (13,36). No studies presented overall measures of performance (such as R2) or calibration plots. One model’s calibration plot could be reproduced from an internal (split-sample) validation of the risk score (20). ### Risk of bias assessment Risk of bias assessments for all model developments and external validations, alongside key information on model performance, is presented in Table 3. View this table: [Table 3:](http://medrxiv.org/content/early/2024/03/21/2024.03.20.24304622/T4) Table 3: Summary of model presentations and reproducibility for model developments. Performance estimates and risk of bias assessment is presented for both model development (including updating) and external validations. *All internal validation c-statistics relate to the apparent performance of the risk score, i.e. not adjusted for overfitting, unless otherwise stated. 95% confidence intervals are reproduced when reported. Models receiving multiple external validations report multiple c-statistics. the external validations were assessed as having the same risk of bias across all categories and are therefore presented together. †Split-sample (random, 2:1 development:validation). ‡Full regression equation subsequently reported by Foinquinos et al, 2021, when presenting external validation. §Cross-validation (5-fold). ¶Assessing performance of the full model equation. **Assessment of risk of bias is performed separately for model developments, including updating, and external validations. For every model that received more than one external validation +, high risk of bias; -, low risk of bias; ?, unclear risk of bias; -, not reported; c-statistic, concordance-statistic; CI, confidence interval; dev: development; n/a, not applicable for external validations only; N, no; O, outcome; OA, overall assessment; P, participants; Pr, predictors; val, validation; Y, yes. All 12 model developments were assessed at an overall high risk of bias. Specifically, the model analysis domain was universally assessed at high risk of bias. One model obtained a sufficient sample size (event to predictor parameter ratio > 10), and adequately reported model performance, including calibration (20). All models (excluding the model updating) were developed using a univariable selection stage and did not adjust model predictions to account for optimism due to overfitting. For six models, the presented risk score was not reproducible from the presented regression model coefficients and reported methodology (14,34,36) (calculations reported in Supplemental Text 3). The outcome domain was assessed at low risk of bias for all model developments, except for two models where bias risk was unclear (20,33). The participants and predictors domains were assessed at high risk of bias for eight (13,20,21,33–36) and seven models (20,21,33–36), respectively. Specifically, risk of bias was considered high due to differences in predictor definitions and assessments for different patients (n=7) (20,21,33–36) and due to retrospective data collection (n=8) (13,20,21,33–36). One model included predictors that were measured after the likely time of intended model implementation (33). All extracted information, including characteristics of the external validation datasets, is presented in Supplemental Tables 2-6. This includes participant, predictor, and outcome characteristics (Supplemental Table 2), model analysis, specification, and performance characteristics (Supplemental Table 3), risk score thresholds and interpretations (Supplemental Table 4), risk of bias assessment (Supplemental Table 5), and predictor definitions and categories (Supplemental Table 6). ## Discussion Across a number of diseases, publications of prognostic models have surged over the last 20 years, driven by an increasing focus on personalised medicine, the need to provide evidence for guideline development, and a growing number of tools available for model development (10,27,40). VL is no exception, with a total of 12 VL prognostic models identified, of which nine were published in the last 10 years (2013–23). We show that all models were developed in Brazil or East Africa, and predict either in-hospital or registry-reported mortality. When using a prognostic model to predict a VL patient’s mortality risk, for example, in a hospital ward or outpatient clinic, the healthcare provider should be confident that by inputting model predictors (e.g. age, haemoglobin level, clinical signs and symptoms), they receive, as the model output, a reliable estimate of mortality risk. Empowered with this information, the patient can be counselled and shared treatment decisions reached to optimise the available resources and mitigate any increased mortality risk (e.g. allocation of hospital beds and choice of VL treatment). The ability of a model to estimate risk and guide decision making is fundamental: inaccurate risk predictions can lead to suboptimal and inequitable care. Prior to model implementation, VL stakeholders should consider (i) are model risk estimates and performance measures adequately presented, (ii) are they reliable (i.e. subject to bias), and importantly, (iii) how applicable is the model, including its reported performance, to the patient population of interest. The accuracy of a model’s predictions, where predicted risks and observed outcomes are compared (model calibration, see Box 1), is a fundamental measure of a model’s performance (41,42). However, with calibration only presented adequately for one model (20), we find that VL prognostic model research bucks a broader and concerning trend in prediction model studies: calibration is neglected (9,22,43). Instead, measures of discrimination (the ability of a model to rank patients according to outcome risk, e.g. c-statistic), and classification (where a single risk threshold is assessed, e.g. sensitivity, specificity), are preferentially reported, despite limited clinical utility (7). All VL model developments were assessed at high risk of bias. Model overfitting, a common source of bias, occurs when the modelling process captures idiosyncratic random variation in the development dataset that is not reflective of the true patient population. Small sample sizes, grouping of continuous predictors, and univariable predictor selection can all introduce bias through overfitting, leading to optimistic performance estimates and exaggerated risk predictions (7,25,44). Bearing this in mind, we demonstrate that all 12 VL prognostic models grouped continuous predictors (such as age and haemoglobin) and included a univariable selection stage in the modelling process. Regarding sample size, the upper limit of events per predictor parameter (EPP) to avoid overfitting is often suggested at 10-20 EPP (7). However, only one model had over 10 EPP (20), with the majority having under 5 EPP. Despite two models reporting model discrimination (c-statistics) adjusted for model overfitting (13,20), no adjusted risk estimates were presented. An important source of bias relates to the accuracy of the derived risk scores: we demonstrate that only half of the risk scores were reproducible from the presented regression coefficients according to the authors’ methodologies, resulting in suboptimal mortality risk estimates. In four studies, presenting four models, mortality estimates were only reported after dichotomising the risk as ‘high’ or ‘low’ (21,33,34,36). Grouping risk in such a way may guide decision-making in specific settings, but disregards useful information stored in the model, preventing effective use at the bedside or evaluation in new settings. To this end, presentation of the full model equation, including model intercept, is necessary (7). With only two studies presenting the full model equation in the original studies (20,36), investigators wishing to assess the performance of the remaining models in their own data would need to contact the study authors directly. Visceral leishmaniasis is a complex, heterogeneous disease. The ‘typical’ VL patient and their disease course varies widely between, and often within, endemic areas. This heterogeneity may be explained by a combination of host factors (including age, immunosuppression, malnutrition, and other comorbidities), parasite factors (including genetic variation within and between species), and environmental factors (including vector biology, access to healthcare, and choice of VL treatment) (2,4). The variation in these factors, many of which are unmeasured in routine practice, and surely many additional unknown factors, present prediction challenges when transporting a model from one setting to another. For example, without model updating (recalibration), the application of a model developed in a setting with relatively high inpatient mortality, such as Brazil or a highly HIV-endemic region of East Africa, would likely systematically overestimate mortality in South Asia, a region with relatively low mortality rates (38,39,45). With such variation, and even after disregarding bias introduced through model development, the appealing concept of ‘one model fits all’ becomes untenable. Indeed, the applicability of a model to new settings can only be assessed through external validation (46). Notably, all external validations were performed with datasets from the same country as model development, of which the majority were performed in the same treatment centre using data from a different time period (described as ‘narrow’ or ‘temporal’ validations). Unsurprisingly, all assessed models better discriminate VL patient mortality in the dataset they were developed from compared to the dataset they were evaluated in. This can be explained by true differences in patients, predictors and outcomes not accounted for by the model, and as previously discussed, optimism introduced through overfitting. Where outcome timing is described, one study reported a third of patients dying within 48 hours of admission (35). A second study reported the average time from treatment initiation to death as 5.5 days and 5.3 days, due to bleeding or co-infection respectively (21). With such a short period between predictor measurement and outcome, it is not surprising that predictors associated with organ failure on the causal pathway to death, such as jaundice, bleeding, co-infection, and dyspnoea, feature prominently in the final model predictors. Interestingly, symptom duration, nutritional status, and relapse status were infrequently included in the final model. Whilst these factors may be causally important; by the time a patient arrives in hospital, symptoms and signs of organ failure are typically more closely associated with death, and therefore preferentially selected for inclusion. A significant limitation of this review stems from poor reporting of the identified studies. We excluded unpublished manuscripts, including conference abstracts and educational theses, given concerns about quality and difficulty in accessing the original works. Furthermore, in keeping with our prespecified inclusion criteria, this review excluded studies reporting diagnostic models (47,48), models that only evaluated single predictors (49–51), prognostic factor studies (19,52–54), or registered, but unpublished prognostic model studies (for example, [NCT05602610](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT05602610&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom)). Evaluation of the implementation of prognostic models in current practice and policy was also considered outside the scope of this review. ## Conclusion In 2015 the TRIPOD Statement was published, providing a 22-item checklist with the aim of facilitating the transparent reporting of prediction model studies (8,27). To reduce model bias and increase model uptake in clinical practice, we encourage investigators considering the development or validation of VL prediction models to consult these guidelines. Additionally, subsequently published resources are now available on how to present models to aid implementation (such as risk scores) (55), and for estimating adequate sample sizes (56,57). In summary, we present the first systematic review that identifies, summarises, and appraises prognostic models predicting clinical outcomes in patients with VL. We show that no models predict treatment failure or relapse, and no models were developed in South Asia or the Mediterranean region. These represent important gaps in our current knowledge. In the context of the current elimination programme in South Asia, estimating the risk of relapse could help at the population level by identifying infectious and potentially drug-resistant parasite reservoirs (58). Patients identified at high risk of relapse could be selected for more intensive management strategies, including further risk stratification with a parasitological test of cure, and active post-treatment follow-up. ## Supporting information Supplemental Table 1 [[supplements/304622_file05.docx]](pending:yes) Supplemental Table 2 [[supplements/304622_file06.docx]](pending:yes) Supplemental Table 3 [[supplements/304622_file07.docx]](pending:yes) Supplemental table 4 [[supplements/304622_file08.docx]](pending:yes) Supplemental table 5 [[supplements/304622_file09.xlsx]](pending:yes) Supplemental table 6 [[supplements/304622_file10.xlsx]](pending:yes) Supplemental text 1 [[supplements/304622_file11.docx]](pending:yes) Supplemental text 2 [[supplements/304622_file12.docx]](pending:yes) Supplemental text 3 [[supplements/304622_file13.docx]](pending:yes) ## Data Availability All data produced in the present work are contained in the manuscript and supplemental material * Received March 20, 2024. * Revision received March 20, 2024. * Accepted March 21, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## Bibliography 1. 1.Wilson J, Chowdhury F, Hassan S, Harriss EK, Alves F, Dahal P, et al. Prognostic prediction models for clinical outcomes in patients diagnosed with visceral leishmaniasis: protocol for a systematic review. BMJ Open. 2023 Oct 24;13(10):e075597. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMzoiMTMvMTAvZTA3NTU5NyI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzAzLzIxLzIwMjQuMDMuMjAuMjQzMDQ2MjIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 2. 2.Burza S, Croft SL, Boelaert M. Leishmaniasis. The Lancet. 2018 Sep;392(10151):951–70. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(18)31204-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30126638&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 3. 3.World Health Organization. Leishmaniasis [Internet]. [cited 2023 Aug 24]. Available from: [https://www.who.int/news-room/fact-sheets/detail/leishmaniasis](https://www.who.int/news-room/fact-sheets/detail/leishmaniasis) 4. 4.Alves F, Bilbe G, Blesson S, Goyal V, Monnerat S, Mowbray C, et al. Recent Development of Visceral Leishmaniasis Treatments: Successes, Pitfalls, and Perspectives. Clin Microbiol Rev. 2018 Oct;31(4):e00048–18. 5. 5.Alvar J, Aparicio P, Aseffa A, Den Boer M, Cañavate C, Dedet JP, et al. The Relationship between Leishmaniasis and AIDS: the Second 10 Years. Clin Microbiol Rev. 2008 Apr;21(2):334–59. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiY21yIjtzOjU6InJlc2lkIjtzOjg6IjIxLzIvMzM0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMjEvMjAyNC4wMy4yMC4yNDMwNDYyMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 6. 6.Moons KGM, Royston P, Vergouwe Y, Grobbee DE, Altman DG. Prognosis and prognostic research: what, why, and how? BMJ. 2009 Feb 23;338(feb23 1):b375–b375. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE2OiIzMzgvZmViMjNfMS9iMzc1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMjEvMjAyNC4wMy4yMC4yNDMwNDYyMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 7. 7.Moons KGM, Wolff RF, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration. Ann Intern Med. 2019 Jan 1;170(1):W1. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/M18-1377&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30596876&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 8. 8.Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration. Ann Intern Med. 2015 Jan 6;162(1):W1–73. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/M14-0698&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25560730&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 9. 9.Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ. 2020 Apr 7;m1328. 10. 10.Peetluk LS, Ridolfi FM, Rebeiro PF, Liu D, Rolla VC, Sterling TR. Systematic review of prediction models for pulmonary tuberculosis treatment outcomes in adults. BMJ Open. 2021 Mar;11(3):e044687. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMjoiMTEvMy9lMDQ0Njg3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMjEvMjAyNC4wMy4yMC4yNDMwNDYyMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 11. 11.Njim T, Tanyitiku BS. Prognostic models for the clinical management of malaria and its complications: a systematic review. BMJ Open. 2019 Nov;9(11):e030793. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiYm1qb3BlbiI7czo1OiJyZXNpZCI7czoxMjoiOS8xMS9lMDMwNzkzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMjEvMjAyNC4wMy4yMC4yNDMwNDYyMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 12. 12.Steyerberg EW, Moons KGM, Van Der Windt DA, Hayden JA, Perel P, Schroter S, et al. Prognosis Research Strategy (PROGRESS) 3: Prognostic Model Research. PLoS Med. 2013 Feb 5;10(2):e1001381. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pmed.1001381&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23393430&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 13. 13.1. Satoskar AR Abongomera C, Ritmeijer K, Vogt F, Buyze J, Mekonnen Z, Admassu H, et al. Development and external validation of a clinical prognostic score for death in visceral leishmaniasis patients in a high HIV co-infection burden area in Ethiopia. Satoskar AR, editor. PLOS ONE. 2017 Jun 5;12(6):e0178996. 14. 14.Costa DL, Rocha RL, Chaves EDBF, Batista VGDV, Costa HL, Costa CHN. Predicting death from kala-azar: construction, development, and validation of a score set and accompanying software. Rev Soc Bras Med Trop. 2016 Dec;49(6):728–40. 15. 15.Costa, D. Fatores de prognóstico na leishmaniose visceral: alterações clínicas e laboratoriais associadas à resposta imune, aos distúrbios da coagulação e à morte [Internet]. Universidade Federal de Minas Gerais; 2009 [cited 2023 Dec 3]. Available from: [http://hdl.handle.net/1843/BUOS-96FFS7](http://hdl.handle.net/1843/BUOS-96FFS7) 16. 16.Ministry of Health (Brazil). Visceral leishmaniasis: clinical recommendations for lethality reduction (Leishmaniose visceral : recomendações clínicas para redução da letalidade) [Internet]. 1st ed. Brasilia, Brazil; 2011. Available from: [https://bvsms.saude.gov.br/bvs/publicacoes/leishmaniose\_visceral\_reducao\_letalidade.pdf](https://bvsms.saude.gov.br/bvs/publicacoes/leishmaniose_visceral_reducao_letalidade.pdf) 17. 17.Ministry of Health (Brazil). Guide to Health Surveillance. 5th Edition, revised and updated. (Guia de Vigilância em Saúde. 5a edição revisada e atualizada.) [Internet]. 5th ed. Brasilia, Brazil; 2022 [cited 2023 Aug 24]. 126 p. Available from: [https://bvsms.saude.gov.br/bvs/publicacoes/guia\_vigilancia\_saude\_5ed\_rev\_atual.pdf](https://bvsms.saude.gov.br/bvs/publicacoes/guia\_vigilancia\_saude_5ed_rev_atual.pdf) 18. 18.Gorski S, Collin SM, Ritmeijer K, Keus K, Gatluak F, Mueller M, et al. Visceral leishmaniasis relapse in Southern Sudan (1999-2007): a retrospective study of risk factors and trends. PLoS Negl Trop Dis. 2010 Jun 8;4(6):e705. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pntd.0000705&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20544032&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 19. 19.Collin S, Davidson R, Ritmeijer K, Keus K, Melaku Y, Kipngetich S, et al. Conflict and Kala-Azar: Determinants of Adverse Outcomes of Kala-Azar among Patients in Southern Sudan. Clin Infect Dis. 2004 Mar;38(5):612–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/381203&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14986243&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000189163900002&link_type=ISI) 20. 20.1. Santiago HDC Coura-Vital W, Araújo VEMD, Reis IA, Amancio FF, Reis AB, Carneiro M. Prognostic Factors and Scoring System for Death from Visceral Leishmaniasis: An Historical Cohort Study in Brazil. Santiago HDC, editor. PLoS Negl Trop Dis. 2014 Dec 11;8(12):e3374. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pntd.0003374&link_type=DOI) 21. 21.Werneck GL, Batista MSA, Gomes JRB, Costa DL, Costa CHN. Prognostic Factors for Death from Visceral Leishmaniasis in Teresina, Brazil. Infection. 2003 Jun;31(3):174–7. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12789476&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000183699300008&link_type=ISI) 22. 22.Collins GS, De Groot JA, Dutton S, Omar O, Shanyinde M, Tajar A, et al. External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol. 2014 Dec;14(1):40. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/1471-2288-14-40&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24645774&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 23. 23.Snell KIE, Levis B, Damen JAA, Dhiman P, Debray TPA, Hooft L, et al. Transparent reporting of multivariable prediction models for individual prognosis or diagnosis: checklist for systematic reviews and meta-analyses (TRIPOD-SRMA). BMJ. 2023 May 3;e073538. 24. 24.Moons KGM, De Groot JAH, Bouwmeester W, Vergouwe Y, Mallett S, Altman DG, et al. Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies: The CHARMS Checklist. PLoS Med. 2014 Oct 14;11(10):e1001744. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pmed.1001744&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25314315&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 25. 25.Wolff RF, Moons KGM, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies. Ann Intern Med. 2019 Jan 1;170(1):51. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/M18-1376&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30596875&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 26. 26.Debray TPA, Damen JAAG, Snell KIE, Ensor J, Hooft L, Reitsma JB, et al. A guide to systematic review and meta-analysis of prediction model performance. BMJ. 2017 Jan 5;i6460. 27. 27.Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD): The TRIPOD Statement. Eur Urol. 2015 Jun;67(6):1142–51. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.eururo.2014.11.025&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25572824&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 28. 28.Riley RD, Moons KGM, Snell KIE, Ensor J, Hooft L, Altman DG, et al. A guide to systematic review and meta-analysis of prognostic factor studies. BMJ. 2019 Jan 30;k4597. 29. 29.Ingui BJ, Rogers MAM. Searching for Clinical Prediction Rules in MEDLINE. J Am Med Inform Assoc. 2001 Jul 1;8(4):391–7. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1136/jamia.2001.0080391&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11418546&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 30. 30.1. Smalheiser NR Geersing GJ, Bouwmeester W, Zuithoff P, Spijker R, Leeflang M, Moons K. Search Filters for Finding Prognostic and Diagnostic Prediction Studies in Medline to Enhance Systematic Reviews. Smalheiser NR, editor. PLoS ONE. 2012 Feb 29;7(2):e32844. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0032844&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22393453&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 31. 31.Covidence systematic review software, Veritas Health Innovation, Melbourne, Australia. 32. 32.Harris PA, Taylor R, Minor BL, Elliott V, Fernandez M, O’Neal L, et al. The REDCap consortium: Building an international community of software platform partners. J Biomed Inform. 2019 Jul;95:103208. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jbi.2019.103208&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31078660&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 33. 33.1. Reithinger R De Araújo VEM, Morais MHF, Reis IA, Rabello A, Carneiro M. Early Clinical Manifestations Associated with Death from Visceral Leishmaniasis. Reithinger R, editor. PLoS Negl Trop Dis. 2012 Feb 7;6(2):e1511. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pntd.0001511&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22347514&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 34. 34.1. Franco-Paredes C Sampaio MJADQ, Cavalcanti NV, Alves JGB, Fernandes Filho MJC, Correia JB. Risk Factors for Death in Children with Visceral Leishmaniasis. Franco-Paredes C, editor. PLoS Negl Trop Dis. 2010 Nov 2;4(11):e877. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pntd.0000877&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21072238&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 35. 35.1. Lockwood DNJ Kämink SS, Collin SM, Harrison T, Gatluak F, Mullahzada AW, Ritmeijer K. A clinical severity scoring system for visceral leishmaniasis in immunocompetent patients in South Sudan. Lockwood DNJ, editor. PLoS Negl Trop Dis. 2017 Oct 2;11(10):e0005921. 36. 36.1. Wang M Foinquinos J, Duarte MDC, Figueiroa JN, Correia JB, Cavalcanti NV. Temporal Validation of a Predictive Score for Death in Children with Visceral Leishmaniasis. Wang M, editor. J Trop Med. 2021 Dec 22;2021:1–6. 37. 37.1. Jaffe CL Cota GF, De Sousa MR, Rabello A. Predictors of Visceral Leishmaniasis Relapse in HIV-Infected Patients: A Systematic Review. Jaffe CL, editor. PLoS Negl Trop Dis. 2011 Jun 7;5(6):e1153. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pntd.0001153&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21666786&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 38. 38.1. Boelaert M Belo VS, Struchiner CJ, Barbosa DS, Nascimento BWL, Horta MAP, Da Silva ES, et al. Risk Factors for Adverse Prognosis and Death in American Visceral Leishmaniasis: A Meta-analysis. Boelaert M, editor. PLoS Negl Trop Dis. 2014 Jul 24;8(7):e2982. 39. 39.1. Werneck GL Abongomera C, Van Henten S, Vogt F, Buyze J, Verdonck K, Van Griensven J. Prognostic factors for mortality among patients with visceral leishmaniasis in East Africa: Systematic review and meta-analysis. Werneck GL, editor. PLoS Negl Trop Dis. 2020 May 15;14(5):e0008319. 40. 40.Wessler BS, Lai Yh L, Kramer W, Cangelosi M, Raman G, Lutz JS, et al. Clinical Prediction Models for Cardiovascular Disease: Tufts Predictive Analytics and Comparative Effectiveness Clinical Prediction Model Database. Circ Cardiovasc Qual Outcomes. 2015 Jul;8(4):368–75. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiY2lyY2N2b3EiO3M6NToicmVzaWQiO3M6NzoiOC80LzM2OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDI0LzAzLzIxLzIwMjQuMDMuMjAuMjQzMDQ2MjIuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 41. 41.Shah ND, Steyerberg EW, Kent DM. Recalibrating Expectations. 2018; 42. 42.On behalf of Topic Group ‘Evaluating diagnostic tests and prediction models’ of the STRATOS initiative, Van Calster B, McLernon DJ, Van Smeden M, Wynants L, Steyerberg EW. Calibration: the Achilles heel of predictive analytics. BMC Med. 2019 Dec;17(1):230. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12916-019-1466-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31842878&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 43. 43.Bouwmeester W, Zuithoff NPA, Mallett S, Geerlings MI, Vergouwe Y, Steyerberg EW, et al. Reporting and methods in clinical prediction research: a systematic review. PLoS Med. 2012;9(5):1–12. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pmed.1001349&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22629233&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 44. 44.Dhiman P, Ma J, Qi C, Bullock G, Sergeant JC, Riley RD, et al. Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review. BMC Med Res Methodol. 2023 Aug 19;23(1):188. 45. 45.Jervis S, Chapman LAC, Dwivedi S, Karthick M, Das A, Le Rutte EA, et al. Variations in visceral leishmaniasis burden, mortality and the pathway to care within Bihar, India. Parasit Vectors. 2017 Dec;10(1):601. 46. 46.Altman DG, Vergouwe Y, Royston P, Moons KGM. Prognosis and prognostic research: validating a prognostic model. BMJ. 2009 May 28;338(may28 1):b605–b605. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE2OiIzMzgvbWF5MjhfMS9iNjA1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMjEvMjAyNC4wMy4yMC4yNDMwNDYyMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 47. 47.Campos D. [Predictive score of infectious complications on kalazar of child]. J Pediatr (Rio J). 1996;72(6):411–4. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14688908&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 48. 48.1. Reithinger R Machado De Assis TS, Rabello A, Werneck GL. Predictive Models for the Diagnostic of Human Visceral Leishmaniasis in Brazil. Reithinger R, editor. PLoS Negl Trop Dis. 2012 Feb 28;6(2):e1542. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22389742&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 49. 49.Abdelmoula MS, M’Hamdi Z, Amri F, Tebib N, Ben Turkia H, Ben Dridi MF. [Visceral leishmaniasis in children: prognostic factors]. Tunis Med. 2003 Aug;81(8):535–9. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14608735&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 50. 50.Verrest L, Kip AE, Musa AM, Schoone GJ, Schallig HDFH, Mbui J, et al. Blood Parasite Load as an Early Marker to Predict Treatment Response in Visceral Leishmaniasis in Eastern Africa. Clin Infect Dis. 2021 Sep 7;73(5):775–82. 51. 51.Takele Y, Mulaw T, Adem E, Shaw CJ, Franssen SU, Womersley R, et al. Immunological factors, but not clinical features, predict visceral leishmaniasis relapse in patients co-infected with HIV. Cell Rep Med. 2022 Jan;3(1):100487. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.xcrm.2021.100487&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35106507&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F03%2F21%2F2024.03.20.24304622.atom) 52. 52.Da Silva TAM, Morais MHF, Lopes HMDOR, Gonçalves SA, Magalhães FDC, Amâncio FF, et al. Prognostic factors associated with death from visceral leishmaniasis: a case-control study in Brazil. Trans R Soc Trop Med Hyg. 2020 May 7;114(5):346–54. 53. 53.Oliveira-Sena IV, Werneck GL. Risk factors for in-hospital mortality from visceral leishmaniasis: A case-control study. J Infect Public Health. 2020 Apr;13(4):538–43. 54. 54.Kajaia M, Morse DL, Kamkamidze G, Butsashvili M, Chubabria G, Zenaishvili O, et al. Risk factors for relapse of visceral leishmaniasis in Georgia. Trop Med Int Health TM IH. 2011 Feb;16(2):186– 92. 55. 55.Bonnett LJ, Snell KIE, Collins GS, Riley RD. Guide to presenting clinical prediction models for use in clinical settings. BMJ. 2019 Apr 17;l737. 56. 56.Riley RD, Ensor J, Snell KIE, Harrell FE, Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020 Mar 18;368:m441. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE2OiIzNjgvbWFyMThfMi9tNDQxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMDMvMjEvMjAyNC4wMy4yMC4yNDMwNDYyMi5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 57. 57.Riley RD, Snell KIE, Archer L, Ensor J, Debray TPA, Van Calster B, et al. Evaluation of clinical prediction models (part 3): calculating the sample size required for an external validation study. BMJ. 2024 Jan 22;e074821. 58. 58.World Health Organization. Regional strategic framework for accelerating and sustaining elimination of kala-azar in the South-East Asia Region, 2022-2026 [Internet]. New Delhi: World Health Organization, South-East Asian Region; 2022. Available from: [https://iris.who.int/bitstream/handle/10665/361887/9789290209812-eng.pdf](https://iris.who.int/bitstream/handle/10665/361887/9789290209812-eng.pdf)