Abstract
We analyzed outcomes of 102,985 SARS-CoV-2 confirmed cases of patients attending the IMSS (Mexican Institute for National Insurance) from January 2 to August 3, 2020. We calculated relative IFR by age group and gender and introduced the concept of adjusted lethalities, that can be used to project the burden of the disease for a population with different demographic characteristics.
1. Introduction
Since it was first identified in pneumonia patients in Wuhan, China, the coronavirus disease 2019 (COVID-19) by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been characterized by its fast-spreading and high related disease burden [1]. The spectrum of symptoms is wide and ranges from asymptomatic infection to severe respiratory illness [2]. By August 4, 2020, almost 20 million confirmed cases of COVID-19 had been reported globally and a fatal outcome was registered in nearly a half million of them [3]. Gender and age-related differences in disease severity and risk of death have been documented [4].
The lethality of a disease, also known as Infection Fatality Rate (IFR) is the fraction of infected individuals that die from the disease. It’s estimation is of the upmost importance for the design of and application of containment measures and to decide if lockdowns are implemented as when these should be reinforced or lifted. Unfortunately, our data does not come from a random sample and we cannot estimate the IFR, instead, we estimate the relative share of the IFR by age group and gender and introduce the concept of adjusted lethalities that can be used to project the burden of the disease for a population whose demographic characteristics is known.
Methodology
We used the database of the Instituto Mexicano de Seguro Social (IMSS, acronym for Mexican Institute for National Insurance) with confirmed SARS-CoV-2 cases. The IMSS provides health services to about 10% of the population in Mexico, over 12 million persons. The database on SARS-CoV-2 we used contains confirmed cases from January 2 to June 23, 2020. We included only cases with disease outcome as dead or recovered, leaving a database with 102,985 cases, from which 55 % were males and 45% females.
When a patient is diagnosed with COVID-19, it follows one of the paths depicted in Figure 1. With this data we constructed Table 1 after dividing the population in seven age categories and two genders.
Table 2 shows a processing of the information in Table 1. Vectors h, c, d and dc in Table 2 are described as follows:
h = relative frequency of hospitalized patients.
c = fraction requiring ICU among those that went into hospitalization.
d = fraction dying among those that went into hospitalization.
dc = fraction dying among those that went into ICU.
As mentioned before, it is not possible to estimate the IFR from the data available (Table 1) since the death rates are conditional on patients mostly symptomatic and thus they are not a representative sample of the response to an infection. Nevertheless, there is a couple of things that we can obtain from Table 2. The fist one is the relative lethality and the second adjusted lethality.
2. Relative lethality
Table 2 tells us, for instance, that the probability that a female person in age group 40-49 that is hospitalized from infection with SARS-CoV-2 dies is 0.3548, but this does not provides information on the probability that a female infected person in age group 40-49 dies from the disease. Nevertheless, observe that if fij is the fraction in the population of individuals of gender i, i = 1,2 and group age j,j = 1, 2,…, 7., and we assume that everyone is equally likely to become infected but there is a differential response to the disease, then, if p is the fraction of individuals in the population that is infected and βij is the fraction of infected individuals of gender i and age group j that become hospitalized, then:
Where hij is the fraction of hospitalized that belong to age group i and gender j. Now, if αij is the fraction of individuals hospitalized that die from COVID-19, then where dij is the fraction of individuals of these hospitalized that dies.
Let θij be the lethality or infection fatality rate of gender i and age group j, then, the lethality of gender i and age group j can be written as:
Thus, the IFR by gender and group age follows this relationship:
The reason why we only establish that θij is proportional to some value is because we ignore the value of p, the true proportion of infected individuals in the population. Thus, we define as the relative lethalitiy:
Even although the value of p is not known, the ratio of any two relative lethalities reflects the ratio of the true lethalities, that is: therefore this quotient can be used to compare the lethality of any two groups. Table 3 shows in the last column the relative infection fatality rates in (5), after normalization. In this table fM and fF are the relative frequencies by age group and gender in the Mexican population. In this table we can see, for instance, that the lethality of COVID-19 in age group 60+ is 0.37511/0.19795 = 1.9 times higher in males than in females, and that the lethality in the 60+ group compared to the group with less than 20 years age is: times larger. Table 3 also shows that the lethality among people 50 years or older is about 6 times larger than among people less than 50 years old.
3. Adjusted lethality
It is possible to estimate the true values of θM and θF. The lethality of the disease is the probability that a person at random that has been infected with SARS-CoV-2 dies, which is: which assumes all individuals are equally likely to become infected. If we have an estimate of the lethality of the disease, θ, then we can rescale and multiplying by a constant c so that
Some estimates of the IFR are in the range 0.05% – 0.25% [5, 6, 7]. Here we will use the lethality reported by [5] of 86 in 100, 000 (about 0.1%) because it is a more recent study with a large sample size, thus, we use θ = 0.001. The adjusted lethalities by age group and gender are shown in Table 4.
One immediate application of Table 4 is the possibility of projecting the IFR for another population whose demographics by age and gender are known. Consider for instance the demographics of the Za’atari refugee camp, in Jordan1, which is shown in Table 5. We use θM and θp from Table 4 and the fM and fF from the camp’s demographic and thus the IFR projected for this camp is:
Observe this adjusted lethality is about 2.7 times smaller than the overall IFR, which is due to the fact that Za’atari refugee camp is a young population with 3/4 of the refugees being less than 30 years old. Although this comparison used θ = 0.001 as the overall IFR of SARS-CoV-2, the observed ratio of 2.7 is independent of this value because the adjusting constant c in 7 cancels out. Clearly, this projection includes only demographic factors and not other health and socioeconomic factors or availability of health services. Figure 2 shows a comparison between the demographics of the Za’atari camp and the Adjusted lethalities from Table 4 by age group and gender.
The distribution of deaths across age groups and gender
Another use of the adjusted lethalities in Table 4 is the estimation of the distribution of deaths across group ages and genders. The question we want to answer is: what is the probability that a dead individual belongs to a particular gender and age group? This is clearly given by fM ○ θM and fF ○ θF after normalization. For the Mexican population, a plot of this distribution is given in Figure 3.
Data Availability
The data set is only available to researchers of the IMSS for their analysis and exploration.
Conflicts of interest
Authors declare no conflict of interest.
Funding and study approval
This work is part of the program “Building the Evidence on Protracted Forced Displacement: A Multi-Stakeholder Partnership”. The program is funded by UK aid from the United Kingdom’s Department for International Development (DFID), it is managed by the World Bank Group (WBG) and was established in partnership with the United Nations High Commissioner for Refugees (UNHCR). The scope of the program is to expand the global knowledge on forced displacement by funding quality research and disseminating results for the use of practitioners and policy makers. This work does not necessarily reflect the views of DFID, the WBG or UNHCR. This study had approval R-2020-601-07 by the Health Research Ethics Committee (601) of the IMSS.
Footnotes
The previous version analyzed 34,686 patients. We updated the database and now we include the results for 102,985 patients.