ABSTRACT
Introduction Data on burden of SARS-CoV-2 infections by age group and for different severity levels are lacking. We estimated the South African SARS-CoV-2 disease burden and severity, describing changes in the shape of the disease burden pyramid with successive waves.
Methods We estimated SARS-CoV-2 medically and non-medically attended illness stratified by severity (mild, severe non-fatal, and fatal) during the initial five waves, spanning 1 March 2020 through 13 August 2022. We utilised individual-level national surveillance, healthcare-utilisation and serosurvey data to calculate wave-specific hospitalisation-fatality (HFR) and infection-fatality ratios (IFR). We estimated wave-specific incidence rates per 100,000 population with 95% confidence intervals derived from bootstrapping the individual-level data.
Results On 13 August 2022, the estimated cumulative number of SARS-CoV-2 infections in South Africa was 105 million, of which 399,886 (0.38%) were severe non-fatal and 258,754 (0.25%) fatal. 29% of severe non-fatal illness and 55% of deaths occurred outside hospital. The highest burden of severe and fatal illness was during the Delta wave (wave 3), and the HFR across the initial three waves was similar (range 31%-34%). Although there were more infections during the Omicron BA.1 wave (wave 4), there was a substantial reduction in HFR (14%). Successive waves saw a reduction in the rate of increase in mortality and hospitalisations with increasing age.
Conclusions The substantial South African national burden of SARS-CoV-2 for the initial five waves contradicts the belief of minimal impact in Africa. A high proportion of severe non-fatal and fatal illness occurred outside of hospital, highlighting the importance of studies of health-seeking and vital registration systems to document the full burden of illness. Highest burden of severe illness and death was in the Delta wave. Following Omicron emergence severe illness reduced, and age-distribution for the incidence of medically attended severe non-fatal illness shifted to a J-shape, possibly reflecting the shift from widespread transmission to an endemic pattern.
What is already known on this topic?As SARS-CoV-2 spread globally, initial assumptions painted a bleak picture for Africa due to its existing challenges in healthcare service delivery, multimorbidity, poverty and lack of resources needed to fight the infection. The number of cases and deaths reported during the pandemic seemed to contradict these initial assumptions. South Africa recorded over 4 million laboratory-confirmed cases of COVID-19 during the first three years of the pandemic. However, it is estimated that only a tenth of the cases were diagnosed. With the lack of testing, inconsistent healthcare-seeking behaviour, changes in attack, reinfection and symptomatic rates, the true burden of SARS-CoV-2 across the different age groups and severity levels were largely unknown. Although real-time epidemiological data was crucial for informing intervention strategies throughout the pandemic, it is now essential to quantify and describe the evolution of the epidemiology over successive pandemic waves as more information was made available.
What this study adds?We found a high burden of severe illness and death in the first three waves of SARS-CoV-2, peaking in the third (Delta) wave. The emergence of the Omicron BA.1 variant was associated with very high rates of infection but substantial reductions in disease severity. Incidence of severe illness and hospitalisation fatality, generally increased with increasing age. Successive waves saw a reduction in the rate of the increase in mortality with increasing age and increases in hospitalisation fatality ratios in children below 5 years of age suggesting shift from the epidemic state to a J-shaped distribution in mortality, typical of seasonal respiratory viruses. Notably a high proportion of severe illness (29%) and death (55%) occurred outside hospital.
How this study might affect research, practice or policy?Our study provides insights into the changes in patterns of infection and disease following introduction of a novel pathogen into a susceptible population which may be useful for future pandemic planning. The high proportion of undiagnosed and unreported illness and the high proportion of severe illness and death occurring outside of the hospital suggest that strengthening of access to diagnosis and care is needed in our setting.
INTRODUCTION
As of the end of August 2023, confirmed numbers of COVID-19 cases and deaths worldwide exceeded 770 million and 6.9 million, respectively.1 South Africa is an upper middle-income country in Sub-Saharan Africa with a population of more than 60 million individuals.2,3 In 2022, 28% of the population was aged below 15 years and the population HIV prevalence was 12.6%.3 More than 4 million laboratory-confirmed cases of COVID-19 were identified in South Africa from the first confirmed case in early March 2020 until 25 March 20234, cumulatively comprising almost half (42.7%) the total confirmed COVID-19 cases in Africa.5 SARS-CoV-2 testing capacity in Africa is limited and in South Africa, less than 10% of all SARS-CoV-2 cases during the first three waves are estimated to have been diagnosed.6,7 South Africa’s vaccination roll-out was initiated among healthcare workers in February 2021 and extended to the elderly and adolescents in May and October 2021, respectively.8 Vaccination coverage within South Africa reached 27% by 12 January 2022.9
From 5 March 2020 through 31 August 2022, South Africa experienced five waves of SARS-CoV-2, each dominated by a different variant (D614G, Beta, Delta, Omicron BA.1/2 and Omicron BA.4/5 respectively)10. As the virus becomes endemic with successive waves of infection, a rise in population SARS-CoV-2 immunity induced by natural infection and vaccination can lead to changes in the epidemiology of disease.11 These changes may include reductions in disease severity, as well as changes in the overall burden and age distributions of infections, illnesses and deaths. Furthermore, different emerging SARS-CoV-2 variants may be associated with immune evasion and variation in virulence, infectiousness and/or transmissibility, which can affect rates of infection and reinfection as well as variant’s intrinsic severity of illness.12–14
South Africa’s government funded15 public healthcare sector is the primary provider of healthcare for the majority of the population16, yet resources are limited17. Although public clinics, hospitals, or other institutions are the primary point of healthcare access for 72% of the population, healthcare-seeking behaviour varies.16 Considering the lack of testing as well as variable healthcare-seeking behaviour and changes in the symptomatic proportion, attack rates, and reinfection frequency across the five SARS-CoV-2 waves, the shape of the disease burden pyramid is largely unknown.
Data on the burden of SARS-CoV-2 in successive waves across different levels of severity are limited globally. This is particularly important in Africa, where resources are limited and official statistics suggest that the burden of cases and mortality was substantially lower than in other regions.5 We aimed to estimate the burden of SARS-CoV-2 infection and severe disease in South Africa from March 2020 through August 2022, and to describe the variations in the shape of the disease burden pyramid across the first five pandemic waves. At the start of the pandemic, it was critical to have realistic epidemiological data to model the impact of interventions. The models incorporated all accessible SARS-CoV-2 epidemiological data and were support tools to South African policy makers; updating stakeholders and the South African public throughout the course of the pandemic.18 However, now it is important to develop best estimates of the epidemiology, and how it changed over time as information about the spread of a novel respiratory infection became available.
MATERIALS AND METHODS
1. Conceptual outline of burden strata with data sources
1.1 Strata definitions
The SARS-CoV-2 burden comprises both asymptomatic infection (no clinical manifestation of disease) and symptomatic illness. We estimated the burden in three mutually exclusive severity strata: non-severe infection (asymptomatic infection and mild or moderate illness), severe non-fatal illness, and fatal illness.19 Severe non-fatal and fatal illness were further stratified into medically and non-medically attended illness (Figure 1). Individuals with severe illness who were admitted to hospital or individuals who died in hospital were considered medically attended. Individuals with severe illness who were not hospitalised and those who died out of hospital were considered non-medically attended.19
1.2 Data sources
Individual-level SARS-CoV-2 public and private hospital admissions and subsequent in-hospital deaths from the DATCOV national surveillance programme for SARS-CoV-2 hospitalisations.8 DATCOV includes data on COVID-19 hospitalisations from all private and public healthcare facilities in South Africa. Hospitalised patients comprised individuals with a positive SARS-CoV-2 test, irrespective of whether hospitalisation was attributable to SARS-CoV-2 or not. Reporting of all laboratory-confirmed cases was legally mandated. DATCOV variables included demographic details such as age and sex, admission information such as the date and reason for admission, and patient outcome such as death or discharge.
South African Medical Research Council (SAMRC) estimates of excess deaths by sex and 5-year age bands.20 The SAMRC, in partnership with the University of Cape Town (UCT), adapted the existing annual Rapid Mortality Surveillance (RMS) process to produce a near real-time (weekly) system for following and observing COVID-19 excess deaths. Weekly updates to the deaths recorded on the National Population Register (NPR), together with the classification of deaths due to natural or unnatural cause were obtained and adjusted for both under and late registration of deaths. Excess SARS-CoV-2 associated deaths are estimated by comparing observed to expected deaths derived from a negative binomial regression model of deaths in pre-COVID-19 years, described in detail elsewhere.21 While we attempted to account for the proportion of deaths attributable to SARS-CoV-2, robust data on this are lacking and it is possible that this varied over the pandemic.
Healthcare utilisation surveys (HUS) conducted in three communities in three provinces.22 From November 2020 through April 2021, fieldworkers enrolled 23,003 individuals from 5,804 randomly selected households. All individuals reporting severe respiratory illness (SRI) since the start of the SARS-CoV-2 pandemic were asked about healthcare-seeking behaviour including the proportion of these seeking care at healthcare facilities.
Age-stratified infection attack rates by SARS-CoV-2 wave obtained from the Prospective Household study of SARS-CoV-2, Influenza and Respiratory Syncytial virus community burden, Transmission dynamics and viral interaction in South Africa – Coronavirus disease 2019 (PHIRST-C) cohort.23 This cohort comprised 1,200 participants randomly selected from two communities in two provinces, including 643 participants from a rural site in Agincourt (Mpumalanga Province, South Africa) and 557 participants from an urban site in Jouberton (North-West Province, South Africa). Ten consecutive serum specimens were obtained from each participant between July 2020 and April 2022 and tested for anti-SARS-CoV-2 antibodies.6,7,12,23 The estimated age-stratified infection attack rates from the urban site were employed in calculations, because these were thought to be more representative of South Africa as a whole since the majority of South Africans live in urban areas.24 The infection attack rate was defined as the number of new infections divided by the population at risk over a defined period.25
Mid-year population estimates, unstratified and stratified in 5-year age bands and sex strata, provided by the government agency, Statistics South Africa.26,27
2. Definitions
2.1 Wave
The total number of SARS-CoV-2 hospitalisations and in-hospital deaths were calculated by epidemiological year (epiyear) and epidemiological week (epiweek). The peaks (maximum SARS-CoV-2 hospitalisations and in-hospital deaths), as well as the nadirs (minimum hospitalisations and in-hospital deaths between these peaks) were then determined. For both hospitalisations and in-hospital deaths, the first wave was defined to extend from the start of the epiweek associated with the start of the pandemic in South Africa to the end of the epiweek of the nadir after this peak. The subsequent waves were defined to extend from the start of the epiweek following the nadir prior to the peak till the end of the epiweek of the nadir following the peak (Figure 2, Supplementary Table 1). However, to achieve agreement between the in-hospital and SAMRC excess deaths wave definitions, an epiweek was included at the end of wave 2 and the start of wave 3. Wave 5 was truncated at epiweek 32 of 2022. The SARS-CoV-2 hospitalisations and in-hospital deaths wave definitions were employed for the respective hospitalisation and in-hospital fatalities stratum estimates.
2.2 Country population at wave peak
The extracted mid-year population estimates (Section 1.2(e)) were obtained for each year, and the proportion weekly change in the population was estimated by dividing the proportion change in the population numbers between the epidemiological years and the number of weeks. Under the assumption of a linear weekly proportional change, the weekly population numbers were then interpolated by addition of the starting population and the calculated change.
3. Estimation approach
All analyses and calculations were carried out in RStudio 2023.06.0+421 for Windows28, running R Statistical software version 4.2.329. The burden estimation approach is outlined hereafter; calculations for each stratum were implemented on an age-stratified and per-wave basis unless otherwise noted. Furthermore, national burden estimates were subsequently obtained from age-stratified figures. Figure 1 depicts the burden pyramid and related disease severity strata with data inputs and relationships between strata.
3.1 Severe non-fatal and fatal illness
a) Medically attended severe non-fatal illness (incidence of hospitalisation)
Using the SARS-CoV-2 hospitalisation wave definitions, all hospital admissions (B), hospital admissions with admission reasons known (C), and hospital admissions explicitly attributed to SARS-CoV-2 (A) were extracted from DATCOV, respectively. The proportion of hospital admissions explicitly attributed to SARS-CoV-2 among all admissions with known admission reasons was defined as: The total in-hospital deaths (D) were determined from DATCOV using the SARS-CoV-2 in-hospital deaths definitions. The adjusted number of severe, medically attended SARS-CoV-2 hospitalisations was obtained by subtracting the in-hospital deaths (D) from the total number of admissions (B) and multiplying the outcome by the proportion of explicitly attributed SARS-CoV-2 admissions (from (1)). That is, Finally, the medically attended severe non-fatal illness incidence rate per 100,000 (SM) was calculated by dividing the adjusted number of SARS-CoV-2 hospitalisations (from (2)) by the respective national population estimate at the peak of the SARS-CoV-2 hospitalisations wave (PH) and multiplying by 100,000:
b) Medically attended fatal illness (incidence of in-hospital death)
The in-hospital explicitly attributed SARS-CoV-2 deaths from explicitly attributed SARS-CoV-2 hospital admissions (E) were determined from DATCOV using the SARS-CoV-2 in-hospital deaths wave definitions. The hospitalisation-fatality ratio (HFR) (defined as the proportion of SARS-CoV-2 deaths among SARS-CoV-2 hospitalised individuals)30 was determined as follows: The adjusted number of total hospitalisations was then calculated by dividing the adjusted number of SARS-CoV-2 hospitalisations (from (2)) by the proportion of hospitalisations that did not result in death, i.e., 1 – HFR (from (4)). That is, The medically attended fatal illness incidence rate per 100,000 (FM) was then determined by subtracting from (5) the adjusted SARS-CoV-2 hospitalisations (from (2)), dividing by the respective national population estimate at the peak of the SARS-CoV-2 in-hospital deaths wave (PD), and multiplying by 100,000:
c) Non-medically attended severe non-fatal illness
The incidence of non-medically attended severe non-fatal illness (SN) is defined as the difference between the incidence of hospitalisation (SM) (from (3)) divided by the average proportion of severe illness seeking care from the HUS (J) and the incidence of hospitalisation (SM), i.e., We assumed the average proportion of severe illness seeking care from the HUS remains unchanged irrespective of age strata or wave as the survey did not have sufficient power to stratify, and estimates were not available by wave.
d) Non-medically attended fatal illness (incidence of out-of-hospital deaths)
We assumed that 85% of excess deaths (G) were attributable to SARS-CoV-2 based on published data. The incidence of out-of-hospital deaths per 100,000 (FN) was then (8) calculated as the SARS-CoV-2 attributable excess deaths (0.85G) less the adjusted SARS-CoV-2 in-hospital deaths, divided by the respective national population estimates at the peak of the SARS-CoV-2 deaths waves (PD) times 100,000. That is, For the age-specific estimates, age groups for which in-hospital deaths exceeded the attributable excess deaths, the non-medically attended fatal illness estimate was set to zero, i.e., the SARS-CoV-2 attributable excess deaths were assumed equal to the in-hospital deaths.
3.2 Non-severe, non-fatal illness (i.e., moderate, mild and asymptomatic illness)
The non-severe, non-fatal infection incidence (I) was obtained by multiplying the wave and age-specific attack rates (H) from PHIRST-C (refer to Section 1.2 (d)) by 100,000 and subtracting out the medically and non-medically attended severe non-fatal and fatal illness (from (3),(6) – (8)) since disease severity strata are assumed mutually exclusive:
4. Estimation of uncertainty
Confidence intervals surrounding the incidence estimates were obtained from bootstrapping the individual-level data. That is, the wave-specific admission and in-hospital death data from DATCOV were each randomly sampled with replacement 1000 times stratified by wave. The average proportion of severe illness seeking care from the HUS was randomly generated from the binomial distribution with a probability of success out of a number of trails defined as the total number of individuals reporting severe illness across HUS sites given by the actual average and subsequently used within calculations for each of the 1000 replications on a wave-specific basis. It was assumed that the health seeking behaviour was consistent across age groups. Age-specific attack rates were obtained by simulating 1000 rates from the random binomial distribution with a probability of success defined as the actual attack rate from PHIRST-C and enforcing a threshold criterion to prevent the subsequent infection incidence from being negative. This threshold was defined to be the maximum of the sum of the 95% confidence interval upper bounds of the total incidence of severe non-fatal illness and total incidence of fatal illness. Should the generated random attack rate times 100 000 not be greater than the threshold, we continue to draw random attack rates until the threshold is exceeded. 95% confidence intervals were estimated as the 2.5 and 97.5 % quantiles of the respective bootstrapped/simulated estimates. No confidence intervals were generated for the incidence of out-of-hospital deaths and total incidence of death due to unavailability of line lists for bootstrapping. Confidence intervals for the incidence of non-severe non-fatal and non-fatal illness, IFR and SRIFR are solely derived from the uncertainty around the infection attack rates and incidence of severe non-fatal non medically and medically attended illness.
5. Ethical considerations and data sharing
Excess death data were provided in aggregate form; therefore, individuals’ consent was not required. The Human Research Ethics Committee (Medical) at the University of the Witwatersrand (Johannesburg, South Africa) approved the DATCOV protocol as part of a national surveillance programme (M160667). Because COVID-19 is a notifiable disease, individual patient consent was waived. The PHIRST-C and HUS protocol were approved by the University of Witwatersrand, Johannesburg, South Africa, Human Research Ethics Committee (reference 150808 and M200862, respectively). The data for this analysis was obtained from several different sources. Requests for the underlying data should be made from the custodians of the data. Aggregated DATCOV data are available on request to the South African National Institute of Communicable Diseases (NICD). The data dictionary is available on request to the data custodian, Dr Waasila Jassat (waasilaj{at}genesis-analytics.com). The analysis code and details of data custodians are available in the Github repository located at https://github.com/BoltonL/Burden_estimation_public. Detailed computational outputs from the analyses are provided in the Supplementary materials.
6. Patient and Public Involvement
The study involved a synthesis of retrospective data and thus it was not possible to involve patients or the public in the design, conduct, reporting or dissemination plans of this research.
RESULTS
Overall
Overall, across the first five waves of SARS-CoV-2 in South Africa, from 1 March 2020 through 13 August 2022, we estimate that there were 104,577,430 SARS-CoV-2 infection episodes, of which 399,886 (0.38%) were severe non-fatal illness episodes and 258,754 (0.25%) deaths out of a population of approximately 300 million individuals under observation over the period (Figure 2). 29% of severe non-fatal illness and 55% of deaths occurred outside of hospital.
Severe disease and death overall
Incidence of both medically- and non-medically attended severe non-fatal and fatal illness increased from wave 1 to 3, with highest incidence occurring during the Delta wave (wave 3) (Figure 3, Table 1). The incidence of death, both medically- and nonmedically attended, also increased across the first three waves, peaking during the Delta (3rd) wave, with 69.2 in-hospital and 87.1 out-of-hospital deaths per 100,000 population, respectively. Thereafter, severe illness and death incidences decreased progressively in waves 4 and 5. The hospitalisation-fatality ratio (HFR), was approximately 32% in the first 3 waves, decreasing slightly from wave one to three (from 34% to 31%), but then dropped substantially in wave four and five (to 14%). In contrast, the out-of-hospital severe respiratory infection-fatality ratio (SRIFR) showed no consistent trend ranging from 37% to 65% across all five waves (Table 2).
Infections and non-severe non-fatal illness overall
Similar to severe illness and death, the number of SARS-CoV-2 infections increased over the first three waves from 15.1 million in wave one to 20.6 million in wave three (Table 2). Following the emergence of the Omicron variant in wave 4 the number of infections peaked in wave 4 at 34.2 million and decreased thereafter in wave 5 to 20.2 million. In the first three waves the infection fatality ratio (IFR) ranged between 0.27%-0.59% and was highest in the second (Beta) wave. In the fourth and fifth waves the IFR dropped substantially to 0.07%.
Severe disease and death by age group
The incidence of medically and non-medically attended severe non-fatal and fatal illness for those aged 65 years and older increased noticeably from wave 1 to wave 2 and a further sharp increase for individuals 75 years and older in wave 3 (Figure 3). The incidence of medically attended severe non-fatal and fatal illness in those aged 80 years and over was 802.5 (95%CI: 768.6-835.3) and 894.1 (95% CI: 815.7-978.2) per 100,000, respectively in wave 3. Following the emergence of the Omicron variant in wave 4, the incidence of severe non-fatal illness and deaths decreased markedly among individuals aged 15 years and older. Among individuals aged under 15 years incidence of severe illness was slightly increased in the fourth wave reducing again in the fifth wave. Deaths among individuals aged less than 15 years remained low in all five waves. Generally, as infection waves progress, the age-specific incidence of medically attended severe non-fatal illness approaches a J-shape curve (higher at the youngest and oldest ages, but higher at the oldest ages).
Changes in HFR and IFR with age
Generally, the HFR increased with increasing age, showing a small increase in young children in some waves (Figure 4, Supplementary Figure 1). In wave 1, the 10-to-14-year-olds had the lowest HFR of 6% and those 80 years and older, the greatest at 57%. In individuals aged 30 years and older there was a marked drop in HFR in the fourth and fifth waves compared to the first three waves. Individuals aged 80 years and older had the highest HFR across waves and only experienced a 53% reduction in HFR ratio from wave one to five. In contrast, the HFR for individuals aged 30 years and older had a noticeably lower rate across age groups in waves 4 and 5 than waves 1 to 3. Consider for example, in wave 1 the difference in HFR between 30-to-34-year-olds and 80-years and above is about 42%, whilst in wave 5 this difference is around 18%. Like the HFR, the older age groups (65 years and above) experienced the highest IFR across all waves. For all waves the IFR was extremely low in individuals aged <40 years.
DISCUSSION
We estimate the burden of SARS-CoV-2 infection and disease severity in South Africa over the first five waves of infection to understand changes in disease burden with successive waves. We found a high burden of severe illness and death in the first three waves of SARS-CoV-2, peaking in the third (Delta) wave. The emergence of the Omicron BA.1 variant was associated with very high rates of infection but substantial reductions in disease severity. Incidence of severe illness and hospitalisation fatality, generally increased with increasing age. Successive waves saw a reduction in the rate of the increase in mortality with increasing age and increases in hospitalisation fatality ratios in children below 5 years of age suggesting shift from the epidemic state to a J-shaped distribution in mortality, typical of seasonal respiratory viruses. Notably a high proportion of severe illness (29%) and death (55%) occurred outside hospital.
The third (Delta) wave of infection had the highest incidence of severe non-fatal and fatal illness. Wave 4 (Omicron BA.1/2 variant) was characterised by marked reductions in severe disease; however, the incidence of asymptomatic, mild, or moderate disease was highest in this wave. Increasing severe disease in the Beta14 and Delta32 waves has been described from South Africa and other countries, possibly related to increased virulence and relaxation of restrictions in later waves. The high attack rates and lower severity following Omicron emergence can be explained in part by the increased transmissibility and immune evasive properties of Omicron.12,33 Together these led to very high infection attack rates. The high proportion of breakthrough infections in the Omicron wave likely contributed to reduced overall severity together with changes in virus tropism increasing the proportion of infections in the upper airway.34
Successive waves of infection were associated with shifts in the age distribution of severe disease. In all five waves, severe disease was concentrated in the elderly as has been described previously, driven in part by increasing comorbidities in this age group.14,35,36 Following the emergence of the Omicron variant, the reductions in severe disease were mostly seen in older adults, leading to a flattening of the steepness of increasing severity with increasing age and proportionately more severe disease in young children. This shift towards an age pattern more typical of seasonal respiratory viruses where severe disease is concentrated in the elderly and young children, could represent the transition from a pandemic to endemic epidemiology.11 Notably, despite the overall reduction in disease severity, more children and adolescents were hospitalised during the Omicron wave (BA.1/2 variant) than in the previous infection waves. This pattern of proportionately more disease in children following Omicron emergence in South Africa has been described previously, however not in the context of estimation of the burden pyramid in all age groups.8,37,38 Possible reasons for the shift in age distribution include changes in population immunity, pathogen virulence or changes in health seeking behaviour or testing practices. Persons aged 60 years and older were prioritised for vaccination in South Africa as from May 2021 and extended to 12 to 17 year olds in October 2021.8 This may also have contributed to the sharp decline in HFR among the elderly from wave 1 to wave 5 since vaccination coverage within South Africa reached 27% by 12 January 2022.9 However, the reduction in mortality in people living with HIV was not as dramatic with the emergence of the Omicron waves compared to HIV negative.39
A high proportion of severe disease and deaths occurred outside of hospitals40, highlighting the importance of studies of healthcare-seeking and access and strengthening of vital registration systems to document the full burden of illness.20 The high number of severe cases at the peaks of the waves caused pressure on the healthcare system, potentially reducing access and exacerbating underlying health inequities.22,41 An important limitation of our study is that we only had healthcare utilisation data from three communities in three provinces for the first two waves of the pandemic.22,41 Our assumption that healthcare seeking remained constant may not be valid and could be the reason why we do not see clear trends in the SRIFR over successive waves as we do for HFR.
Our study had other limitations. Out of hospital deaths were ascertained from modelling of excess COVID-19-associated deaths, using all cause vital registration data, thus not all excess deaths may have been due to COVID-19. We accounted for this by assuming that the attributable proportion was fixed for all waves; if the proportion did vary over time, this could have led to bias in estimates. Data for the different strata of severity were derived from different sources. For health seeking and infection attack rates, national data were not available, and we had to extrapolate nationally under the assumption that the available data were representative. If these measures varied by geographical region, this would have led to biased estimates. Although the infection attack rates accounted for reinfections, the estimated hospitalisations and deaths within our study occur as they do, whether they result from reinfections or not. Within our estimation of the medically attended severe non-fatal and fatal illness, it must be noted that the proportion of in-hospital deaths attributed to SARS-CoV-2 may not necessarily have been the same as the proportion of SARS-CoV-2 hospitalisation. Finally, we used the mid-year population estimates from 2020 through 2022 provided by Statistics South Africa as our population denominators throughout this study. These estimates are obtained from the Spectrum Policy Modelling system.42 However, the Thembisa model is also available to obtain population estimates.3 The total population estimated by the latter model (~59 million)3 is different from the recently published Census 2022 (~62 million)43 due to an underestimation of those 55 years and older.
In conclusion, we have demonstrated a high burden of infection and severe disease associated with SARS-CoV-2 over five successive waves in South Africa. With the onset of SARS-CoV-2 it was assumed that Africa would carry a large burden of infection and mortality due to weak health systems, lack of access to therapeutics and vaccines, poverty, malnutrition and high prevalence of underlying illness such as HIV and tuberculosis44,45, as well as high levels of persons left undiagnosed and untreated or not effectively treated. While reports of confirmed cases suggest that Africa was relatively spared, our analysis combining multiple data sources, suggests that this was not the case for South Africa. The estimated cumulative SARS-CoV-2 fatalities in South Africa across the first five waves (258,754) exceeds the number of confirmed SARS-CoV-2 deaths of high-income countries such as Germany (150 237) and the United Kingdom (205 780).46 Even more stark, our estimated cumulative SARS-CoV-2 cases over this period (105 million) far exceeds that reported for these high-income countries, namely 31.6 and 23.4 million, respectively.46 Furthermore, despite differences in methodology, our estimated IFR for wave 1 (0.27%) was comparable to that of Leticia, Columbia (0.28%).47 The discrepancy between low numbers of confirmed cases and actual number of infections were similarly observed across six districts in Zambia. During the first wave, a SARS-CoV-2 prevalence study reported 454 708 infections compared to a mere 4 917 laboratory confirmed cases.48 In future, it will be important to estimate the ongoing age-specific burden of COVID-19 compared to other respiratory viral illnesses in order to decide on priorities for prevention interventions such as vaccination.
Data Availability
The data for this analysis was obtained from several different sources. Requests for the underlying data should be made from the custodians of the data. Aggregated DATCOV data are available on request to the South African National Institute of Communicable Diseases (NICD). The data dictionary is available on request to the data custodian, Dr Waasila Jassat (waasilaj{at}genesis-analytics.com).
AUTHOR CONTRIBUTIONS
LB: formal analyses, investigation, software, validation, visualisation, writing – original, writing – review and editing. ST: conceptualisation, supervision, methodology, writing – subject-specific commentary in original, writing – review and editing. CC: conceptualisation, supervision, methodology, project administration, data provision, writing – subject-specific commentary in original, writing – review and editing, resources. JP: supervision, methodology, writing – review. SW: data collection, advisor, writing – review and editing. DB and RD: methodology, primary data – SAMRC excess deaths, writing – subject-specific commentary in original, writing – review and editing. WJ: primary data – DATCOV, writing - review and editing. KS: analyses of serological data, writing - review and editing. JK, NM, AvG: data collection – PHIRST-C, writing – review and editing. NW – data collection – PHIRST-C, HUS, writing – review and editing.
FUNDING
SACEMA (LB and JRCP) was supported by the South African Department of Science and Innovation and the National Research Foundation for the duration of this research (grant number 44895). NW obtained funding for the HUS from the South African Medical Research Council (Reference number SHIPNCD 76756), The Wellcome Trust and the United Kingdom Foreign, Commonwealth and Development Office (Grant no 221003/Z/20/Z) and United States Centers for Disease Control and Prevention (Grant number 5 U01IP001048-05-00).
COMPETING INTERESTS
All authors have completed the ICME uniform disclosure form and declare:
J.P and L.B. received support for the current manuscript from Wellcome Trust and the South African Department of Science and Innovation (Centre of Excellence Grant). N.W. and C.C. received support from The US Centers for Disease control and prevention and C.C. furthermore received support from Wellcome Trust and South African Medical Research Council. J.P. has further received research grants from the Bill and Melinda Gates foundation and Stellenbosch University, N.W. from the Bill and Melinda Gates foundation, C.C. from PATH, the Bill and Melinda Gates foundation and Sanofi Pasteur, and S.W. from the Bill and Melinda Gates foundation, the US Centers for Disease control and prevention and the South African National Institute for Communicable Diseases. L.B. has undertaken contract research for the South African National Blood Services, the South African National Institute for Communicable Diseases, Department of Paediatrics and Child Health at Stellenbosch University (African Academy of Sciences/Bill and Melinda Gates Foundation grant) and University of Witwatersrand (Thuthuka grant). J.P. and L.B. has been paid consulting fees by the National Institute for Public Health and the Environment, The Netherland, and Department of Paediatrics and Child Health at Stellenbosch University, respectively. J.P. participated in a data safety monitoring or advisory board for the Medical Research council for global infectious disease analysis, Imperial college London. J.K. received support for attending meetings or travel from Fogarty International Center, National Institutes of Health, Bethesda, Maryland, Bill and Melinda Gates Foundation and University of Pittsburg, L.B. from Grand Challenges Africa, Science for Africa, World Society for Paediatric Infectious diseases, Global Women’s Research Society and Department of Paediatrics and Child Health at Stellenbosch University; N.W. from the World Health Organisation and J.P. from the Bill and Melinda Gates Foundation. J.K. held a leadership or fiduciary role in other board, society, committee or advocacy group, paid or unpaid at the South African Field Epidemiology Training Programme Alumni Association. N.M.’s institution has received funding from Pfizer for an unrelated research project. N.M sits on the boards of the Wits Health Consortium (Pty) Ltd – an unpaid position - and the Setshaba Research Centre for which he is paid per board meeting. He has no other potential conflicts of interest. A.vG. has received funding from the US Centers for Disease control and prevention, SA Medical Research Council (SAMRC)/UKZN KRISP, CDC Atlanta and CDC SA (through AFENET), Africa PGI (through ASLM), Fleming Fund Regional Africa WGS (SEQAFRICA) and WHO AFRO.
All other authors have no competing interests to declare.
FIGURE LEGENDS
ACKNOWLEDGMENTS
This work is based on research supported by the South African National Research Foundation (NRF). Any opinion, finding, and conclusion or recommendation expressed in this material is that of the authors and the NRF does not accept any liability in this regard. We acknowledge the support provided by OpenAI’s ChatGPT, which offered AI-driven insights that facilitated our editorial and review processes in the preparation of the manuscript by providing feedback and suggestions to improve the manuscript.