Abstract
The outbreak of COVID-19 has caused tremendous pressure on medical systems. Adequate isolation facilities are essential to control outbreaks, so this study aims to quickly estimate the demand and number of isolation beds. We established a discrete simulation model for epidemiology. By adjusting or fitting necessary epidemic parameters, the effects of the following indicators on the development of the epidemic and the occupation of medical resources were explained: (1) incubation period, (2) response speed and detection capacity of the hospital, (3) disease cure time, and (4) population mobility. Finally, a method for predicting the reasonable number of isolation beds was summarized through multiple linear regression. The prediction equation can be easily and quickly applied to estimate the demanded number of isolation beds in a COVID-19-affected city. A detailed explanation is given for the specific measurement of each parameter in the article.
1. Introduction
SARS-CoV-2 is a novel coronavirus that has the ability of human-to-human transmission1,2. Coronavirus disease 2019 (COVID-19) caused by SARS-CoV-2 has been defined by the World Health Organization (WHO) as a pandemic (the worldwide spread of a new disease) 3. As of April 5, 2020, more than 1100000 cases of COVID-19 have been reported in different countries and territories 4. Currently, researchers around the world are making every effort to clarify the biological and epidemiological characteristics of SARS-CoV-2 and strive to explore effective coping strategies 5-7.
COVID-19 is extremely contagious, and its explosive growth in a short space of time has caused tremendous pressure on medical resources 8. Conventional medical conditions have difficulty meeting the needs of the detection capability for suspected cases and the number of isolation beds for treatment and isolation 9-11. The number of isolation beds is crucial to reduce the scale of infection and reduce the number of fatalities. Too few isolation beds can lead to the continuation of the epidemic, and too many isolation beds may cause waste and environmental damage 12-14.
To explore a reasonable number of isolation beds, we established a discrete simulation model of epidemics based on COVID-19. By setting different epidemic indicators (incubation period, hospital response time, healing time, population mobility rate), we analyzed the changing laws of the epidemic situation, peak value, and scale of the epidemic in different situations. In particular, we pay attention to the occupation of medical resources during the outbreak. We summarized some epidemic indicators related to the number of isolation beds through multiple linear regression and estimated the number of isolation beds through these indicators. The conclusion is practical, which can provide support for the reasonable scheduling of medical resources and the search for effective solutions in the current outbreak or in similar future outbreaks.
2. Result
2.1 Impact of epidemic factors on the infection situation under sufficient isolation facilities
2.1.1 Impact of incubation period on the epidemic development
The incubation period is an asymptomatic stage in the early stages of disease development, at which point patients themselves will not suspect that they have been infected. We compared the infection situation of different incubation periods. ANOVA showed that the mean values of all indicators among groups were not exactly the same (P<0.05). The detailed differences among groups are shown in Table 1.
Further analysis was performed using multiple comparisons. The maximum number of incubation cases, the sum of infected cases and the corresponding date for the peak number of inpatients were significantly different between any two groups (P<0.05). The long incubation period promoted these epidemic indicators. The maximum number of newly confirmed cases and their corresponding dates, corresponding date of peak incubation cases, maximum value of Rt, duration of the epidemic, and maximum number of inpatients were not exactly equal among the groups in different incubation periods, and there were significant differences among some groups (P<0.05). The above indicators increased with the increase in the incubation period. (Figure 1)
2.1.2 Impact of the hospital response time on the epidemic development
In the simulation model, hospital response time refers to the time it takes for a patient to develop a first symptom until a clear diagnosis is obtained. This time is extended if the patient is unwilling to go to the hospital to undergo detection testing, if the hospital detection method fails to diagnose the disease early, or if there are insufficient hospital detection reagents.
ANOVA showed that there were no significant differences in the corresponding date of the maximum number of incubation cases among different response time groups (P>0.05), and the mean of the other indicators among the groups was not exactly equal (P<0.05). The detailed differences among the groups are shown in Table 1.
The multiple comparisons showed that the sum of infected cases between any two groups was significantly different (P<0.05). The sum of the infected cases increased with the extension of response time. The maximum number of newly confirmed cases and its corresponding date, the maximum number of incubation cases, the maximum of Rt, the duration of the epidemic, the maximum number of inpatients and its corresponding date were not exactly equal among the groups at different response times, and there were significant differences among some groups (P<0.05). The above indicators increased with the extension of the response time (Figure 2).
2.1.3 Impact of the healing time on the epidemic development
The healing time refers to the average time from admission to discharge. ANOVA showed that the mean values of the maximum value of Rt, the maximum number of inpatients and their corresponding dates among different healing time groups were not exactly the same (P<0.05), and the other indicators were not considered to be significantly different (P>0.05). The detailed differences among the groups are shown in Table 1.
Further multiple comparisons showed that the maximum number of inpatients between any two groups was significantly different (P<0.05). The maximum number of inpatients increased with the extension of the healing time. The maximum value of Rt and the corresponding date of peak inpatient number were not exactly equal among the groups at different healing times, and there were significant differences among some groups (P<0.05). The extension of the healing period promoted the increase in the above indicators (Figure 3).
2.1.4 Impact of the population mobility on the epidemic development
The population mobility rate refers to the proportion of people in motion to the total population. We compared the infection situation of different population mobility rates. ANOVA showed that there were no significant differences in the maximum value of Rt and the duration of the epidemic among the groups (P>0.05), and the mean of other indicators among the groups was not exactly equal (P<0.05). The detailed differences among the groups are shown in Table 1.
The multiple comparisons showed that the sum of the infected cases between any two groups was significantly different (P<0.05). The increase in the population mobility rate caused a higher sum of the infected cases. The maximum number of newly confirmed cases and its corresponding date, the maximum number of incubation cases and its corresponding date, the maximum number of inpatients and their corresponding dates were not exactly equal among the groups in different population mobility rates, and there were significant differences among some groups (P<0.05). Among them, the maximum number of newly confirmed cases, the maximum number of incubation cases and the maximum number of inpatients increased with the increase in the population activity rate. At the extreme value of 0%, that is, when everyone was inactive, the corresponding date of peak incubation cases and the corresponding date of peak inpatient number were significantly advanced, which was significantly different from that of the other groups (Figure 4).
2.2 Impact of the hospital isolation capacity on the infection situation
The hospital isolation capacity is defined by the proportion of the actual quantity of isolation beds to the demanded quantity for isolation beds. ANOVA showed that there were no significant differences in the maximum number of newly confirmed cases, the maximum number of incubation cases and the maximum value of Rt among different isolation capacity groups (P>0.05), and the mean of the other indicators among the groups was not exactly equal (P<0.05). The detailed differences among groups are shown in Table 2.
The multiple comparisons showed that the corresponding dates of the peak number of inpatients and the duration of isolation facilities at their full capacity between any two groups were significantly different (P<0.05). The corresponding date of the peak inpatient number was delayed with the decrease in isolation capacity. The duration of isolation facilities at their full capacity increased with the lack of isolation beds, which showed a quadratic relationship (Supplement material). In addition, the corresponding date of peak newly confirmed cases, the corresponding date of incubation cases, the sum of infected cases and the duration of the epidemic were not exactly equal among the groups, and there were significant differences among some groups (P<0.05). In the severely inadequate isolation capacity group (40%), the corresponding date of incubation cases, the mean value of Rt, the sum of infected cases and the duration of the epidemic were significantly increased (P<0.05) (Figure 5).
2.3 Multiple regression analysis and parameter prediction of the reasonable number of isolation facilities
2.3.1 Multivariate regression analysis
To further explore the rational setting of isolation beds in the medical system under multifactor epidemic conditions, we analyzed the relationship between different epidemic indicators and a reasonable number of isolation beds by multiple regression analysis. The t-test showed that the independent variables of incubation period, population mobility rate, hospital response time and healing time significantly affected the reasonable number of isolation beds (P<0.05) (Table 3). Finally, we obtained the following regression equation (R2=0.841):
N indicates a reasonable number of isolation beds. P means population, which refers to the total population in the corresponding area. I means incubation period of the epidemic. M means population mobility rate, which refers to the proportion of people in motion to the total population. T1 means hospital response time, which refers to the time it takes for a patient to develop the first symptom until a clear diagnosis is obtained. T2 means healing time, which refers to the average time from admission to discharge. T1 and T2 can be estimated based on a certain number of cases. Considering that a certain number of isolation beds should be reserved to cope with emergency situations, we set V as the reserve amount. This article suggests that it is generally reserved at 10% according to the simulation results.
2.3.2 Prediction of a reasonable number of isolation beds in different healing times
Here, the prediction method was applied to an example (Figure 6). The predicted number of isolation beds were basically consistent with the model operating results. The parameter setting is the same as model 2.1.3.
3. Discussion
3.1 Impact of the incubation period on the epidemic development
According to result 2.1.1, we found that a longer incubation period significantly promoted the infectivity, scale and duration of the epidemic. By tracking this phenomenon, it was found that the patients in the incubation period and without mobility restriction caused a high level of transmission before displaying symptoms by contacting others, which was similar to the views of Li P and Jiang X et al. 15,16. To contain an outbreak, early detection of suspected cases is critical 17. Some studies have described that a longer incubation period may be beneficial for epidemic control 18, as this allows the Centers for Disease Control and Prevention (CDC) to have more time to deal with the overall epidemic. This conclusion may be more applicable to some known diseases, but for unknown diseases, we believe that a longer incubation period represents a more dangerous signal, making the development of the epidemic uncontrollable18,19. Due to the reduced predictability of the disease outbreak scale, it is more difficult to track patients. As a result, the disease may spread to a wider range of people, making it difficult to control.
Our results showed that increasing hospital response speed could improve infectivity and scale of an outbreak, which is consistent with previous research20-22. Shortening the hospital response time depends on the public’s awareness of epidemic prevention and the level of medical technology. On the one hand, the public needs to pay more attention to the epidemic and actively cooperate with early detection; on the other hand, medical technology determines the time it takes for the detection method to give an accurate result. In addition, if sufficient isolation facilities can be provided, the centralized isolation of all suspected patients who cannot be excluded can also help reduce the hospital response time 8,23.
Compared with the hospital response time, the impact of the population mobility rate on the duration of the epidemic is not significant in the case of abundant medical resources. However, unrestricted population mobility can cause a large medical load and consumption of medical supplies and will generate a large number of infected cases, resulting in adverse socioeconomic impacts 24-26. In reality, medical resources are not only limited, but also lack most of the time. Therefore, it cannot be considered that the population mobility has no effect on the duration of the outbreak. The detailed consequences of inadequate medical resources are discussed in section 3.2. In extreme cases, all people stopped activities, i.e., when the mobility rate is 0%, the overall scale of the epidemic is dramatically reduced, and the corresponding date of the epidemic peak sharply advances. It is speculated that the disease may saturate in a small area after taking this extreme prevention and control measure; thus, the transmission will be completely blocked. Since there is no new-generation infection, different transmission laws are displayed.
3.2 Impact of insufficient isolation facilities on the epidemic development
Since there is no specific treatment or vaccine for COVID-19, containment of the epidemic depends more on traditional public health measures 27. The results showed that the low capacity of hospital isolation would lead to serious consequences. This is consistent with previous studies showing that intervention through timely isolation measures is effective in controlling the epidemic 28,29.
In addition, the simulation model showed that the duration of isolation facilities at full capacity showed a quadratic relationship with the gap in the number of isolation facilities. That is, as the gap in the number of isolated beds expands, the growth rate of full-load time gradually accelerates, which may increase the uncontrollability of the epidemic. In our model, the hospital isolation capacity does not affect the ability of to detect the disease and diagnosis patients, so it has no significant effect on the maximum number of newly confirmed cases and their corresponding dates. However, the increase in the duration of isolation facilities at full load shows a more severe overload of the medical system. Thus, under the condition of insufficient isolation capacity, expanding the original size of the hospital or constructing fangcang shelter hospitals are the key measures to contain outbreaks 9,23,30.
3.3 Multiple regression analysis and parameter prediction of the reasonable number of isolation facilities
We summarized the related factors that affected the demanded quantity of isolation beds by multiple regression analysis. Furthermore, multivariate regression analysis was used to estimate the reasonable number of isolation beds: The regression equation shows that the population mobility rate is the variable with the highest weight, which indicates that the restriction of population mobility is the critical factor to contain outbreaks and effectively reduce the load on the medical system 20,24,31,32. We believe that reducing the epidemic scale by restricting population mobility can also help to provide time for the establishment of temporary isolation.
In practice, the incubation period (I) can be estimated from the time between traceable harmful exposure to the time of the first symptom 19,33. The population mobility rate (M) can be roughly estimated by the ratio of the population in unrestricted mobility, including medical personnel and administrative personnel, to the total population. 20,34. Hospital response time (T1) and healing time (T2) can be estimated based on a certain number of cases. In addition, we recommend V = 10% as a reserve to address emergency situations under actual conditions. The above indicators are easy to obtain and estimate, which provides a feasible guarantee for using this method to estimate a reasonable number of isolation facilities.
More importantly, estimating the number of isolation facilities based on the epidemic situation and relevant parameters of the medical system will help to predict the pressure of the medical system in different areas in advance. This will provide decision-making support for the rational arrangement of medical resources and epidemic control.
4. Method
4.1 Establishment of discrete simulation model and parameter interpretation
4.1.1 Establishment of the discrete simulation model
In this discrete simulation model, we use the Java language to carry out object-oriented programming. The eight states of people were defined as normal, shadow, supershadow, suspected, confirmed, isolated, cured, and dead. At the same time, for the times of being infected, suspected, confirmed, isolated, cured, and died were independent attributes configured to simulate the process with the actual world process, so the program can produce all attributes, including each moment, and record, analyze and calculate the statistics for each simulation individual.
In the model, the length of the incubation period, the time from being a suspected case to being diagnosed, the length of isolation, the rate of population mobility, the probability of infection, and the probability of death after infection can be adjusted or fitted as the necessary parameters for simulation. Most of the time, the parameters follow the normal distribution model, and the mean and standard deviation are defined by the parameters. The probability follows the random number model and is set by the probability value. The simulation object state transition logic is shown in Figure 7.
4.1.2 General assumptions
During the simulation process, the total population of the city is constant, and the number of initial cases is known.
The incubation period (shadow time), time of death (died time) and time of cure (cured time) of an individual were in accordance with the normal distribution.
There is a fixed response time interval between the onset of patient symptoms and the moment of hospital diagnosis. The hospital always has enough resources to make a diagnosis when the response time is reached.
When exposed to an infected person within a dangerous distance, the probability of infection and death after infection is constant.
After being cured, the hospital will discharge the patient and be released from isolation at once.
Unless otherwise specified, these characteristics do not change over time during disease transmission.
The simulation model ends when all patients have been discharged.
The patient can not be contagious or infected again after being cured.
4.1.3 Parameter interpretation
Population mobility rate
The percentage of the population that has willingness to move. Healing time: The mean time between being in isolation and being discharged.
Incubation period
The time from infection to self-detection of suspected symptoms. Fatality rate: The probability of death after diagnosis.
Dead time
Mean time from diagnosis to death.
Hospital response time
The time from the patient’s suspected symptoms to a definitive diagnosis. Transmission rate: The probability of being infected by contact with an infected person within an unsafe distance.
4.2 Model parameters setting
In order to prevent deviations in the simulation process, each set of parameters was repeated 10 times. After removing outliers, the mean was used to draw the curve and analysis.
4.2.1 Impact of the incubation period on the epidemic development
This model is used to discuss the influence of the disease incubation period on epidemic infection and medical resource occupation. In this model, we make the assumption that the hospital’s isolation capacity is strong enough to admit all patients.
To achieve a single variable, we assign the following parameters: the total population=5000, number of initially infected persons=20, population mobility rate=1, healing time=15 days (standard deviation=2 days), hospital response time=1 day, fatality rate=0.05, and dead time=10 days (standard deviation=5 days) remained the same during the simulation. The experimental groups were as follows: the mean incubation period was set at 3, 7, 10, 14 and 21 days.
4.2.2 Impact of the hospital response time on the epidemic development
This model is used to discuss the impact of the hospital response time on epidemic infection and medical resource occupation. In this model, the hospital’s isolation capacity is strong enough to admit all patients.
To achieve a single variable, we assign the following parameters: the total population=5000, number of initially infected persons=20, incubation period=7 days (standard deviation=5 days), population mobility rate=1, healing time=15 days (standard deviation=2 days), fatality rate=0.05, and dead time=10 days (standard deviation=5 days) remained the same during the simulation. The experimental groups were as follows: the hospital response time was set at 1, 3, 5, 7 and 10 days.
4.2.3 Impact of the healing time on the epidemic development
This model is used to discuss the influence of the hospital cure time on the epidemic infection and medical resource occupation. In this model, the hospital’s isolation capacity is strong enough to admit all patients.
To achieve a single variable, we assign the following parameters: the total population=5000, number of initially infected persons=20, incubation period=7 days (standard deviation=5 days), population mobility rate=1, hospital response time=1 day, fatality rate=0.05, and dead time=10 days (standard deviation=5 days) remained the same during the simulation. The experimental groups were as follows: the hospital healing time was set at 5, 7, 10, 15 and 20 days.
4.2.4 Impact of the population mobility on the epidemic development
This model was used to discuss the influence of the hospital cure time on the epidemic infection and medical resource occupation. In this model, the hospital’s isolation capacity is strong enough to admit all patients.
To achieve a single variable, we assign the following parameters: the total population=5000, number of initially infected persons=20, incubation period=7 days (standard deviation=5 days), hospital response time=1 day, healing time=15 days (standard deviation=2 days), fatality rate=0.05, and dead time=10 days (standard deviation=5 days) remained the same during the simulation. The experimental groups were as follows: the population flow rate was set to 0, 30%, 50%, 80% and 100%.
4.2.5 Impact of the hospital isolation capacity on the infection situation
This model was used to discuss the influence of the hospital isolation capacity on the epidemic infection.
To achieve a single variable, we assign the following parameters: the total population=5000, number of initially infected persons=20, incubation period=7 days (standard deviation=5 days), population mobility rate=1, hospital response time=1 day, healing time=15 days (standard deviation=2 days), fatality rate=0.05, and dead time=10 days (standard deviation=5 days) remained the same during the simulation. The experimental groups were as follows: the hospital isolation capacity was set to 40%, 60%, 80% and 100%.
4.2.2 Multiple regression analysis and parameter prediction of the reasonable number of isolation facilities
Multilinear regression (MLR) analysis was used to evaluate the impact of the simulation parameters on the dependent variable demand for the number of isolation beds. Meanwhile, the prediction method was applied to an example under different healing times to test the accuracy.
4.4 Statistical analysis
Continuous variables are compared using the average. Using SPSS v26.0 (IBM authorized Central South University to use) for data analysis, analysis of variance (ANOVA) was used to analyze the level of significant difference between the groups. When the variance is homogeneous, the least significant difference (LSD) method and multiple comparisons are used to analyze between any two groups; when the variance is not homogeneous, Tamhane’s T2 tests and the multiple comparison are used to compare the mean between any two groups. The Grubbs method was used to address outliers. For all statistical analyses, the test level was α = 0.05.
R is used to represent the goodness of fit of the multiple regression to measure the fitting degree of the estimated model to the observed values. The table of regression analysis lists the results of the significance test of independent variables (using t-test) and the P value of t-test, indicating whether independent variables have a significant influence on dependent variables.
Data Availability
The codes of the current study are available in the GitHub repository, https://github.com/coolleafly/COV_SIM/. The average data generated by 10 simulations for each model are included in this published article (and its supplementary information files). Other data are available from corresponding author upon reasonable request.
Data availability
The codes of the current study are available in the GitHub repository, https://github.com/coolleafly/COV_SIM/. The average data generated by 10 simulations for each model are included in this published article (and its supplementary information files). Other data are available from corresponding author upon reasonable request.
Contributors
XL and YL conceptualized the study, analyzed the data, and drafted and finalized the manuscript. YC and LX contributed to the concept, draft, and finalization of the paper. YD, GH, and YL read drafts and provided input. All authors approved the final version of the manuscript.
Declaration of interests
We declare no competing interests.
Acknowledgments
We thank Bruce Yong for providing a prototype of the program. We referred to his open-source virus broadcast simulation project and used it as the prototype of our simulation design. With Bruce Yong’s consent, we redesigned some programs according to the epidemic situation. Thank you to Prof. Zhoushun Zheng, Prof. Canhua Jiang and Prof. Changyun Fang of Central South University for their support for this project. Thank you to the MAIGO mathematical modeling team for their help with this project.
This work was supported by the Hunan Province Science, Technology Department (Hunan Natural Science Fund - Youth Foundation Project, 2018JJ3850), the Hunan Health Commission (B2019192), Natural science foundation of Hunan province (2017JJ2392), Scientific research project of Hunan provincial health commission (B20180054), and Changsha science and technology project (kq1706072).
Footnotes
↵# Xinyu Li and Yufeng Cai are co-first authors on this work.