We consider SIR model which divides the population into three compartments:
All three variables are normlized by dividing the total population so that their values are in . Introduce two parameters:
The model is
with initial condition , and . Note that the notation is the famous Basic Reproduction Number of the disease not the initial value of .
Explanation: the product is the number of susceptible person will encounter an infected one. Multiply the rate of infection is the new infected population. is the rate of recovered population.
Simply add these three equations, we have and thus for all . This is known as the conservation of population. No death or birth is included in the model.
Since , for all , we can eliminate and obtain a system of only.
The first equation of implies is decreasing and the last one implies is increasing. As they are bounded in , their limit exist as and denote by .
We then let and the deriavatives on the left hand side of become zero as the functions apporache to constants. Therefore the right hand side of should also be zero. We then obtain the relation and consequently .
Let's try to solve . Divide the first equation in by the second one imples an ODE
with initial condition . It can be easily solved to get
Substitute in terms of in the second equation of , we get a nonlinear ODE of
Let and use , we obtain from the relation
whose solution can be easily obtained by, e.g., Newton's method. In real application, ( e.g. 1 patient over million people) and we can replace by and solve the equation
to get an accurate approximation of the population ratio which remains non-infected in the end.
We rewrite the equation for as
As is decreasing and in and for the epidemic model , we can divide into three stages according to the sign of .
We apply perturbation analysis for solving nonlinear ODEs. We use normalized variables and replace by 1. The nonlinear ODE we try to solve is
with an apporiate initial condition . Let . From the third equation of SIR model , we have the relation .
Then solution to can be written as
The challenge is that we do not know the antiderative of for such a nonlinear function.
What can we solve? We can solve when is simple polynomials such as linear or quadratic. So we can use polynomial approximations to at different point to find approximations of . More specifically, given time and a value , we build the condition into the solution by the following form
Note that we have to assume the interval does not contains the root of . Otherwise the integeral on the left could be divergence.
Consider . We get the linear approximation
Note that when , we cannot integrate from . That is and cannot be zero. Using , we get
So we can use to determine .
Consider where are two real roots of . Here we consider only the case the quadratic function has two real roots. To be well defined, we assume the interval so that the roots of are not in .
Use the factorization
and integrate , we get
where rate . We can solve to get
with parameters
which is a logistic function transition from to as the limits
Its critical point, i.e., is given by
Using the relation , we get the formulae on
We list the function and its derivatives
The Taylor series at a certain point is
We shall apply the formula to several points and get approximations at different stages.
We apply the formula to . Then
For the linear approximation and , we obtain
Namely in the initial stage, it is like the solution of ODE . The solution grows with rate and decays with rate .
As mentioned early, and cannot be zero. We can determine by looking at for which the initial condition is given. From
we get which is small but positive. That is is a small perturbation of but not zero.
The quadratic approximation of near is which implies
Instead of pluging into the formula to simplify, we leave the general formula as it can be computed using values of . Note that the rate is the same as the one in the linear approximation.
We compare the analytic approximation with the numerical solution. In these figures, blue curves are numerical solution obtained by ode45
in MATLAB and red is the linear approximation and black is the quadratic one.
In the early stage, e.g, , both linear and quadratic approximations are very accurate. But after that, the linear approximation keeps exponentially increasing with the same rate while the quadratic one will slow down and approaches to a limit (for ) or zero (for ) due to the behavior of logistic functions.
We can use formula to predict the peak time but be aware that could be optimistic. The infected number could still increase for a while.
We compute derivatives of at
The function value . On the other hand, by the formulation of , we obtain the relation and thus the derivatives can be simplified to
For the linear approximation, we have . The linear approximation of is
and linear approximation of is
with rate . Recall that the increasing rate in the early stage is which is usally bigger than the decay rate in the end.
If we chose , we get constant approximations which are not interesting. Instead we chose when we observe the function starts to decay for a while but close to the flat tail. The linear approximation tells us although the decay is not as fast as the pandemic starts, it will decay to zero exponentially.
For the quadratic approximation, we have the Taylor expansion
Therefore
Again we plug into the general formulation and compare with the numerical solution.
Again the quadratic one provides a much better approximation In the ending stage and not trustable in the initial stage.
We shall apply the formula at , where is the peak time when achieves the maximum. Denote by
From the equation for ,
by solving , we know
Use relation , we can easily get
Therefore the linear approximation of is the constant approximation and
The quadratic approximation is given by
from which we get
By the symmetry, we have simplified formulae
and
The comparison to the numerical solution can be found in the figures below. The approximation is surprisingly good for all time . It is worthing to point out the shape of the function is determined by the single parameter . To determine the location, the peak time is also needed.
We can glue quadratic approximation in different stages to get a piecewise logistic function approximation of the true solution of SIR model although we cannot solve it analytically.
We can assume the peak time is observable from the data. We chose a small interval near like . Denote the quadratic approximation in the early, peak, and ending stage as , respectively. We will use the piecewise approximation
Similar definition is applied to obtain .
We plot the approximation by black dash lines which are almost not distinguishable from the numerical solution.