Return to COVID-19 projections

Sydney COVID-19 outbreak projections – 2 August 2021

What’s new?

There are no changes to the interpretation of the model. Today’s announcement of 207 local cases in NSW continues to be consistent with the models. The Gompertz model seems to be performing well with relatively good levels of error in its 7- and 14-day projected estimates (from a week, and a fortnight ago).  There is some evidence that it might be slightly underestimating.

The model projections suggest that the outbreak is likely to continue to grow, with the rate of growth still increasing.  We are likely to see another 1500-2000 new cases over the coming week.  Although the current lock down conditions in Greater Sydney have been extended until the end of August 2021, it is not likely that the outbreak will be better than we are now unless there is a very substantial improvement in the transmission dynamics of the outbreak.

 

Context and timeline of the Sydney 2021 outbreak

I started these analysis in early July with the new COVID-19 (delta variant) outbreak in Sydney NSW.  The purpose of these charts is that they provide some projections into the near future. Hopefully this allows for some data driven expectations.  The Sydney 2021 series starts on 12 July 2021 and are available here: https://vitualis.com/?page_id=4071

The current Sydney outbreak started on 17 June 2021 – the first day with reports of community cases in Bondi.  The NSW Government formally commenced stay-at-home orders (“lockdown”) for a number of inner-Sydney regions on 25 June 2021 at 2359, having announced this earlier in the day. This was then broadened 18 hours later to cover the whole of Greater Sydney.  The initial plan was a two-week lockdown, with it potentially being lifted on 9 July 2021.

By 9 July 2021, there were early signs that COVID-19 transmission was worsening.  On 15 July 2021, the lockdown was again extended until the end of July, and then extended again, this time for four weeks on 28 July 2021.  Conditions at the end of the month for people living with South Western, and Western Sydney included a limit to travelling only 5 km from home, masks at all times outside, with only essential workers allowed to leave the local government area, with the requirement of a COVID-19 swab every 72 hours.

 

Projection of new daily cases, and cumulative counts of COVID-19 with data up to 2 August 2021

 

What is this?

The top image is a chart of the cumulative (total) COVID-19 cases in NSW, starting from 17 June 2021, and the lower image is a chart of the daily new cases.  Only local cases are included (i.e., excluding cases identified in quarantine).  Projections are given for the next 7 days.  It should be noted that estimates have high levels of uncertainty beyond a few days and must be interpreted cautiously.

The projections are made using a model by fitting the cumulative case data since 17 June 2021 to a Gompertz curve using non-linear regression. The dark central dashed lines are the model estimates, with 95% confidence intervals of the estimate. On the lower chart, the blue gradations can be understood as the degree of uncertainty in the model projections.

Note: previously I used 95% prediction intervals, which can be interpreted as the model’s prediction of where 95% of the samples will fall. The 95% confidence interval does not imply this.  Rather, it can be interpreted as the degree of imprecision/uncertainty of the model estimate – the range of values that are 95% compatible with the estimate itself.  The reason I made a change is that there is a problem with the way I’m calculating the interval which likely makes it inaccurate, with a bias towards it assuming greater error than suggested by the model.

 

Gompertz and Richards’ growth curve

The Gompertz function is a type of sigmoid, or “S”-shaped curve. It’s been around since the early 19th century and was initially used to describe and model demographic mortality curves, and hence, well known to actuaries. The Gompertz function can also be used to accurately model biological growth (e.g., epidemics, tumour size, enzymatic reactions). I have chosen to use this model to help with creating insights as earlier in the pandemic, it was found to be useful in modelling cumulative cases of COVID-19 from the Chinese outbreaks (Jia et al. arXiv:2003.05447v2 [q-bio.PE]).

I had previously used the Richards’ growth curve (or the generalised logistic function), which is a broad family of sigmoid (S-shaped) curves that can describe well many types of growth, including epidemics. It has been demonstrated to have utility in modelling COVID-19 outbreaks in 2020 (Lee et al. PLoS One 2020 doi: 10.1371/journal.pone.0236860).

Why the change?

It was clear that both models were not fitting the data well, but the Richards’ curve model was performing especially poorly, with estimates that were obviously implausible.  I’ve undertaken some assessment of the degree of predictive error in both the Gompertz and Richards’ growth curve models.  These charts compare the 7- and 14-day total case projections of the models, to what actually occurred in reality 7 and 14 days later.  For interpretation, above the 0% error line means that the model provided an over-estimate compared to reality, and below the 0% error line an underestimate.

Both models were in early July providing substantial underestimates, which is what I suspected when I started this series. The Richards’ growth curve model has struggled with the fit to the data, with poor predictive accuracy.  It is entirely useless at 14 days.  The Gompertz model can be seen as doing quite well, especially its 7-day projections.  The Richards’ growth model described the Melbourne 2020 outbreak well, especially when I applied it after the daily number of new cases had reached its peak.   Retrospectively using the model early in the outbreak found a similar pattern of poor performance.

 

Daily case trends

Comparison between the Gompertz and Richards’ growth curve model projections, along with smoothed data trends (7-day simple moving average, and GAM) with data up to 2 August 2021

The generalised additive model gives a descriptive “reality check” to the models.  The GAM can be considered as an advanced smoothed trend of the daily counts.  My interpretation is that the initial lockdown did reduce the grow rate in cases, but did not reverse the trend.  In early July 2021, we see a sudden increased rate of growth in cases, which represents the outbreak and community transmission in South Western Sydney.

 

Want to know more?

Primary data source is from NSW Health for daily new cases.  The analysis is performed using RStudio Cloud using R version 4.1.0.

Today’s charts

Data: au_covid
R code: models