SARS-CoV-2 genome sequence prevalence and growth rate update: 24 May 2023
Updated 8 August 2024
Applies to England
Variant prevalence
Testing policy and sequencing should be considered when interpreting variant data and is described in full in the variant technical briefing series. The prevalence of lineages amongst UK sequences by Phylogenetic Assignment of Named Global Outbreak Lineages (Pangolin) designation is presented in Figure 1.
Lineages are shown if there are more than or equal to 5,000 sequences since 5 December 2022 or if they represent more than or equal to 1% of sequences within a single week over the last 6 weeks. Lineages that do not meet these criteria are combined with their parent lineage (for example, BA.2.4 is combined with BA.2).
The lineages have been assigned using the accurate Ultrafast Sample placement on Existing tRee (UShER) mode and version 1.19 of the Pangolin data.
Figure 1. Prevalence of Pangolin lineages in the UK with sequence data with a specimen date from week beginning 5 December 2022 to week beginning 1 May 2023, as of 18 May 2023
The total number of valid sequence results per week is shown by the black line. The ‘Other’ category in this plot contains all lineages that do not meet the relevant criteria after combining smaller sub-lineages. ‘Unassigned’ are sequences that could not be assigned a lineage by Pangolin. Lineages present in at least 2% of sequences in the most recent week are labelled to the right of the plot.
Variant modelling
Two models are currently used to estimate the growth advantage of emerging lineages: a logistic regression generalised linear model (GLM) and a generalised additive model (GAM). Models are fit to a geographically stratified sample of Pillar 1 cases to ensure that relative growth rates are estimated in relation to a local set of co-circulating lineages. Tests associated with travel are excluded. A full description of methods can be found in the variant technical briefing series.
Growth rates were based on sequences sampled through Pillar 1 testing (primarily positive tests conducted in hospital) in England (Table 1). The sampling range for both the logistic regression GLM and GAM is from 28 November 2022 to 15 May 2023. Any group of lineages shown in the plot above were modelled provided there were at least 50 sequenced cases in the sample set.
The model fit for any lineage with a positive growth rate advantage (with 95% confidence intervals (CIs) that do not cross zero) are shown in Figure 2. The lineages with positive growth rate are XBB.1.16 (50.23% GAM), XBB.1.5.13 (30.41% GAM), XBB.1.9.2 (22.37% GAM) and XBB.1.16.1 (21.8% GAM).
Table 1. Growth rate (GR) of English sequence lineages as of 15 May 2023
Lineage | Pillar 1 sample size | Weekly growth rate advantage (GAM) | Estimated prevalence (GAM) | Weekly growth rate advantage (GLM) |
---|---|---|---|---|
XBB.1.16 | 307 | 50.23% (95% CI: 48.07 to 52.39) | 9.67% (95% CI: 7.36 to 12.61) | 49.1% (95% CI: 32.17 to 66.04) |
XBB.1.5.13 | 423 | 30.41% (95% CI: 10.48 to 50.33) | 1.99% (95% CI: 1.23 to 3.21) | 42.93% (95% CI: 41.34 to 44.52) |
XBB.1.9.2 | 690 | 22.37% (95% CI: 11.22 to 33.52) | 6.52% (95% CI: 4.7 to 8.97) | 17.16% (95% CI: 1.21 to 33.1) |
XBB.1.16.1 | 89 | 21.8% (95% CI: 1.42 to 42.19) | 1.51% (95% CI: 0.8 to 2.85) | 6.73% (95% CI: -27.58 to 41.05) |
EM.1 | 423 | 17.78% (95% CI: -5.22 to 40.78) | 0.94% (95% CI: 0.5 to 1.75) | 51.17% (95% CI: 12.38 to 89.95) |
Unassigned | 2,683 | 5.81% (95% CI: -11.71 to 23.33) | 3.88% (95% CI: 2.75 to 5.45) | 12.05% (95% CI: -8.78 to 32.87) |
XBB.1.5.20 | 500 | 4.4% (95% CI: -28.14 to 36.95) | 0.52% (95% CI: 0.24 to 1.11) | 16.41% (95% CI: -28.86 to 61.67) |
XBB.1.17.1 | 178 | -2.42% (95% CI: -42.12 to 37.28) | 1.01% (95% CI: 0.48 to 2.12) | -1.75% (95% CI: -39.5 to 35.99) |
EG.1 | 380 | -3.34% (95% CI: -34.75 to 28.07) | 2.41% (95% CI: 1.44 to 4.02) | 0.18% (95% CI: -23.39 to 23.75) |
Other | 9,071 | -4.49% (95% CI: -13.15 to 4.18) | 10.84% (95% CI: 8.9 to 13.14) | -8.54% (95% CI: -20.72 to 3.64) |
XBB.1.5.12 | 179 | -12.19% (95% CI: -45.08 to 20.7) | 0.46% (95% CI: 0.21 to 0.99) | -22.25% (95% CI: -64.41 to 19.91) |
CH.1.1.1 | 4,110 | -14.71% (95% CI: -39.43 to 10) | 2.11% (95% CI: 1.33 to 3.31) | -17.54% (95% CI: -39.83 to 4.76) |
XBB.1.9.1 | 4,645 | -16.64% (95% CI: -37.57 to 4.3) | 18.88% (95% CI: 14.91 to 23.62) | 0.91% (95% CI: -8.86 to 10.69) |
XBB.1.5 | 11,094 | -17.37% (95% CI: -34.69 to -0.05) | 21.13% (95% CI: 17.54 to 25.23) | -14.88% (95% CI: -24.03 to -5.73) |
CH.1.1 | 5,946 | -22.6% (95% CI: -44.57 to -0.62) | 1.19% (95% CI: 0.72 to 1.97) | -37.39% (95% CI: -68.19 to -6.58) |
BQ.1 | 3,057 | -26.97% (95% CI: -47.81 to -6.12) | 0.1% (95% CI: 0.04 to 0.26) | 6.62% (95% CI: -103.59 to 116.82) |
XBB.1.5.7 | 2,101 | -29.31% (95% CI: -53.78 to -4.85) | 1.9% (95% CI: 1.2 to 2.99) | -27.96% (95% CI: -50.41 to -5.5) |
BQ.1.1 | 12,399 | -42.94% (95% CI: -73.27 to -12.61) | 0.27% (95% CI: 0.12 to 0.59) | -29.33% (95% CI: -81.61 to 22.96) |
XBB.2.3 | 142 | -44.5% (95% CI: -114.33 to 25.34) | 1.09% (95% CI: 0.47 to 2.51) | 24.67% (95% CI: -9.66 to 59) |
XBB.1.5.18 | 1,329 | -48.98% (95% CI: -97.08 to -0.89) | 0.52% (95% CI: 0.24 to 1.15) | -30.52% (95% CI: -65.96 to 4.91) |
Listed parent lineages include all sub-lineages, other than those explicitly modelled. ‘Unassigned’ are sequences that could not be assigned a lineage by Pangolin.
Sample size is for Pillar 1 samples in England.
Sampling range for both logistic regression and generalised additive models (GAM) is from 28 November 2022 to 15 May 2023.
Figure 2. Modelled prevalence of lineage groups with a growth rate advantage over other circulating lineages
The black line shows the central estimate and blue shaded regions the 95% confidence intervals. Points show the national level proportions, with the size indicative of sample numbers for that particular lineage. The grey portion of the ribbon denotes that this period of time is likely to be backfilled with more sequenced cases, making proportions unreliable.
Sources and acknowledgments
Data sources
Data used in this investigation is derived from the Cloud Infrastructure for Microbial Bioinformatics (CLIMB) data set, the UK Health Security Agency (UKHSA) genomic programme data set and the UKHSA Second Generation Surveillance System.
Authors of this report
UKHSA Genomics Public Health Analysis Team
UKHSA Infectious Disease Modelling Team