Molecular evolution of SARS-CoV-2


 

Trevor Bedford (@trvrb)
Associate Professor, Fred Hutchinson Cancer Research Center
17 Feb 2021
Anticipatory Vaccine Design for SARS-CoV-2
Bill & Melinda Gates Foundation
 
Slides at: bedford.io/talks

1. Real-time tracking of SARS-CoV-2 evolution

2. Emergence of variants of concern

3. Expectations for antigenic evolution

4. Current circulation patterns

Real-time tracking of SARS-CoV-2 evolution

Over 550k SARS-CoV-2 genomes shared to GISAID and evolution tracked in real-time at nextstrain.org

SARS-CoV-2 lineages establish globally in February and March

Limited early mutations like D614G spread globally during initial wave

Summer and fall variants were confined to regional dominance

Emergence of variants of concern

484K and 501Y repeatedly emerging across the world

Emergence of 501Y.V1 (B.1.1.7) in the UK

Emergence of 501Y.V2 (B.1.351) in the South Africa

Emergence of 501Y.V3 (P.1) in the Brazil

Substantial convergent evolution

Working hypothesis of within-host evolution occurring during prolonged infection, driven by natural selection for immune escape.

Rapid within-host evolution during persistent infection

484K and 501Y observed during this evolution

Expectations for antigenic evolution

Up to December, my expectation was evolution as seen in seasonal coronaviruses

OC43 and 229E show flu B-like rates of adaptive evolution in S1

~23 mutations per year across SARS-CoV-2 genome

Substantial increase in spike S1 amino acid substitutions in VOCs

Accumulation of substitutions faster in NTD than RBD

Spike S1 rate of 2.9 subs per year similar to rate in HA1 of flu A

Influenza H3N2 drifts at ~1 two-fold HI dilution per year

SARS-CoV-2 VOCs have evolved ~10 amino acid sites in S1 in just over a year. This is rapid even for influenza A. 501Y.V2 shows an ~8-fold drop in neutralization titer, equivalent to ~3 years of H3N2 evolution.

Current circulation patterns

VOCs are growing in frequency with B.1.1.7 leading the curve

Repeated convergent evolution across sites in spike (452R)

Repeated convergent evolution across sites in spike (681H)

Repeated convergent evolution across sites in spike (69del)

Repeated convergent evolution across sites in spike (combined)

Acknowledgements

SARS-CoV-2 genomic epi: Data producers from all over the world, GISAID and the Nextstrain team

Bedford Lab: Alli Black, John Huddleston, James Hadfield, Katie Kistler, Louise Moncla, Maya Lewinsohn, Thomas Sibley, Jover Lee, Kairsten Fay, Misja Ilcisin, Cassia Wagner, Miguel Paredes, Nicola Müller, Marlin Figgins, Eli Harkins