Making viral phylogenetic inferences actionable

Trevor Bedford (@trvrb)
11 Jul 2017
VIDD Seminar
Fred Hutch

This talk

We work at the interface of virology, evolution and epidemiology

Methods focus on sequencing to reconstruct pathogen spread

Epidemic process

Sample some individuals

Sequence and determine phylogeny

Sequence and determine phylogeny

Localized Middle Eastern MERS-CoV phylogeny

Regional West African Ebola phylogeny

Global influenza phylogeny

Phylogenetic tracking has the capacity to revolutionize epidemiology


  • Influenza circulation and antigenic drift
  • Ebola spread in West Africa
  • Zika spread in the Americas
  • "Real-time" analyses
  • Future directions


Influenza virion

Population turnover (in H3N2) is extremely rapid

Antigenic drift necessitates frequent H3N2 vaccine updates

Integrating influenza antigenic dynamics with molecular evolution

with Andrew Rambaut, Marc Suchard, Philippe Lemey and others

Global circulation patterns of seasonal influenza viruses vary with rates of antigenic drift

with Colin Russell, Philippe Lemey, Steven Riley and many others

Scientific publishing practices vs
a fast evolving virus

Vaccine strain selection timeline


Project to provide a real-time view of the evolving influenza population


Project to provide a real-time view of the evolving influenza population

All in collaboration with Richard Neher

nextflu pipeline

  1. Download all recent HA sequences from GISAID
  2. Filter to remove outliers
  3. Subsample across time and space
  4. Align sequences
  5. Build tree
  6. Estimate clade frequencies
  7. Infer antigenic phenotypes
  8. Export for visualization

Extended previous antigenic model to directly infer titer drops on a phylogeny

Working directly with CDC to provide analytics and the WHO to provide technical reports


Virus genomes reveal factors that spread and sustained the Ebola epidemic

with Gytis Dudas, Andrew Rambaut, Luiz Carvalho, Marc Suchard, Philippe Lemey,
and many others

Sequencing of 1610 Ebola virus genomes collected during the 2013-2016 West African epidemic

Phylogenetic reconstruction of evolution and spread

Tracking migration events

Factors influencing migration rates

Effect of borders on migration rates

Spatial structure at the country level

Substantial mixing at the regional level

Regional outbreaks due to multiple introductions

Ebola spread in West Africa followed a gravity model with moderate slowing by international borders, in which spread is driven by short-lived migratory clusters


Zika's arrival and spread in the Americas

Establishment and cryptic transmission of Zika virus in Brazil and the Americas

with Nuno Faria, Nick Loman, Oli Pybus, Luiz Alcantara, Ester Sabino, Josh Quick,
Alli Black, Ingra Morales, Julien Thézé, Marcio Nunes, Jacqueline de Jesus,
Marta Giovanetti, Moritz Kraemer, Sarah Hill and many others

Road trip through northeast Brazil to collect samples and sequence

Case reports and diagnostics suggest initiation in northeast Brazil

Phylogeny infers an origin in northeast Brazil

Genomic epidemiology reveals multiple introductions of Zika virus into the United States

with Nathan Grubaugh, Kristian Andersen, Jason Ladner, Gustavo Palacios, Sharon Isern, Oli Pybus, Moritz Kraemer, Gytis Dudas, Amanda Tan, Karthik Gangavarapu, Michael Wiley, Stephen White, Julien Thézé, Scott Michael, Leah Gillis, Pardis Sabeti, and many others

Outbreak of locally-acquired infections focused in Miami-Dade county

Phylogeny shows introductions from the Caribbean and a surprising degree of clustering

Flow of infected travelers greatest from Caribbean

Clustering suggests fewer, longer transmission chains and higher R0

Genomic epidemiology of Zika in the US Virgin Islands

with Alli Black, Barney Potter, Gytis Dudas, Esther Ellis, Brett Ellis,
Kristian Andersen, Nathan Grubaugh, Leora Feldstein, and others
(and special thanks to Adam Geballe)

Preliminary analysis of 31 genomes shows multiple introductions to USVI

Important analyses, let's make them more rapid and more automated

Key challenges

  • Timely analysis and sharing of results critical
  • Dissemination must be scalable
  • Integrate many data sources
  • Results must be easily interpretable and queryable


Project to conduct real-time molecular epidemiology and evolutionary analysis of emerging epidemics

with Richard Neher, James Hadfield, Colin Megill,
Sidney Bell, Charlton Callender, Barney Potter,
and John Huddleston

Nextstrain architecture

All code open source at

Rapid on-the-ground sequencing by Ian Goodfellow, Matt Cotten and colleagues

Moving forward

Lab network

Dengue antigenic dynamics

with Sidney Bell

Phylogenetic analysis of MERS-CoV spillover patterns

with Gytis Dudas, Luiz Max Carvalho and Andrew Rambaut

Genomic epidemiology of mumps virus
in Washington State

with Louise Moncla, Alli Black, Chas DeBolt, Ailyn Perez-Osorio, Scott Lindquist,
Alex Greninger and others

Research directions

  • Real-time forecasting for influenza
  • Improve outbreak tracking, extend to new systems
  • Nextstrain for all the pathogens
  • Mobile viral sequencing
  • Continue building relationships with public health entities


Bedford Lab: Alli Black, Sidney Bell, Gytis Dudas, Stephanie Stacy, John Huddleston,
Barney Potter, James Hadfield, Louise Moncla, Maya Lewinsohn

Influenza: WHO Global Influenza Surveillance Network, GISAID, Richard Neher, Colin Russell, Andrew Rambaut, Marc Suchard, Philippe Lemey, Steven Riley

Ebola: Gytis Dudas, Andrew Rambaut, Luiz Carvalho, Philippe Lemey, Marc Suchard, Andrew Tatem

Zika: Nick Loman, Nuno Faria, Oli Pybus, Josh Quick, Kristian Andersen, Nathan Grubaugh, Jason Ladner, Gustavo Palacios, Sharon Isern, Gytis Dudas, Alli Black, Barney Potter, Esther Ellis

Nextstrain: Richard Neher, James Hadfield, Colin Megill, Sidney Bell, Charlton Callender, Barney Potter, John Huddleston