Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks

  1. Arturo Torres Ortiz  Is a corresponding author
  2. Michelle Kendall
  3. Nathaniel Storey
  4. James Hatcher
  5. Helen Dunn
  6. Sunando Roy
  7. Rachel Williams
  8. Charlotte Williams
  9. Richard A Goldstein
  10. Xavier Didelot
  11. Kathryn Harris
  12. Judith Breuer
  13. Louis Grandjean  Is a corresponding author
  1. Imperial College London, United Kingdom
  2. University of Warwick, United Kingdom
  3. Great Ormond Street Hospital, United Kingdom
  4. University College London, United Kingdom

Abstract

Accurate inference of who infected whom in an infectious disease outbreak is critical for the delivery of effective infection prevention and control. The increased resolution of pathogen whole-genome sequencing has significantly improved our ability to infer transmission events. Despite this, transmission inference often remains limited by the lack of genomic variation between the source case and infected contacts. Although within-host genetic diversity is common among a wide variety of pathogens, conventional whole-genome sequencing phylogenetic approaches exclusively use consensus sequences, which consider only the most prevalent nucleotide at each position and therefore fail to capture low frequency variation within samples. We hypothesized that including within-sample variation in a phylogenetic model would help to identify who infected whom in instances in which this was previously impossible. Using whole-genome sequences from SARS-CoV-2 multi-institutional outbreaks as an example, we show how within-sample diversity is partially maintained among repeated serial samples from the same host, it can transmitted between those cases with known epidemiological links, and how this improves phylogenetic inference and our understanding of who infected whom. Our technique is applicable to other infectious diseases and has immediate clinical utility in infection prevention and control.

Data availability

Samples sequenced as part of this study have been submitted to the European Nucleotide Archive under accession PRJEB53224.Sample metadata is included in Supplementary file 1.All custom code used in this article can be accessed at https://github.com/arturotorreso/scov2_withinHost.git.

The following data sets were generated

Article and author information

Author details

  1. Arturo Torres Ortiz

    Department of Infectious Diseases, Imperial College London, London, United Kingdom
    For correspondence
    a.ortiz@ucl.ac.uk
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-2492-0958
  2. Michelle Kendall

    Department of Statistics, University of Warwick, Coventry, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-7344-7071
  3. Nathaniel Storey

    Department of Microbiology, Great Ormond Street Hospital, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  4. James Hatcher

    Department of Microbiology, Great Ormond Street Hospital, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  5. Helen Dunn

    Department of Microbiology, Great Ormond Street Hospital, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  6. Sunando Roy

    Department of Infection, Immunity and Inflammation, University College London, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  7. Rachel Williams

    UCL Genomics, University College London, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  8. Charlotte Williams

    UCL Genomics, University College London, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  9. Richard A Goldstein

    UCL Genomics, University College London, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5148-4672
  10. Xavier Didelot

    Department of Statistics, University of Warwick, Coventry, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-1885-500X
  11. Kathryn Harris

    Department of Microbiology, Great Ormond Street Hospital, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  12. Judith Breuer

    Department of Infection, Immunity and Inflammation, University College London, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-8246-0534
  13. Louis Grandjean

    Department of Infection, Immunity and Inflammation, University College London, London, United Kingdom
    For correspondence
    l.grandjean@ucl.ac.uk
    Competing interests
    The authors declare that no competing interests exist.

Funding

Wellcome Trust (201470/Z/16/Z)

  • Louis Grandjean

National Institute of Allergy and Infectious Diseases (1R01AI146338)

  • Louis Grandjean

NIHR

  • Xavier Didelot

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

© 2023, Torres Ortiz et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 942
    views
  • 121
    downloads
  • 6
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Arturo Torres Ortiz
  2. Michelle Kendall
  3. Nathaniel Storey
  4. James Hatcher
  5. Helen Dunn
  6. Sunando Roy
  7. Rachel Williams
  8. Charlotte Williams
  9. Richard A Goldstein
  10. Xavier Didelot
  11. Kathryn Harris
  12. Judith Breuer
  13. Louis Grandjean
(2023)
Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks
eLife 12:e84384.
https://doi.org/10.7554/eLife.84384

Share this article

https://doi.org/10.7554/eLife.84384

Further reading

    1. Epidemiology and Global Health
    Xiaoning Wang, Jinxiang Zhao ... Dong Liu
    Research Article

    Artificially sweetened beverages containing noncaloric monosaccharides were suggested as healthier alternatives to sugar-sweetened beverages. Nevertheless, the potential detrimental effects of these noncaloric monosaccharides on blood vessel function remain inadequately understood. We have established a zebrafish model that exhibits significant excessive angiogenesis induced by high glucose, resembling the hyperangiogenic characteristics observed in proliferative diabetic retinopathy (PDR). Utilizing this model, we observed that glucose and noncaloric monosaccharides could induce excessive formation of blood vessels, especially intersegmental vessels (ISVs). The excessively branched vessels were observed to be formed by ectopic activation of quiescent endothelial cells (ECs) into tip cells. Single-cell transcriptomic sequencing analysis of the ECs in the embryos exposed to high glucose revealed an augmented ratio of capillary ECs, proliferating ECs, and a series of upregulated proangiogenic genes. Further analysis and experiments validated that reduced foxo1a mediated the excessive angiogenesis induced by monosaccharides via upregulating the expression of marcksl1a. This study has provided new evidence showing the negative effects of noncaloric monosaccharides on the vascular system and the underlying mechanisms.

    1. Epidemiology and Global Health
    2. Microbiology and Infectious Disease
    Amanda C Perofsky, John Huddleston ... Cécile Viboud
    Research Article

    Influenza viruses continually evolve new antigenic variants, through mutations in epitopes of their major surface proteins, hemagglutinin (HA) and neuraminidase (NA). Antigenic drift potentiates the reinfection of previously infected individuals, but the contribution of this process to variability in annual epidemics is not well understood. Here, we link influenza A(H3N2) virus evolution to regional epidemic dynamics in the United States during 1997—2019. We integrate phenotypic measures of HA antigenic drift and sequence-based measures of HA and NA fitness to infer antigenic and genetic distances between viruses circulating in successive seasons. We estimate the magnitude, severity, timing, transmission rate, age-specific patterns, and subtype dominance of each regional outbreak and find that genetic distance based on broad sets of epitope sites is the strongest evolutionary predictor of A(H3N2) virus epidemiology. Increased HA and NA epitope distance between seasons correlates with larger, more intense epidemics, higher transmission, greater A(H3N2) subtype dominance, and a greater proportion of cases in adults relative to children, consistent with increased population susceptibility. Based on random forest models, A(H1N1) incidence impacts A(H3N2) epidemics to a greater extent than viral evolution, suggesting that subtype interference is a major driver of influenza A virus infection ynamics, presumably via heterosubtypic cross-immunity.