Improving statistical power in severe malaria genetic association studies by augmenting phenotypic precision

  1. James A Watson  Is a corresponding author
  2. Carolyne M Ndila
  3. Sophie Uyoga
  4. Alexander Macharia
  5. Gideon Nyutu
  6. Shebe Mohammed
  7. Caroline Ngetsa
  8. Neema Mturi
  9. Norbert Peshu
  10. Benjamin Tsofa
  11. Kirk Rockett
  12. Stije Leopold
  13. Hugh Kingston
  14. Elizabeth C George
  15. Kathryn Maitland
  16. Nicholas PJ Day
  17. Arjen M Dondorp
  18. Philip Bejon
  19. Thomas Williams
  20. Chris C Holmes
  21. Nicholas J White
  1. Mahidol Oxford Tropical Medicine Research Unit, Thailand
  2. KEMRI-Wellcome Trust Research Programme, Kenya
  3. Kenya Medical Research Institute-Wellcome Trust Research Programme, Kenya
  4. Wellcome Trust Centre for Human Genetics, United Kingdom
  5. Medical Research Council Clinical Trials Unit, United Kingdom
  6. Mahidol University, Thailand
  7. Kilifi KEMRI-Wellcome Trust Collaborative Research Programme, Kenya
  8. Department of Statistics, University of Oxford, United Kingdom

Abstract

Severe falciparum malaria has substantially affected human evolution. Genetic association studies of patients with clinically defined severe malaria and matched population controls have helped characterise human genetic susceptibility to severe malaria, but phenotypic imprecision compromises discovered associations. In areas of high malaria transmission the diagnosis of severe malaria in young children and, in particular, the distinction from bacterial sepsis, is imprecise. We developed a probabilistic diagnostic model of severe malaria using platelet and white count data. Under this model we re-analysed clinical and genetic data from 2,220 Kenyan children with clinically defined severe malaria and 3,940 population controls, adjusting for phenotype mis-labelling. Our model, validated by the distribution of sickle trait, estimated that approximately one third of cases did not have severe malaria. We propose a data-tilting approach for case-control studies with phenotype mis-labelling and show that this reduces false discovery rates and improves statistical power in genome-wide association studies.

Data availability

A curated minimal clinical dataset is currently available alongisde the code on the github repository. This will also be made available at publication via the KEMRI-Wellcome Harvard Dataverse(https://dataverse.harvard.edu/dataverse/kwtrp).Whole genome data are available from European Genome-Phenome Archive (dataset accession ID: EGAD00010001742).Requests for access to appropriately anonymized clinical data and directly typed genetic variants for the Kenyan severe malaria cohort can be made by application to the data access committee at the KEMRI-Wellcome Trust Research Programme by e-mail to mmunene@kemri-wellcome.org.The FEAST trial datasets are available from the principal investigator on reasonable request (k.maitland@imperial.ac.uk).Requests for access to appropriately anonymized clinical data from the AQ and AAV Vietnam study and the Asian paediatric cohort can be made via the Mahidol Oxford Tropical Medicine Research Unit data access committee by emailing the corresponding author JAW (jwatowatson@gmail.com) or Rita Chanviriyavuth (rita@tropmedres.ac).

The following previously published data sets were used

Article and author information

Author details

  1. James A Watson

    Malaria, Mahidol Oxford Tropical Medicine Research Unit, Bangkok, Thailand
    For correspondence
    jwatowatson@gmail.com
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5524-0325
  2. Carolyne M Ndila

    KEMRI-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  3. Sophie Uyoga

    Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  4. Alexander Macharia

    Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  5. Gideon Nyutu

    KEMRI-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  6. Shebe Mohammed

    KEMRI-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  7. Caroline Ngetsa

    KEMRI-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  8. Neema Mturi

    Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  9. Norbert Peshu

    Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  10. Benjamin Tsofa

    KEMRI-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  11. Kirk Rockett

    Wellcome Trust Centre for Human Genetics, Oxford, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  12. Stije Leopold

    Malaria, Mahidol Oxford Tropical Medicine Research Unit, Bangkok, Thailand
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0482-5689
  13. Hugh Kingston

    Malaria, Mahidol Oxford Tropical Medicine Research Unit, Bangkok, Thailand
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-1869-8307
  14. Elizabeth C George

    Medical Research Council Clinical Trials Unit, London, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  15. Kathryn Maitland

    Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-0007-0645
  16. Nicholas PJ Day

    Mahidol Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-2309-1171
  17. Arjen M Dondorp

    Mahidol Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0001-5190-2395
  18. Philip Bejon

    Pathogen Vector Host Biology, Kilifi KEMRI-Wellcome Trust Collaborative Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
  19. Thomas Williams

    Kenya Medical Research Institute-Wellcome Trust Research Programme, Kilifi, Kenya
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4456-2382
  20. Chris C Holmes

    Department of Statistics, University of Oxford, Oxford, United Kingdom
    Competing interests
    The authors declare that no competing interests exist.
  21. Nicholas J White

    Malaria, Mahidol Oxford Tropical Medicine Research Unit, Bangkok, Thailand
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-1897-1978

Funding

Wellcome Trust (209265/Z/17/Z)

  • Kathryn Maitland
  • Nicholas PJ Day

Wellcome Trust (202800/Z/16/Z)

  • Thomas Williams

Wellcome Trust (093956/Z/10/C)

  • Nicholas J White

Medical Research Council (MC\UU\12023/26)

  • Elizabeth C George

Wellcome Trust (WT077383/Z/05/Z)

  • Kirk Rockett

Medical Research Council (G0801439)

  • Elizabeth C George
  • Kathryn Maitland

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Human subjects: All clinical data are from published studies in which all participants or guardians gave fully informed consent. Access to the human genetic data was approved by the MalariaGen data access committee.

Copyright

© 2021, Watson et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,049
    views
  • 128
    downloads
  • 24
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. James A Watson
  2. Carolyne M Ndila
  3. Sophie Uyoga
  4. Alexander Macharia
  5. Gideon Nyutu
  6. Shebe Mohammed
  7. Caroline Ngetsa
  8. Neema Mturi
  9. Norbert Peshu
  10. Benjamin Tsofa
  11. Kirk Rockett
  12. Stije Leopold
  13. Hugh Kingston
  14. Elizabeth C George
  15. Kathryn Maitland
  16. Nicholas PJ Day
  17. Arjen M Dondorp
  18. Philip Bejon
  19. Thomas Williams
  20. Chris C Holmes
  21. Nicholas J White
(2021)
Improving statistical power in severe malaria genetic association studies by augmenting phenotypic precision
eLife 10:e69698.
https://doi.org/10.7554/eLife.69698

Share this article

https://doi.org/10.7554/eLife.69698

Further reading

    1. Epidemiology and Global Health
    Riccardo Spott, Mathias W Pletz ... Christian Brandt
    Research Article

    Given the rapid cross-country spread of SARS-CoV-2 and the resulting difficulty in tracking lineage spread, we investigated the potential of combining mobile service data and fine-granular metadata (such as postal codes and genomic data) to advance integrated genomic surveillance of the pandemic in the federal state of Thuringia, Germany. We sequenced over 6500 SARS-CoV-2 Alpha genomes (B.1.1.7) across 7 months within Thuringia while collecting patients’ isolation dates and postal codes. Our dataset is complemented by over 66,000 publicly available German Alpha genomes and mobile service data for Thuringia. We identified the existence and spread of nine persistent mutation variants within the Alpha lineage, seven of which formed separate phylogenetic clusters with different spreading patterns in Thuringia. The remaining two are subclusters. Mobile service data can indicate these clusters’ spread and highlight a potential sampling bias, especially of low-prevalence variants. Thereby, mobile service data can be used either retrospectively to assess surveillance coverage and efficiency from already collected data or to actively guide part of a surveillance sampling process to districts where these variants are expected to emerge. The latter concept was successfully implemented as a proof-of-concept for a mobility-guided sampling strategy in response to the surveillance of Omicron sublineage BQ.1.1. The combination of mobile service data and SARS-CoV-2 surveillance by genome sequencing is a valuable tool for more targeted and responsive surveillance.

    1. Epidemiology and Global Health
    Marina Padilha, Victor Nahuel Keller ... Gilberto Kac
    Research Article

    Background: The role of circulating metabolites on child development is understudied. We investigated associations between children's serum metabolome and early childhood development (ECD).

    Methods: Untargeted metabolomics was performed on serum samples of 5,004 children aged 6-59 months, a subset of participants from the Brazilian National Survey on Child Nutrition (ENANI-2019). ECD was assessed using the Survey of Well-being of Young Children's milestones questionnaire. The graded response model was used to estimate developmental age. Developmental quotient (DQ) was calculated as the developmental age divided by chronological age. Partial least square regression selected metabolites with a variable importance projection ≥ 1. The interaction between significant metabolites and the child's age was tested.

    Results: Twenty-eight top-ranked metabolites were included in linear regression models adjusted for the child's nutritional status, diet quality, and infant age. Cresol sulfate (β = -0.07; adjusted-p < 0.001), hippuric acid (β = -0.06; adjusted-p < 0.001), phenylacetylglutamine (β = -0.06; adjusted-p < 0.001), and trimethylamine-N-oxide (β = -0.05; adjusted-p = 0.002) showed inverse associations with DQ. We observed opposite directions in the association of DQ for creatinine (for children aged -1 SD: β = -0.05; p =0.01; +1 SD: β = 0.05; p =0.02) and methylhistidine (-1 SD: β = - 0.04; p =0.04; +1 SD: β = 0.04; p =0.03).

    Conclusion: Serum biomarkers, including dietary and microbial-derived metabolites involved in the gut-brain axis, may potentially be used to track children at risk for developmental delays.

    Funding: Supported by the Brazilian Ministry of Health and the Brazilian National Research Council.