Inferring Strain Mixture within Clinical Plasmodium falciparum Isolates from Genomic Sequence Data

John D. O’Brien, Zamin Iqbal, Jason Wendler, Lucas Amenga-Etego

Research output: Contribution to journalArticlepeer-review

14 Citations (Scopus)

Abstract

We present a rigorous statistical model that infers the structure of P. falciparum mixtures—including the number of strains present, their proportion within the samples, and the amount of unexplained mixture—using whole genome sequence (WGS) data. Applied to simulation data, artificial laboratory mixtures, and field samples, the model provides reasonable inference with as few as 10 reads or 50 SNPs and works efficiently even with much larger data sets. Source code and example data for the model are provided in an open source fashion. We discuss the possible uses of this model as a window into within-host selection for clinical and epidemiological studies.

Original languageEnglish
Article numbere1004824
JournalPLoS Computational Biology
Volume12
Issue number6
DOIs
Publication statusPublished - Jun 2016
Externally publishedYes

Fingerprint

Dive into the research topics of 'Inferring Strain Mixture within Clinical Plasmodium falciparum Isolates from Genomic Sequence Data'. Together they form a unique fingerprint.

Cite this