TY - JOUR
T1 - Inferring Strain Mixture within Clinical Plasmodium falciparum Isolates from Genomic Sequence Data
AU - O’Brien, John D.
AU - Iqbal, Zamin
AU - Wendler, Jason
AU - Amenga-Etego, Lucas
N1 - Publisher Copyright:
© 2016 Public Library of Science et al. All rights reserved.
PY - 2016/6
Y1 - 2016/6
N2 - We present a rigorous statistical model that infers the structure of P. falciparum mixtures—including the number of strains present, their proportion within the samples, and the amount of unexplained mixture—using whole genome sequence (WGS) data. Applied to simulation data, artificial laboratory mixtures, and field samples, the model provides reasonable inference with as few as 10 reads or 50 SNPs and works efficiently even with much larger data sets. Source code and example data for the model are provided in an open source fashion. We discuss the possible uses of this model as a window into within-host selection for clinical and epidemiological studies.
AB - We present a rigorous statistical model that infers the structure of P. falciparum mixtures—including the number of strains present, their proportion within the samples, and the amount of unexplained mixture—using whole genome sequence (WGS) data. Applied to simulation data, artificial laboratory mixtures, and field samples, the model provides reasonable inference with as few as 10 reads or 50 SNPs and works efficiently even with much larger data sets. Source code and example data for the model are provided in an open source fashion. We discuss the possible uses of this model as a window into within-host selection for clinical and epidemiological studies.
UR - http://www.scopus.com/inward/record.url?scp=84978791332&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1004824
DO - 10.1371/journal.pcbi.1004824
M3 - Article
C2 - 27362949
AN - SCOPUS:84978791332
SN - 1553-734X
VL - 12
JO - PLoS Computational Biology
JF - PLoS Computational Biology
IS - 6
M1 - e1004824
ER -