TY - JOUR
T1 - Pf8
T2 - an open dataset of Plasmodium falciparum genome variation in 33,325 worldwide samples
AU - Malaria Genomic Epidemiology Network (MalariaGEN)
AU - Abdel Hamid, Muzamil Mahdi
AU - Abdelraheem, Mohamed Hassan
AU - Acheampong, Desmond Omane
AU - Adam, Ishag
AU - Aide, Pedro
AU - Ajibaye, Olusola
AU - Ali, Mozam
AU - Almagro-Garcia, Jacob
AU - Amambua-Ngwa, Alfred
AU - Amenga-Etego, Lucas
AU - Aniebo, Ifeyinwa
AU - Aninagyei, Enoch
AU - Ansah, Felix
AU - Apinjoh, Tobias O.
AU - Ariani, Cristina V.
AU - Auburn, Sarah
AU - Awandare, Gordon A.
AU - Balmer, Andrew
AU - Bejon, Philip
AU - Boene, Simone
AU - Bwire, George
AU - Candrinho, Baltazar
AU - Chidimatembue, Arlindo
AU - Chindavongsa, Keobouphaphone
AU - Comiche, Kiba
AU - Conway, David
AU - Dara, Antoine
AU - Diakite, Mahamadou
AU - Djimde, Abdoulaye
AU - Dondorp, Arjen
AU - Doumbia, Seydou
AU - Drury, Eleanor
AU - Fanello, Caterina A.
AU - Ferdig, Mike
AU - Figueroa, Katherine
AU - Gamboa, Dionicia
AU - Golassa, Lemu
AU - Gonçalves, Sónia
AU - Guindo, Merepen dite Agnes
AU - Hamaluba, Mainga
AU - Hanboonkunupakarn, Borimas
AU - Howe, Kevin
AU - Hussien, Maazza
AU - Imwong, Mallika
AU - Ishengoma, Deus
AU - Jeans, Julia
AU - Kabaghe, Alinune
AU - Kamuhabwa, Appolinary
AU - Kindermans, Jean Marie
AU - Konate, Drissa S.
N1 - Publisher Copyright:
Copyright: © 2025 Malaria Genomic Epidemiology Network (MalariaGEN) et al.
PY - 2025
Y1 - 2025
N2 - We describe the Pf8 data resource, the latest MalariaGEN release of curated genome variation data on over 33,000 Plasmodium falciparum samples from 99 partner studies and 122 locations over more than 50 years. This release provides open access to raw sequencing data and genotypes at over 12 million genomic positions. For the first time, it includes copy-number variation (CNV) calls in the drug-resistance associated genes gch1 and crt. As in Pf7, CNV calls are provided for mdr1 and plasmepsin2/3, along with calls for deletion in hrp2 and hrp3, genes associated with rapid diagnostic test failures. This data resource additionally features derived datasets, interactive web applications for exploring patterns of drug resistance and variation in over 5,000 genes, an updated Python package providing methods for accessing and analysing the data, and open access analysis notebooks that can be used as starting points for further analyses. In addition, informative example analyses show contrasting profiles of the decline of chloroquine resistance-associated mutations in Africa, and variation in copy number variation across 10 distinct sub-populations. To the best of our knowledge, Pf8 is the largest open data set of genome variation in any eukaryotic species, making it an invaluable foundational resource for understanding evolution, including that of pathogens.
AB - We describe the Pf8 data resource, the latest MalariaGEN release of curated genome variation data on over 33,000 Plasmodium falciparum samples from 99 partner studies and 122 locations over more than 50 years. This release provides open access to raw sequencing data and genotypes at over 12 million genomic positions. For the first time, it includes copy-number variation (CNV) calls in the drug-resistance associated genes gch1 and crt. As in Pf7, CNV calls are provided for mdr1 and plasmepsin2/3, along with calls for deletion in hrp2 and hrp3, genes associated with rapid diagnostic test failures. This data resource additionally features derived datasets, interactive web applications for exploring patterns of drug resistance and variation in over 5,000 genes, an updated Python package providing methods for accessing and analysing the data, and open access analysis notebooks that can be used as starting points for further analyses. In addition, informative example analyses show contrasting profiles of the decline of chloroquine resistance-associated mutations in Africa, and variation in copy number variation across 10 distinct sub-populations. To the best of our knowledge, Pf8 is the largest open data set of genome variation in any eukaryotic species, making it an invaluable foundational resource for understanding evolution, including that of pathogens.
KW - Plasmodium falciparum
KW - data resource
KW - genomic epidemiology
KW - genomic surveillance
KW - genomics
KW - malaria
KW - open data sharing
UR - https://www.scopus.com/pages/publications/105024496716
U2 - 10.12688/wellcomeopenres.24031.1
DO - 10.12688/wellcomeopenres.24031.1
M3 - Article
AN - SCOPUS:105024496716
SN - 2398-502X
VL - 10
JO - Wellcome Open Research
JF - Wellcome Open Research
M1 - 325
ER -