Structural variation in malaria parasites

Overview

Copy number variation in GTP
cyclyohydrolase I: the amplicons containing
this gene vary in size, and are found in 2-11
copies in malaria populations in
Thai-Myanmar border

Large scale deletion, copy number variation, insertions, rearrangements and are common in pathogen genomes, show high mutation rates, and may underlie rapid adaptation to drugs and other selection. However, structural variants are more difficult to score than SNPs so have received less attention. We have investigated the role of copy number variation in drug resistance evolution and compensation, as well as population genomics of copy number variation in Plasmodium populations worldwide. Now tools, such as nanopore long read sequencing (see Projects in Development) now provide new opportunities for work on structural variants in schistosomes

Representative Papers

  • Nair, Shalini, Xue Li, Standwell C Nkhoma, and Tim Anderson. 2022. “Fitness Costs of Pfhrp2 and Pfhrp3 Deletions Underlying Diagnostic Evasion in Malaria Parasites”. The Journal of Infectious Diseases 226 (9): 1637-45. https://doi.org/10.1093/infdis/jiac240.

    BACKGROUND: Rapid diagnostic tests based on detection of histidine-rich proteins (HRPs) are widely used for malaria diagnosis, but parasites carrying pfhrp deletions can evade detection and are increasing in frequency in some countries. Models aim to predict conditions under which pfhrp2 and/or pfhrp3 deletions will increase, but a key parameter-the fitness cost of deletions-is unknown.

    METHODS: We removed pfhrp2 and/or pfhrp3 from a Malawian parasite clone using gene editing approaches) and measured fitness costs by conducting pairwise competition experiments.

    RESULTS: We observed significant fitness costs of 0.087 ± 0.008 (1 standard error) per asexual cycle for pfhrp2 deletion and 0.113 ± 0.008 for the pfhrp2/3 double deletion, relative to the unedited progenitor parasite. Selection against deletions is strong and comparable to that resulting from drug resistance mutations.

    CONCLUSIONS: Prior modeling suggested that diagnostic selection may drive increased frequency of pfhrp deletions only when fitness costs are mild. Our experiments show that costs of pfhrp deletions are higher than these thresholds, but modeling and empirical results can be reconciled if the duration of infection is short. These results may inform future modeling to understand why pfhrp2/3 deletions are increasing in some locations (Ethiopia and Eritrea) but not in others (Mekong region).

  • Cheeseman, Ian H, Becky Miller, John C Tan, Asako Tan, Shalini Nair, Standwell C Nkhoma, Marcos De Donato, et al. (2016) 2016. “Population Structure Shapes Copy Number Variation in Malaria Parasites.”. Molecular Biology and Evolution 33 (3): 603-20. https://doi.org/10.1093/molbev/msv282.

    If copy number variants (CNVs) are predominantly deleterious, we would expect them to be more efficiently purged from populations with a large effective population size (Ne) than from populations with a small Ne. Malaria parasites (Plasmodium falciparum) provide an excellent organism to examine this prediction, because this protozoan shows a broad spectrum of population structures within a single species, with large, stable, outbred populations in Africa, small unstable inbred populations in South America and with intermediate population characteristics in South East Asia. We characterized 122 single-clone parasites, without prior laboratory culture, from malaria-infected patients in seven countries in Africa, South East Asia and South America using a high-density single-nucleotide polymorphism/CNV microarray. We scored 134 high-confidence CNVs across the parasite exome, including 33 deletions and 102 amplifications, which ranged in size from <500 bp to 59 kb, as well as 10,107 flanking, biallelic single-nucleotide polymorphisms. Overall, CNVs were rare, small, and skewed toward low frequency variants, consistent with the deleterious model. Relative to African and South East Asian populations, CNVs were significantly more common in South America, showed significantly less skew in allele frequencies, and were significantly larger. On this background of low frequency CNV, we also identified several high-frequency CNVs under putative positive selection using an FST outlier analysis. These included known adaptive CNVs containing rh2b and pfmdr1, and several other CNVs (e.g., DNA helicase and three conserved proteins) that require further investigation. Our data are consistent with a significant impact of genetic structure on CNV burden in an important human pathogen.