Previous Article | Next Article ![]()
Antimicrobial Agents and Chemotherapy, June 2005, p. 2180-2188, Vol. 49, No. 6
0066-4804/05/$08.00+0 doi:10.1128/AAC.49.6.2180-2188.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Southwest Foundation for Biomedical Research, P.O. Box 760549, San Antonio, Texas 78245,1 Shoklo Malaria Research Unit, Mae Sot, Tak, Thailand,2 Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand,3 Nuffield Department of Medicine, John Radcliffe Hospital, Oxford, United Kingdom4
Received 16 December 2004/ Returned for modification 24 January 2005/ Accepted 14 February 2005
|
|
|---|
|
|
|---|
While these results are exciting, some caution is needed in interpreting such candidate gene studies. There are two particular problems. First, when multiple tests are carried out, some significant results may occur by chance (false positives or type I errors). This problem may be overcome by using more-stringent criteria for determining significance by procedures such as the Bonferroni correction (4). However, this may result in discounting truly significant results and inflating numbers of false negatives (type II errors) and so is not entirely satisfactory (19). A second major problem with association studies is population structure (23-25). If samples are collected from two or more populations that differ in both allele frequency and phenotype, then spurious association may occur with genetic loci that have no real relationship with the phenotype of interest. For example, suppose parasites are sampled from two populations, one with 90% and the other with 10% of parasites resistant to a drug. Suppose also that allele frequencies have diverged by genetic drift in these two populations at a "neutral" locus that has no influence on resistance status, so that frequencies of an allele A at this locus are 40% in one population and 60% in the other. Assuming equal numbers of parasites from each population are analyzed, the population sample will contain 50% resistant parasites and 50% will carry allele A. However, resistant parasites will carry allele A more often than allele B, generating a false positive association between this locus and drug resistance. For the same reasons, sampling from multiple populations results in elevated LD and can generate correlations between drug phenotypes even when the different sets of genes underlie resistance to each drug.
Study replication using samples collected in a single region over a limited time frame remains the most convincing test of whether genotype-phenotype associations are real or spurious (11). To minimize the potential problem of geographical population structure, we reexamine associations between the 11 candidate genes and resistance to antimalarial drugs in parasites collected from a single clinic on the Thailand-Burma border. Furthermore, to minimize type I errors due to multiple statistical comparisons, we used a two-phase study design, in which significant associations were reexamined in an independent population sample. We investigated genotypic associations with response to mefloquine (MFQ), artesunate (AS), dihydro-artemisinin (DHA), lumefantrine (LUM), doxycycline (DOX), and atovaquone (AQ) in addition to QN and CQ. We found no evidence for association between polymorphisms in eight of the new transporters and drug response. However, we did observe significant association between an ABC transporter (G7) on chromosome 13 (Chr 13) and artesunate response in both population samples examined.
|
|
|---|
Parasite sampling.
We collected 5 ml of P. falciparum-infected blood samples with >0.5% parasitemia from patients visiting the malaria clinic at Mawker-Thai on the Thai-Burma border. This site is
100 miles from Maela, Thailand, where recent studies of in vitro resistance have been conducted (5, 22). We collected samples from patients that had not visited the malaria clinic within the past 60 days and had no history of prior malaria treatment. The clinic serves people on both sides of the border; 90% of patients travel from within a 10-km radius around the clinic (François Nosten, personal communication). Collection protocols were approved by the Ethical Committee of the Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand, and by the Institutional Review board at the University of Texas Health Science Center at San Antonio. Parasite isolates used in phase I were collected between December 1998 and July 2001, while parasites used in phase II were collected between November 2001 and September 2003. Patients in this region have been treated with mefloquine-artesunate combination therapy since 1994 (18).
Measurement of in vitro resistance. Infected blood samples were transported to the laboratory in Mae Sot, Thailand, within 4 h of collection. We measured in vitro responses to CQ, QN, MFQ, LUM, AS, DHA, AQ, and DOX using a 48-h [3H]hypoxanthine incorporation assay (5, 7). Tests were performed using a starting parasitemia of between 0.5 and 1%, and duplicate assays were performed at each of 7 or 11 doubling drug concentrations. Dose-response curves, IC50s, and coefficients of variation were calculated by fitting the data to an inhibitory E-max pharmacokinetic model using WINNONLIN version 3.1 (Pharsight Corporation). To ensure data quality, we rejected all IC50s with coefficients of variation (standard error x 100)/mean of estimated IC50s of >20% and those in which the incorporation in control wells (parasites, no drug) was <5 times the background level (red blood cells only). A single outlier was removed in some cases. IC50s were retained if removal of an outlier reduced the coefficient of variation (CV) to less than 20% and did not alter the IC50 value by greater than 10%. For curves from highly resistant samples, the range of dilutions used in the drug tests was insufficiently wide to obtain accurate measures of IC50. In these cases, the curves were "forced" by adding an extra data point (zero, indicating no growth) at the next doubling concentration. This procedure results in conservative IC50s, while allowing us to retain data from interesting parasite isolates with unusually high IC50s.
Microsatellite genotyping of infections. Infections containing multiple clones provide ambiguous information for both genotype and drug response data. We genotyped seven microsatellite loci to identify infections containing multiple clones. We also used the genotype information to identify multilocus haplotypes that were represented more than once in the data set. Inclusion of "clonally identical" parasites can bias association studies, just as inclusion of relatives can bias association studies in human populations. Parasite DNA was prepared by phenol/chloroform extraction of whole blood, following removal of buffy coats as described previously (15). The microsatellite loci amplified were ARA2 (Chr 11), POLYA (Chr 4), TA1 (Chr 6), C2M1 (Chr 2), C3M54 (Chr 3), TA60 (Chr 13), C3M85 (Chr 3), and C4M30 (Chr 4). Primers and protocols for amplifying these loci are described on the NIH website (http://www.ncbi.nlm.nih.gov/projects/Malaria/Mapsmarkers/PfGMap/pfgmap.html). We amplified microsatellites using fluorescent end-labeled oligonucleotides, ran products on an ABI 3100 capillary sequencer, and scored alleles using GENESCAN and GENOTYPER software. Multiple-clone infections were defined as those in which one or more of the seven loci showed multiple alleles. Minor alleles were scored if they were >33% of the height of predominant alleles (2, 3).
Genotyping of transporter polymorphisms. We used a combination of primer extension ("SNaPshot" genotyping), sequencing, and microsatellite genotyping to score polymorphisms in the 11 candidate transporter genes (Table 1). Primer extension was performed using the ABI PRISM SNaPshot Multiplex kit (Applied Biosystems) and scored on an ABI 3100 capillary sequencer using GENESCAN and GENOTYPER software. In this technique, mutations are detected by fluorescent single-base extension of primers designed adjacent to polymorphic sites. Multiple mutations can be genotyped in a single reaction and scored in a single lane of a gel or a single capillary on an automated sequencer. The methods used were similar to protocols described previously (15). Table 1 lists primer sequences and conditions used for the different loci. For pfcrt we used a combination of primer extension and direct sequencing. We genotyped five codons (codons 220, 271, 326, 356, and 371) in the 3' region of pfcrt by primer extension, while mutations in the 5' region of pfcrt (amino acids 72, 74, 75, and 76) were scored by direct sequencing using ABI PRISM BigDye Terminator cycle sequencing ready reaction kit (Applied Biosystems). We used sequencing, rather than primer extension, for the 5' region of pfcrt, since the cluster of mutations from amino acids 72 to 76 makes primer design problematic for primer extension or other single nucleotide polymorphism (SNP) typing methods. We used PCR spin columns (QIAGEN) to remove excess nucleotides and oligonucleotides from the PCRs prior to the sequencing reactions. As controls, we genotyped a variety of laboratory parasite lines (3D7, HB3, Dd2, VS/1, and FCR3) with known pfmdr and pfcrt sequences. These were obtained from MR4/American Type Culture Collection (Manassas, Virginia).
|
View this table: [in a new window] |
TABLE 1. Methods and oligonucleotide sequences used for genotyping transporter loci
|
|
|
|---|
2 drugs remained. We found 99 genotypes that were represented a single time, while 9 genotypes (defined by microsatellite loci and transporter polymorphisms) were represented two to four times in the data set. To minimize bias, we analyzed unique genotypes only, including only one representative of each genotype and using mean values for the phenotype data from multiply represented clones. This reduced the sample size to 108 for phase I. The IC50s for the eight drugs are summarized in Table 2. All 108 of the pfcrt sequences examined contained 76T, critical for CQ resistance. All were identical to the published Dd2 sequence (CIETSESSTTI at positions 72, 74, 75, 76, 220, 271, 326, 356, and 371, respectively) except for one parasite that had I rather than T at residue 356. pfcrt was therefore excluded from subsequent analyses. Two SNPs in pfmdr (codons 1034 and 1246) and 1 SNP in G2 (codon 151) were monomorphic, leaving 12 polymorphisms (11 SNPs and 1 3-bp indel) in coding regions and 1 microsatellite and 2 SNPs within introns in the 10 genes (pfcrt excluded) for analysis. Both in vitro drug response and polymorphism data for phase I are provided in the supplemental material. |
View this table: [in a new window] |
TABLE 2. Summary statistics for IC50 data from phases I and IIa
|
|
View this table: [in a new window] |
TABLE 3. Associations between SNPs or indels in 10 putative transporter genes and IC50s for eight antimalarial drugsa
|
![]() View larger version (30K): [in a new window] |
FIG. 1. (A) Box-and-whisker plots showing associations between polymorphisms and drug response in phase I samples. (B) The same associations examined in phase II samples. Asterisks above graphs indicate significance level (*, P < 0.05; **, P < 0.01; ***, P < 0.001). The boxes indicate the 25th and 75th percentiles, the whiskers indicate the 10th and 90th percentiles, and the lines show the median values. The numbers at the top of each graph indicate the sample sizes for each of the alternate states at the polymorphism examined. Only the pfmdr-1042 and G7 polymorphisms remain significantly associated with drug response in both phase I and II population samples examined.
|
Phenotype-phenotype associations. We examined correlations between IC50s for the eight drugs in both phase I (108 isolates) and phase II (199 isolates) population samples (Fig. 2 and Table 4). We found strong consistent correlations (P < 0.0001) in comparisons between AS, DHA, LUM, MFQ, and QN in both population samples. Correlations between AS and DHA and between MFQ and LUM/QN were particularly strong, with r2 values of >0.3 in both phase I and phase II samples (Fig. 2). Phenotypic correlations between the responses to these drugs have been observed in previous southeast Asian studies (5, 21). We also observed weaker correlations (r2 < 0.1) between CQ and AQ/LUM and between DOX and DHA in phase II samples. We found no significant phenotypic correlation between CQ and QN responses in either phase I or phase II samples (Fig. 2). In comparison, there was a strong correlation between CQ and QN in Mu et al.'s previous southeast Asian data set (r2 = 0.478; P < 0.001) (14). This correlation became weaker but remained significant (r2 = 0.145; P < 0.018) when the four isolates bearing wild-type pfcrt alleles were excluded.
![]() View larger version (20K): [in a new window] |
FIG. 2. Scatter plots showing phenotypic correlations between log (IC50) values for different drugs. (A) CQ and QN; (B) DHA and AS; (C) MFQ and QN; (D) LUM and MFQ. Correlation coefficients (r2), sample sizes, and significance for all pairwise comparisons are shown in Table 4.
|
|
View this table: [in a new window] |
TABLE 4. Matrix of correlation coefficients (r2) between drug responsesa
|
![]() View larger version (26K): [in a new window] |
FIG. 3. Linkage disequilibrium between 15 polymorphic markers (SNPs or microsatellites) in 10 transporter genes. Panel A shows results from this study. Panel B shows results from Mu et al.'s Southeast Asian data set for comparison (14). Significance was assessed by permutation as described in the text (black shading, P < 0.01; grey shading, P < 0.05). Significant comparisons between markers located within the same gene are marked with a white "w" to distinguish them from between-gene comparisons.
|
|
|
|---|
Our current results differed in a number of respects from the previous southeast Asian data set (14). We observed no significant associations between CQ response and the polymorphisms in the nine novel transporters and just a single association with QN response in the phase I samples examined. Furthermore, we observed just two additional associations between responses to other drugs examined and the nine novel candidate transporter loci (G7 with AS and G49-1116 with DHA). With the exception of comparisons involving pfmdr-1042 and between G7 and AS, these associations disappeared when retested in a large independent sample of parasites, suggesting that they were type I errors. We observed lower levels of LD after correction for sample size differences. Finally, we observed no correlation between CQ and QN drug responses in the phase I samples or phase II samples (Fig. 2 and 3). This study and the previous association analysis (14) differ in two key respects: in the methods used for measuring in vitro resistance and in the sampling regime and study design. Examination of these aspects of the two studies' designs may indicate why our results differ.
The ability to detect associations between genotype and phenotype is strongly dependent on the quality of the phenotype data. In the original study (14), drug response was measured in isolates adapted to laboratory culture using a minimum of five replicate runs per parasite. As a further control, Dd2 assays were run concurrently, and the data were standardized to Dd2. These procedures were adopted to maximize the quality of the in vitro data. In the present study, we measured in vitro resistance using isolates freshly isolated from patients, and we used two replicates at each drug concentration. Can additional noise in the in vitro data reported here explain the difference between the two studies? We believe that there is a strong signal in the in vitro data reported here. (i) We used stringent criteria to remove low-quality in vitro data (see Materials and Methods). In particular, IC50 data were used only if standard errors were less than 20% of the mean (CV < 20%). (ii) The data show expected strong phenotypic correlations between AS and DHA and between MFQ and QN and LUM (Fig. 2). Similar strong correlations have been observed in previous southeast Asian studies (5, 17, 30), suggesting that the data are reliable. (iii) We also see expected strong associations between pfmdr-1042 and in vitro resistance to MFQ and LUM (21). Unlike the other associations detected in the first phase of this study, these associations remained on retesting in a larger parasite sample. Since pfmdr is a known modulator of resistance to a variety of antimalarial compounds (20-22, 26), this provides an important positive control for this data set. (iv) Real-time PCR measurement of pfmdr copy number showed highly significant associations with drug response. An increased pfmdr copy number was significantly associated with higher IC50s for MFQ (P < 0.0001), LUM (P < 0.0001), QN (P = 0.0001), DHA (P < 0.05), and AS (P < 0.05) (T. J. C. Anderson et al., unpublished data), confirming results from a number of recent studies (20-22) and providing further strong validation for this data set. (v) Finally, lower precision in our IC50 measures is likely to have been offset by larger sample sizes. Our sample comprised 108 unique parasite isolates, of which 85 and 100 gave reliable in vitro data for CQ and QN. This is more than two times larger than the southeast Asian sample collection examined previously (14).
On the other hand, three features of the in vitro data reported here do suggest some measurement error. First, we found significant differences between IC50s for strain K1 parasites in the different assay periods for a number of drugs. We are unsure of the reasons for this. Second, we observed significant differences in drug responses measured during the phase I and phase II samplings (Table 2). Some of these differences may reflect real temporal changes in allele frequencies in loci encoding drug resistance genes. However, decreases in DOX and AQ IC50s are more difficult to explain. Third, we observed elevated levels of resistance to some drugs within particular batches of plates used in the drug resistance assays, suggesting some error in preparation of assay plates. The consistent associations with pfmdr polymorphisms (Table 3; Fig. 1) and existence of expected correlations between drug responses (Fig. 2 and Table 4) demonstrate a strong signal-to-noise ratio in the in vitro data and make it unlikely that associations with genes of large effect would be missed. However, we cannot exclude the possibility that noise in our phenotype data precluded us from detecting subtle associations and resulted in Type II errors in the present study.
Association studies are notoriously prone to type I errors due to multiple testing and population structure (23-25). Here we used a two-phase design to minimize type I errors, while in the previous study (14), a single population sample was used and statistical approaches were employed to minimize such errors. Furthermore, we used parasite isolates collected from a single location over a short time period, while the previous study used cultured parasite isolates collected from disparate locations over a longer period. The two-phase design used here clearly helped us to avoid type I errors, since of eight positive associations detected in the first phase, five were rejected during the phase II survey.
To minimize type I errors due to population structure in the previous data set (14), two procedures were used. First, SNPs in housekeeping genes were typed to provide "genomic controls" for spurious associations. None of these housekeeping genes showed positive associations. However, unfortunately, only 2/39 genes examined were sufficiently polymorphic for inclusion. Second, population samples were grouped into three different geographical regions (Africa, southeast Asia, and South America), and samples from each region were analyzed independently. However, within-region population structure could still generate bias and lead to type I errors. If population structure is an important concern in southeast Asia, then we would expect to see greater LD in the southeast Asian population analyzed previously than in the data set presented here from a local P. falciparum population. Consistent with this, we found significantly less LD in the parasite population from Mawker-Thai than in our previous data set (14) after equalizing the sample sizes in the two studies. The fact that there is less LD in a local southeast Asian parasite population suggests that population structure could have generated some of the borderline associations reported previously (14).
Mixed population samples are likely to be a problem only if population allele frequencies and drug responses vary regionally or temporally within southeast Asia. There is independent evidence for weak population structure in southeast Asia. We have documented weak but significant population structure in allele frequencies at 9/11 putatively neutral SNPs on chromosomes 2 and 3, with much of the variation being attributable to populations from Laos and Bangladesh (Anderson et al., unpublished). There is also good evidence that levels of in vitro resistance vary both within and between countries within southeast Asia. These data come from measurement of in vitro resistance (17, 30), from drug efficacy rates (27), and from genotyping of known drug resistance markers (16).
In conclusion, we found suggestive evidence that G7 is associated with AS response, and we encourage others to try to replicate this result. However, we did not find strong evidence supporting a role for the remaining eight new candidate genes and resistance to CQ or QN or to six other drugs. Our failure to replicate these associations (14) may stem from differences in methods used for measuring resistance phenotypes and/or differences in design used in this and the previous study (14). Sampling from a single location and the two-phase study design provides an effective approach to minimizing problems with population structure and type I errors resulting from multiple testing. On the other hand, replicate measurement of IC50s in cultured parasites, as used in the initial study (14), provide high quality in vitro data and can help avoid type II errors (false negatives). We suggest that these approaches should be combined in future candidate gene association studies for drug resistance genes. These largely negative results should not be used as grounds to discard this approach to resistance gene location. Transporter genes remain extremely promising candidates. While eight of nine loci examined do not appear to show associations in southeast Asia, it is possible that some of these loci play a role in drug resistance in other regions of the world. In particular, the associations observed between polymorphisms in G7, G30, and G55 with CQ and/or QN response in African parasites in the work of Mu et al. (14) urgently require reexamination. African parasite populations show less geographical variation in allele frequencies than Asian or South American populations (2). Associations between SNPs and in vitro response are therefore less susceptible to bias due to population structure in African samples. A further caveat is worth mentioning. There is increasing evidence that pfmdr copy number, rather than point mutations, plays the key role in drug resistance in southeast Asia (20-22, 29). If copy number rather than point mutations underlies resistance in other transporters, then screens for association with SNPs may fail to incriminate candidate transporter loci. Examination of copy number and/or expression levels would provide a valuable complementary approach to SNP association for screening candidate transporter loci.
The SMRU is part of the Wellcome-Trust Mahidol University-Oxford Tropical Medicine Research Program supported by the Wellcome Trust of Great Britain. F.N. is a Wellcome Trust Senior Clinical Fellow. This investigation was conducted in a facility constructed with support from Research Facilities Improvement Program grant number C06 RR-98001 from the National Center for Research Resources at NIH. This work was funded by NIH grant AI48071 to T.J.C.A.
Supplemental material for this article may be found at http://aac.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»