Previous Article | Next Article ![]()
Antimicrobial Agents and Chemotherapy, May 2005, p. 2015-2025, Vol. 49, No. 5
0066-4804/05/$08.00+0 doi:10.1128/AAC.49.5.2015-2025.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Francesca Ceccherini-Silberstein,1,
*
Fulvio Erba,1
Maria Santoro,1
Caterina Gori,2
Maria Concetta Bellocchi,2
Sara Giannella,2
Maria Paola Trotta,2
Antonella d'Arminio Monforte,3
Andrea Antinori,2 and
Carlo Federico Perno2
Department of Experimental Medicine, University of Rome "Tor Vergata," Via Montpellier 1, 00133 Rome,1 National Institute for Infectious Diseases "L. Spallanzani," Via Portuense 292, 00149 Rome,2 Institute of Infectious and Tropical Diseases, University of Milan, Hospital Sacco, Via G. B. Grassi 74, 20157 Milan, Italy3
Received 6 August 2004/ Returned for modification 12 October 2004/ Accepted 3 January 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
109 viral particles are produced daily), the accumulation of proviral variants, and genetic recombination (2). Among the different areas of the viral genome, the pol gene, which encodes enzymes such as reverse transcriptase and protease, is subjected not only to natural variation pressure but also to mutations imposed by the pharmacological treatment (6, 31). The HIV-1 protease enzyme is responsible for the posttranslational processing of the viral polyproteins encoded by gag (p55) and gag-pol (p160), thus yielding the mature structural core proteins (the matrix, capsid, nucleocapsid, and p6 proteins) and the essential enzymes (protease, reverse transcriptase, and integrase) required to produce mature infectious particles (32).
For its crucial role in the HIV-1 life cycle, protease represents an important target for the antiretroviral therapy. To date seven protease inhibitors (PIs; indinavir, ritonavir, saquinavir, nelfinavir, amprenavir, lopinavir, and atazanavir) have been approved by the Food and Drug Administration (FDA) and are clinically available. Unfortunately, when antiretroviral therapy fails to be fully suppressive, viral variants with reduced susceptibilities to PIs can emerge (8, 10, 21). Resistance to PIs is mediated by the appearance of protease amino acid substitutions (at positions either in direct contact with the inhibitor or at distant sites) that reduce the binding affinity between the inhibitor and the mutant protease enzyme (6). These amino acid substitutions, defined in the literature as major mutations, may deeply impair the protease catalytic activity and, consequently, the replication capacity of the virus (5, 19). Restoration of the replication capacity, which allows the viability of virus particles, is due to the presence of mutations defined in the literature as compensatory mutations (5, 9). Several studies have contributed to our current knowledge of the drug-related variants of HIV-1 protease. To date, mutations at 50 of 99 residues of protease have been related to one or more experimentally tested PIs, thus attesting to the high degree of flexibility of the protease enzyme (11, 13, 23, 29, 37); 22 of these residues are involved in resistance to the PIs used in clinical practice (13; Stanford HIV Drug Resistance Database [http://hivdb.Stanford.edu]). Generally, single mutations confer only a modest reduction in drug susceptibility (with some exceptions, such as D30N, I47A, G48V, and I50L/V), while for many PIs, particularly if they are boosted with ritonavir, a stepwise accumulation of several protease mutations is required for the development of high-level drug resistance. This requirement of multiple mutations to overcome the activities of PIs has been referred to as a high genetic barrier to drug resistance (7, 16, 22). Furthermore, most drug resistance mutations in the protease confer cross-resistance to many PIs, so this resistance should be considered class specific rather than drug specific (10, 15, 17, 27).
Since protease is characterized by a high degree of structural flexibility and resistance to PIs is a such complex phenomenon, it is conceivable that the mutational pathways of HIV protease are far more complex than is currently known and that some mutations (and associations of mutations) have been not yet precisely defined. For instance, previous work recently identified some positions or mutations positively associated with treatment with PIs; however, their exact role in PI resistance has not yet been precisely clarified (3, 23, 37). Thus, we decided to focus our attention on these uncharacterized (novel) mutations to better understand their association with specific PIs and PI resistance-associated mutations and thus contribute to obtaining a better definition of the protease mutational patterns that regulate PI resistance.
The identification of new mutational clusters also suggests that these additional mutations may interact as part of higher-order networks and strongly suggests that other mutations beyond those currently known to be associated with PI resistance should be considered to define precise algorithms able to predict resistance to antiretroviral drugs.
| MATERIALS AND METHODS |
|---|
|
|
|---|
HIV sequencing. HIV genotype analysis was performed with plasma samples by means of a commercially available kit (The ViroSeq HIV-1 Genotyping System; Applied Biosystems) (3, 26). Briefly, RNA was extracted, retrotranscribed with murine leukemia virus reverse transcriptase, and amplified with the AmpliTaq Gold polymerase enzyme by using two different sequence-specific primers for 40 cycles. The sense and antisense orientations of the full lengths of the pol-amplified products (containing the entire protease gene and the first 320 amino acids of the reverse transcriptase open reading frame) were sequenced by using seven different overlapping sequence-specific primers and an automated sequencer (ABI 3100) (3, 26). Sequences with a mixture of wild-type and mutant residues at single positions were considered to have the mutation(s) at that position. The isolates were subtyped by comparing their sequences to reference sequences of known subtype (http://hivdb.Stanford.edu).
Statistical analysis. To assess the association of protease mutations with PI treatment, we calculated their frequencies in isolates both from untreated patients and from PI-treated patients and performed chi-square tests of independence (based on a two-by-two contingency table) to verify whether the differences in frequency between the two groups of patients were statistically significant (P < 0.05). We considered novel mutations (T12A, A22V, E34Q, E35G, K43T, K45R, K55R, Q58E, I62V, L63Q, H69N, T74S, I85V, Q92K, and C95F) to be those that have not yet been reported to be associated with PI resistance by the Stanford HIV Drug Resistance Database (http://hivdb.Stanford.edu) and by the International AIDS Society (13). We also included in our analysis the novel mutations K20I and K20T, reported by the International AIDS Society to be specifically associated with resistance to atazanavir (approved by FDA in June 2003) and tipranavir (not yet approved by FDA), respectively, and also recently reported in the Stanford HIV Drug Resistance Database.
To assess the association of each novel mutation with treatment with a specific PI, we compared the rate of occurrence of each novel mutation in the subpopulation that experienced treatment with a specific PI with that in the subpopulation that did not experience treatment with that PI; we then performed chi-square tests of independence to verify statistically significant differences (P < 0.05).
We used the Benjamini-Hochberg method (1, 37) to identify results that were statistically significant in the presence of multiple-hypothesis testing. A false discovery rate of 0.05 was used to determine statistical significance.
Mutation covariation. We calculated the binomial correlation coefficient (phi) for all the possible pairwise combinations between novel mutations and the other protease mutations in the set of PI-treated patients. Statistically significant pairwise correlations were those with a P value of <0.05. We used the Benjamini-Hochberg method (1) to identify pairwise combinations that were significant in the presence of multiple-hypothesis testing. A false discovery rate of 0.05 was used to determine statistical significance.
For each pair, two positions each with a mixture of two or more mutations were excluded from the covariation analysis, since it is impossible to discriminate whether these mutations fall in the same viral genome.
Mutational clusters were defined as clusters of three or more mutated positions in which each position was significantly correlated with each other. Mutational clusters were identified by a computational technique that evaluated all possible clusters that can be formed from the significant correlated pairwise combinations of mutated positions. The computational technique was performed with the S-Plus program by implementation of hierarchical clustering algorithms. For the box-plot analysis, we considered D30N, V32I, M46I/L, I47A, G48V, I50L/V, I54A/L/M/S/T/V, V82A/F/S/T, I84V, N88D/S, and L90M to be major mutations (13; Stanford HIV Drug Resistance Database [http://hivdb.Stanford.edu]).
Structural analysis. The X-ray crystallographic coordinates with the code 1HIH.pdb deposited in the Protein Data Bank (PDB; http://www.rcsb.org/PDB/) were used as the template for structural analysis. Deep View (version 3.7) software was used to map amino acids onto the three-dimensional representation of the protease enzyme and to inspect the structures for hydrogen bond formation between residues. To calculate the interatomic distances between statistically significantly correlated residues, a script was developed to measure the distance between each pair of atoms within two different amino acids. The interresidue distances are referred to as the shortest interatomic distance between any atoms in two residues. Residues within 8 Å in the protease structure were considered near enough to interact (37).
Nucleotide sequence accession numbers. The majority of nucleotide sequences from drug-naïve patients have been submitted previously to GenBank (26). The 485 new sequences of isolates from patients treated with at least a PI within the HAART regimens (3) were submitted to GenBank and assigned the following accession numbers: AY855351 to AY855439, AY855441 to AY855458, AY855460 to AY855502, AY855504 to AY855556, AY855558 to AY855773, AY855775 to AY855795, AY855797 to AY855818, AY855820 to AY855836, and AY995408 to AY995555.
| RESULTS |
|---|
|
|
|---|
|
Class II included mutations (Q58E and I62V) that were already occurring in isolates from drug-naïve patients at a frequency >1% but that showed a significant increase (P < 0.01) in isolates from PI-treated patients (Q58E from 1.5% to 5.2% and I62V from 24.5% to 35.9%).
Interestingly, and different from the other classes, class III included mutations (T12A, L63Q, and H69N) that showed a significant decrease in isolates from PI-treated patients compared with that in isolates from drug-naïve patients. In a comparison of drug-naïve and PI-treated patients, the frequency of T12A decreased from 6.3% to 2.8%, that of L63Q increased from 4.2% to 1.6%, and that of H69N increased from 12.7% to 8.1% (P < 0.05 for all comparisons).
Many class I mutations and both class II mutations showed remarkable increases in frequency in isolates from patients who had experienced more than three PIs (Table 1). In particular, for E34Q, Q58E, I62V, and C95F, the increase in frequency observed in isolates from patients who experienced more than three PIs was statistically significant (P < 0.05) compared with that observed not only in isolates from drug-naïve patients but also in isolates from patients who experienced one or two PIs. On the other hand, class III mutations showed a progressive decrease in prevalence, despite experience with increased numbers of PIs (Table 1).
|
Associations between protease mutations. A further step in our study was to assess the association of novel protease mutations with other mutations observed in PI-treated patients, with the focus of our attention on those involved in PI resistance.
Many mutations of class I or class II usually occurred in isolates with one to four major PI resistance mutations. More than three major mutations were found in >50% of the isolates from patients harboring one of the following mutations: E34Q, K43T, K55R, I85V, Q92K, C95F, or Q58E (Fig. 2). Among these, E34Q, K55R, and C95F were never found in the absence of major mutations, whereas isolates with class III mutations had a median number of major mutations ranging from 0 to 1 (Fig. 2).
|
|
Class III mutations did not show any positive correlations with PI resistance-associated mutations; on the contrary, H69N was negatively associated with the compensatory mutations M36I (phi = 0.14) and L10I (phi = 0.13). In addition, H69N showed a weak negative correlation with the major mutation M46I (phi = 0.05), although it was not statistically significant.
Cluster of correlated positions. The mathematical approach used in our study let us define some clusters of mutated positions in which all possible pairs were mutually correlated. Among the class I mutations, K20I, K43T, K45R, K55R, I85V, and C95F were involved in one or more clusters that ranged in size from three to seven mutated positions (Table 3). K45R specifically clustered only with D30N and N88D. K43T was involved in two clusters, both of which included V82A; one was in the presence of I54V, while the other was in the presence of both I47V and V32I. Similarly, C95F was involved in two different and separated clusters. The most frequent cluster (71.4% of isolates with C95F) included L90M with the compensatory mutation I93L, while the other cluster (35.7% of isolates with C95F) included V82A with other major mutations (G48V and I54V). The presence of V82A was also observed in a large cluster involving K55R together with the major mutations M46I and I54V and the compensatory mutations L10I and L24I. Clusters involving I85V contained only I84V as the major mutation. K20I frequently clustered with the major mutation I54V and the compensatory mutation M36I. The other novel mutations of class I (A22V, K20T, E34Q, E35G, T74S, and Q92K) were not found to cluster at a significant level with PI resistance mutations.
|
Locations of novel mutations in the three-dimensional structure of protease. The distribution of the novel mutations was widespread in the HIV-1 protease structure (Fig. 3A), and the mutations were not directly involved in protease binding sites and did not directly affect the dimerization interface (36). However, several novel mutation residues were located at minimal interatomic distances from PI resistance-associated residues. In particular, residues 43, 45, and 55, which are located in the flap, showed interatomic distances <8 Å with several PI resistance-associated residues (residues 43 to 46, 45 to 30, 55 to 46, and 55 and 54), and residue 85 showed an interatomic distance <8 Å with substrate binding sites (V82 and I84). Overall, among the 61 statistically significant correlated pairs of residues, 23 (37.7%) were within 8 Å of each other; and in particular, 4 of the 23 correlated pairs of residues were contiguous (residues 34 and 33, 35 and 36, 55 and 54, and 85 and 84) (Table 2).
|
Finally, among the highly correlated residues, the novel residue 95 showed interatomic distances <4 Å with residues 90 and 93. In particular, the three residues are involved in a hydrogen bond network, and the carbonylic oxygen in the backbone of residue 90 interacts with the nitrogens of residues 93, 94, and 95 (Fig. 3C).
| DISCUSSION |
|---|
|
|
|---|
These results are consistent with recent analyses conducted with a large number of protease sequences by two independent groups. In fact, both class I and class II mutations were included in a list of 22 newly described treatment-associated mutated positions (37). The fact that Wu et al. (37) observed a higher number of new treatment-associated mutated positions may probably be related to the different number of patients analyzed, the patients' characteristics, and the therapeutic regimens used. In addition, most class I and II mutations (K20T, E34Q, K43T, K45R, K55R, T74S, I85V, Q92K, Q58E, and I62V) were included as candidate mutations for drug resistance or positive reproductive fitness by having a ratio of the number of nonsynonymous mutations to the number of synonymous mutations >1, with values similar to those observed for the majority of the already-known PI resistance-associated mutations (4).
Many class I and class II mutations were strongly correlated (both in pairs and in clusters) with PI resistance-associated mutations and rarely occurred in the absence of major mutations, suggesting that they may require the accumulation of PI resistance-associated mutations in the protease enzyme for their emergence. In particular, correlations frequently involved one or more multi-PI resistance-associated mutations at positions 82, 84, and 90, thus confirming the wide cross-resistance that characterizes PIs. We have no evidence how each mutation could influence (enhance) drug resistance or increase replication capacity; it is conceivable that class I and class II mutations may act as minor mutations able to rescue the loss of fitness or to increase the level of resistance. As confirmation of these hypotheses, K20I and K20T were recently included in the set of mutations associated with reduced susceptibility to atazanavir and tipranavir, respectively, even if their exact role has not been determined (13). Parkin et al. (24) included E34Q, K43T, Q58E, and T74S in the list of lopinavir mutational scores used as a predictor of reduced lopinavir susceptibility; in particular, E34Q and K43T are associated with lopinavir resistance in the rules-based CREST algorithm (http://assign.dci.uwa.edu.au/crest/crest5_table.asp). Furthermore, it was observed to be a better predictor of lopinavir resistance by use of an artificial neural network when a set of mutations, including K43T, K55R, Q58E, T74S, I85V, and C95F, was used (34). Consistent with this report, our analysis shows that E34Q, K43T, and K55R occurred in isolates from patients who failed an antiretroviral therapy containing lopinavir/ritonavir and correlated with mutations associated with lopinavir resistance (13, 31). In fact, E34Q was specifically associated with either L33F or F53L, and K43T was specifically associated with I54A and clustered with multi-PI resistance mutations (V82A and I54V or V82A, V32I, and I47V), while K55R clustered with multi-PI resistance mutations (V82A, I54V, and M46I). Therefore, these data strongly suggest that E34Q, K43T, and K55R will need consideration in more than one resistance algorithm to correctly predict lopinavir resistance. In addition, K55R significantly occurred also in patients failing a nelfinavir-containing regimen, thus supporting the previously reported association of K55R with reduced susceptibility to nelfinavir (18).
Similar to K55R, Q58E clustered with positions 10, 46, 54, and 82; in this context, the most common amino acid substitution at position 46 was M46L and not M46I. We then suppose that M46I and M46L may have different compensatory pathways mediated by K55R and Q58E, respectively, thus supporting the hypothesis formulated by Hoffman et al. (12), who suggested that as a consequence of different codon usage at position 46, M46I and M46L lie along divergent evolutionary pathways.
On the other hand, C95F clustered with G48V and V82A, whose copresence reflects prolonged saquinavir exposure (30, 35). As C95F was associated with saquinavir use, we may suppose that this mutation may contribute to saquinavir resistance. However, C95F was also strongly associated with indinavir use and was strongly associated in a cluster with L90M and I93L. Our structural analysis showed that residues at positions 90, 93, and 95 directly interact through a hydrogen bond network that stabilizes the ending loop from positions 92 to 95 of the protease enzyme.
Two mutations (K45R and K20T) were specifically associated with nelfinavir. K45R occurred in isolates from patients who failed a first-line regimen containing nelfinavir and showed a highly specific association with nelfinavir-associated mutations D30N and N88D (25, 28). Kaldor et al. (14) demonstrated that the D30N mutation reduces a hydrogen bond with bound nelfinavir. However, D30N also determines a loss of a negative charge on the molecular surface of the protease enzyme. The restoration of this negative charge is somewhat related to the emergence of N88D, thus helping the overall increase in protease efficiency. K45R, which results in an increased size of the side chain, may contribute to an enlargement of the positively charged area around position 30. K20T significantly occurred in patients who failed nelfinavir as the second PI and was significantly correlated with the multi-PI resistance-associated mutation L90M, suggesting that K20T may contribute to L90M-mediated resistance to nelfinavir. Also, K20I and I62V were strongly correlated with L90M, but they were not associated with any specific PI, thus suggesting that they may play their compensatory roles in the context of cross-resistance.
Although biochemical experiments are required to demonstrate the mechanism of correlation between pairs of residues, our study provides preliminary hypotheses on the importance of these novel mutations. In addition, the fact that many class I and II residues are located at minimal interatomic distances with their positively correlated PI resistance-associated residues in the three-dimensional protease structure confirms a role of the local environment in the interactions among mutations.
Class III mutations are common polymorphisms in isolates from drug-naïve patients, and their frequency decreased in isolates from patients who failed an antiretroviral regimen containing at least one PI, thus suggesting the negative association of these mutations with PI treatment and failure. Class III mutations were rarely found in the presence of major mutations and were never positively correlated with any PI resistance-associated mutations. If anything, H69N showed negative correlations with the minor mutations M36I and L10I. These data suggest a protective role for class III mutations that may prevent the appearance of PI resistance-associated mutations under PI pressure, thus contributing to increases in the level of the genetic barrier to PI resistance. However, this hypothesis needs confirmation in in vitro studies and clinical practice.
It should be noted that all patients analyzed in this study carried the HIV-1 clade B subtype. Further studies should also investigate the prevalence and the role of the novel mutations in non-B subtype clades. In fact, it is known that pathways of viral evolution toward drug resistance may proceed through distinct steps and at different rates among different HIV-1 subtypes (33). Also, we cannot exclude the possibility that novel mutations, and in particular, the observed patterns of correlated mutations, may be the result of pharmacological pressure imposed by the drug regimens that were used by our cohort, while other treatment regimens may lead to the development of pathways in part different from those that we observed. Further studies and extended databases (with genotypic and/or phenotypic data) may provide insights regarding this important point.
In conclusion, our study strongly suggests that PI resistance is a very complex phenomenon and is regulated both positively and negatively by a highly ordered network of mutations, which also reflects the high genetic barrier to PI resistance. On this basis, novel mutations must be taken into account to define more precise algorithms able to correctly predict resistance to antiretroviral drugs.
| ACKNOWLEDGMENTS |
|---|
We thank Ada Bertoli, Alessandra Cenci, Federica Forbici, Roberta D'Arrigo, Fabio Continenza, Andrea Biddittu, and Sandro Bonfigli for sequencing and data management and all the ICONA study group participants and members.
| FOOTNOTES |
|---|
V.S and F.C.-S. contributed equally to this work. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Clin. Vaccine Immunol. | Clin. Microbiol. Rev. |
|---|---|
| J. Clin. Microbiol. | ALL ASM JOURNALS |