ABSTRACT
A homology model of Helicobacter pylori urease was developed by using the crystal structure of urease from Klebsiella aerogenes (EC 3.5.1.5) as a template. The acetohydroxamic acid moiety was docked into the active pocket of the enzyme model, followed by relaxation of the complex by use of molecular dynamics. The resulting conformation was used as a template to construct 24 potential dipeptide hydroxamic acid inhibitors with which comparative molecular field analysis (CoMFA) was performed. The resulting model provided a cross-validation correlation coefficient (q2L00) of 0.610, a conventional r2 value of 0.988, and an F (Fisher indication of statistical significance) value of 294.88. We were able to validate the CoMFA model by using the 50% inhibitory concentrations of six compounds that were not included in the construction of the model. A very good structural correlation was observed between the amino acids in the model urease's active pocket and the contour maps derived from the CoMFA model. This correlation, accompanied by the validation supplied by use of the CoMFA data, illustrates that the model can aid in the prediction and design of novel H. pylori urease inhibitors.
Helicobacter pylori is a gram-negative, spiral bacterium thought to affect about 90% of the world's population (11). It is well accepted that H. pylori infection is etiologically associated with chronic active gastritis, peptic ulcer diseases, mucosa-associated lymphoid tissue-type gastric carcinoma, and other gastric cancers (16). Although H. pylori infection has been implicated as an etiological factor in chronic gastric reflux disease, new studies show that H. pylori infection may provide a protective mechanism against such disease; however, the results of those studies remain controversial (8, 18). Eradication therapy heals gastritis and results in cure of peptic ulcer and the remission of mucosa-associated lymphoid tissue-type gastric carcinomas (22). Although most infections can be controlled by antibiotic therapy (17, 27), H. pylori antibiotic resistance is becoming somewhat commonplace (1). Antibiotic resistance in a microorganism as widespread as H. pylori is a cause for immediate concern and warrants a dedicated search for the discovery of new drug therapies.
H. pylori, characterized by its strong urease activity (5), has received a great deal of attention from the scientific community over the past two decades. It is now clear that for survival the organism requires the production of a urease enzyme to help produce ammonia to counteract the strong acidic environment of the stomach (19). It has been estimated that over 5% of the total protein in the cell is represented by this enzyme (12). The urease reaction not only provides an environment with a pH suitable for H. pylori colonization of the stomach mucosal lining but also provides the mechanism for eventual gastric wall damage that increases the overall likelihood and the severity of gastric ulcers (20). Ureases are ubiquitous in nature and are inhibited, in general, by a variety of agents including fluorides (26), thiols (25), and hydroxamic acids (14). Urease-specific inhibitors are much less common. Recently, several mono-amino acid and dipeptide derivatives containing hydroxamic acid moieties were synthesized and tested for their specific inhibitory activities against H. pylori urease (23). The initial findings suggest that these derivatives are potent, specific inhibitors of H. pylori urease but show little or no inhibitory activity against jack bean urease.
In order to explore the binding parameters associated with these and potentially novel hydroxamic acid inhibitors targeted to the active pocket of H. pylori urease, a homology model was developed by using the urease crystal structure from Klebsiella aerogenes (13) (EC 3.3.1.5) as a template. Acetohydroxamic acid was docked into the active pocket of the homology model developed with this urease, and the most probable configuration of the enzyme-inhibitor complex was assessed by molecular dynamics studies. Comparative molecular field analysis (CoMFA) was then carried out with a variety of dipeptide hydroxamic acid derivatives. Quantitative models obtained by three-dimensional quantitative structure-activity relationship (QSAR) techniques like CoMFA and comparative molecular similarity indices analysis, in which the steric and electrostatic fields sampled at the intersections of one or more lattices spanning a specific three-dimensional region are compared, have shown unprecedented accuracy in predicting specific structure-activity relationships (15).
We have developed by CoMFA a model of 24 dipeptide hydroxamic acid derivatives, using the conformations of structural ligands based on the acetohydroxamic acid-enzyme complex obtained by homology modeling, docking, and finally, molecular dynamics. The predictive value of the model was evaluated and verified with data for compounds not included in the set used to develop the original model. Overlapping of the contour maps derived from the model obtained by CoMFA with the amino acids associated with the enzyme active pocket resulted in a model that provides an initial conceptualization and understanding of the steric and electrostatic requirements for ligand binding to and inhibition of H. pylori urease.
MATERIALS AND METHODS
Data set.A group of 24 dipeptide hydroxamic acid derivatives that were assayed in one laboratory under the same assay conditions was selected for use as the primary set of compounds for which data were obtained. The 50% inhibitory concentrations (IC50s) of the dipeptide derivatives were previously determined by Odake et al. (23), and these data are reported in Table 1. The primary structural variation among these compounds was the amino acid side chain.
IC50 of hydroxamic acid derivatives of dipeptidesa
Computational approaches. (i) Homology modeling.The amino acid sequence for H. pylori urease was retrieved from SWISS-PROT data bank entry URE2_HELPY (5). The X-ray crystal structure of the urease of K. aerogenes, entry 2KAU (13), was obtained via the Protein Data Bank. The construction of the protein model was based on the homologous structure of the K. aerogenes urease, which was used as a template. Amino acid sequence alignment indicated a 61.4% residue identity between the primary structures of the urease enzymes. The three-dimensional model was constructed by copying aligned coordinates of identical residues, building loops, and structural refinement (10). The protein modeling tools available in the computer software package MOE (2000; Chemical Computing Group Inc. Montreal, Quebec, Canada) were used for protein modeling. The partial equalization of orbital electronegativity (PEOE) (7) charges were calculated, and the force field was set to AMBER94 (28). This was followed by geometric optimization of the protein structure, minimizing the structure to a root mean square deviation (RMSD) gradient of 0.01 kcal/mol · Å.
(ii) Docking.The geometrically optimized protein structure was used as a starting point for docking experiments, and PEOE partial charges were calculated. The acetohydroxamic acid was docked into the active pocket of the protein by using the docking protocol of the MOE software package. Specific information identifying the amino acids involved in the formation of the urease active pocket was obtained from the literature, and this information was used to define the docking box or volume to be explored during docking (6). The MOE software package uses grid-based potential fields to calculate interaction energies between flexible ligands and rigid targets. Twenty-five putative complex geometries were generated and optimized during the docking process by an annealing procedure with Monte Carlo simulation (9); however, the final configuration was selected on the basis of similarity to the crystal structure of acetohydroxamic acid as bound to the K. aerogenes urease (24). Energy minimization of the enzyme-inhibitor complex, which was selected from the docking results, was accomplished by using the energy minimization protocol of MOE with chirality constraints to a gradient of 2.0 kcal/mol · Å. The PEOE charges were calculated prior to minimization of the structure using the AMBER94 force field, so that the PEOE charge calculation is independent of the force field.
(iii) Molecular dynamics.The molecular dynamics study was carried out by using a constant temperature and a constant volume of the proposed enzyme-inhibitor complex obtained from docking studies. The enzyme-inhibitor complex was heated from 1 to 300 K over 10 ps with a time step of 1 fs. Data were collected during the following 100-ps equilibrium phase, while the temperature response was fixed at 25. Snapshots were collected every 1 ps throughout the equilibrium phase. Of the 100 conformations obtained in this manner, the 10 conformations showing the lowest potential energy were selected for further minimization to a gradient of 0.01 kcal/mol · Ao, as described above.
Structure alignment.Each of the dipeptide hydroxamic acids was constructed by using the acetohydroxamic acid-H. pylori urease bound conformation as a template. As expected, at physiological pH the amino groups are ionized and were modeled as such. All structures were subsequently imported to Sybyl software (version 6.7, 1995; Tripos Associates, St. Louis, Mo.), with which Gasteiger-Huckel charges were calculated. The biologically active portions of the structures, as illustrated with asterisks in Fig. 1, were overlaid by using the fit atoms protocol available in Sybyl (version 6.7) software (3, 4). The resulting structure alignment of the compounds (RMSD, 0.077 for the fitted atoms) is shown in Fig. 2.
Atoms of dipeptide hydroxamic acid used for the structure alignment. Asterisks, biologically active portions of the molecules.
Structure alignment of 24 dipeptide hydroxamic acids. RMSD, 0.077 Å for the atoms marked with asterisks in Fig. 1.
Generation and analysis of data by CoMFA.The CoMFA tools available in the Sybyl (version 6.7) software were used to calculate steric and electrostatic fields at grid points by using the Lennard-Jones and the Coulomb potential functions of the Tripos Standard field class. An sp3 carbon probe with a charge of +1 was used for the grid calculations. The software program was prompted to automatically generate a single grid, overlapping all entered molecules and extending past them by at least 4 Å along all axes. Steric and electrostatic cutoffs were set to 30 kcal/mol and proceeded with smooth transition, except that the electrostatic contributions were dropped for each row where the steric cutoff was reached. Both steric and electrostatic contributions were calculated by use of a distance-dependent dielectric constant of 4.0 with a smoothing function. The grid spacing was set to 2 Å. Standard CoMFA scaling was applied to give each individual CoMFA field the same potential influence on the resulting QSAR.
Model validation.The partial-least-squares (PLS) method (3) was used to derive a linear relationship between the biological activities and the molecular fields. Leave-one-out cross validation with a 2-kcal/mol column filter was performed by omitting from the analysis those columns (lattice points) with an energy variance less than 2.0 kcal/mol. This allowed determination of the optimum number of components associated with the lowest standard error of prediction.
Four components provided the highest cross-validated r2 (q2) with the lowest standard error of prediction. The cross validation was performed by dividing the total number of compounds into 10 groups. Only models with q2 values over 0.5 and a fraction of variance greater than 0.85 for the optimum number of components were considered further. The model found to be the best by cross validation was used to perform the nonvalidation run.
RESULTS
Homology modeling.Our homology modeling yielded a Z score of 66.7. This is an estimate of the statistical significance of the alignment score (10). The homology obtained between the sequences was 61.4%. An overlay of the K. aerogenes urease and the H. pylori urease is illustrated in Fig. 3A.
(A) Overlay of the urease of K. aerogenes (white; entry 2KAU in the Protein Data Bank) with the homology model H. pylori urease (purple) (Z score, 61.7%); (B) comparison of active pockets (green, H. pylori urease; white, K. aerogenese urease).
Docking and molecular dynamics.Docking, minimization, and molecular dynamics produce an orientation of acetohydroxamic acid in the H. pylori urease model that is very similar to that in the K. aerogenes urease crystal structure. The hydroxamic acid binds to the nickel ions and is very important for activity (6). The overlap of the modeled H. pylori urease-acetohydroxamic acid on the K. aerogenes urease complex obtained from the crystallographic data is shown in Fig. 3B.
CoMFA.The results of our CoMFAs by PLS analysis are as follows: q2 was 0.610, r2 was 0.988, the standard error of prediction was 0.131, and F (the Fisher indication of statistical significance) was 294.55 in tests with four components. With the 24 dipeptides, a q2L00 of 0.609 was obtained with the single optimal principal component. The correlation coefficient obtained by the nonvalidation analysis was 0.988, and the q2 found was 0.610. It is widely accepted that a correlation with a q2 value greater than 0.5 to 0.6 is useful for the prediction of new biologically active molecules (2). The predicted and actual IC50s of the compounds in the test set are provided in Table 2.
Predicted and actual −Log IC50s of the compounds in the test set
DISCUSSION
CoMFA fields.In order to visualize the CoMFA results, the CoMFA contour maps were created by using the data from PLS analysis. These maps help to explain the steric and electrostatic features of the compounds included in our analysis. The electrostatic contour maps are shown as red and blue polyhedra in Fig. 4A, with the red indicating regions in which the electronegative groups in the ligands are associated with increased biological activities. These can be envisioned as electropositive groups within the active pocket of the receptor. In contrast, the blue contours can be visualized as the electronegative groups in the active pocket and indicate where positively charged groups in the ligand have improved biological activity and thus lower the IC50s. The steric contour maps (Fig. 4B) also illustrate the areas in which steric bulk on an inhibitor is favored (green contours). The yellow polyhedra represent regions in which steric bulk is detrimental to binding and hence increases the IC50s by decreasing the overall biological activity. The green polyhedra may be envisioned as hydrophobic cavities in the receptor, which can accommodate hydrophobic groups. In contrast, the yellow polyhedra represent areas already occupied by the receptor, which would thus prohibit any effective binding.
(A and B) electrostatic and steric maps derived by CoMFA overlaid on the H. pylori urease active pocket (acetohydroxamic acid is shown in orange). (A) Blue contours indicate where electropositive groups improve activity, and red contours show where electron-rich groups increase activity. (B) Yellow contours show the sterically disfavored areas of the pocket, and green contours are areas where steric bulk is predicted to have a favorable impact on the biological activity.
After the development of this enzyme-acetohydroxamic acid complex of the H. pylori urease active pocket, we compared the maps prepared by use of the coefficients from PLS analysis of data obtained by CoMFA with the geometric and chemical properties of the binding site. Since CoMFA studies are based solely on a set of active compounds, these maps cannot provide a picture of the binding pocket. Nevertheless, CoMFA does reflect the regions in space where differences in the ligand-probe interaction energy can be correlated with the variance associated with biological activity. Therefore, CoMFA data can be extremely useful in the design of new chemical moieties based on a prediction of their biological activities (2). Figure 4 illustrates the contour maps prepared by use of the coefficients from PLS analysis, in which the contour maps have been overlapped on the active pocket of the H. pylori urease. Figures 4A and B indicate that there exists considerable agreement between the contour maps and the observed positions of the amino acids. Interestingly, a red negative field was observed near the nickel atoms; this is an indication that only electronegative groups that can interact favorably with the positive charge on the nickel ions will be capable of effectively binding to the active pocket. The presence of a large positive field occupying the Asp223, Glu222, and His221 residues, as indicated by a blue contour, demonstrates that a requirement for a positive charge is most consistent with the increased biological activities. We believe that this explains why the NH2 is crucial for biological activity and why either blockage or removal of this group would result in compounds with increased IC50s. The presence of Asp223 in the active pocket also supports our decision to model the NH2 group in its ionized state.
In Fig. 4B, the presence of yellow polyhedra surrounding the NH2 group indicates an area of the pocket which is sterically hindered and which most likely prohibits any effective binding where large groups are present on the ligand. The green contour covering the hydrophobic pocket is created by amino acid residues Met366, Met317, and Ala365; and the presence of these residues helps explain why ligands with polar groups that fit this position correlate with improved biological activities. Indeed, the most active ligands possess hydrophobic substituents that interact in this position. A similar interpretation was obtained from previous QSAR studies with hydroxamic acid inhibitors of urease (21).
Our model, along with previously reported studies of urease inhibitors, indicates that only the inclusion of electronegative moieties like hydroxamic acids, thiols, and phosphoramides that can efficiently complex with nickel ions likely results in biologically active compounds (25). In addition, in illustrating the requirement for an ionizable NH2 group, our model provides essential information for the design of novel H. pylori urease inhibitors. Finally, our model identifies a hydrophobic region whose identification should provide a significant advantage in the design of inhibitors.
Overall, we have developed a robust three-dimensional QSAR model that not only explains the variance in biological activities of a set of specific dipeptide hydroxamic inhibitors of H. pylori urease but also correlates with the homology model of this urease. In addition, we have incorporated active site-based information in our CoMFA studies to aid in the prediction and design of structural properties for novel hydroxamic acid molecules. We are synthesizing several hydroxamic acid derivatives to be evaluated against H. pylori urease in vitro and to help further refine our model.
ACKNOWLEDGMENTS
A.L.P. and J.S.W. are grateful to the Chemical Computing Group for the MOE software package.
Financial support for this project was provided in part by grants from the Centers for Disease Control and Prevention (grants CDC U50-CCU418839 and CDC UR3-CCU418652).
FOOTNOTES
- Received 3 July 2001.
- Returned for modification 1 March 2002.
- Accepted 1 May 2002.
- Copyright © 2002 American Society for Microbiology