Previous Article | Next Article ![]()
Antimicrobial Agents and Chemotherapy, August 2002, p. 2409-2419, Vol. 46, No. 8
0066-4804/02/$04.00+0 DOI: 10.1128/AAC.46.8.2409-2419.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
GlaxoSmithKline, Collegeville, Pennsylvania 19426-0989
Received 19 April 2001/ Returned for modification 27 January 2002/ Accepted 24 April 2002
|
|
|---|
|
|
|---|
|
|
|---|
It is also possible to assign a two- or three-digit hyphenated code based on the number of nonsusceptible results (1s) in the binary string. The first digit in the hyphenated number represents the total number of nonsusceptible results seen in the string. Each unique string with the same number of nonsusceptible results would be given a new second digit, and the process would be continued until each antibiotype has a unique number designation. For the present analysis a single breakpoint based on National Committee for Clinical Laboratory Standards (NCCLS) interpretations for susceptibility was used for each organism-drug combination. For a more sophisticated analysis, the intermediate category could be incorporated or an artificial breakpoint for the detection of very low level resistance could be used.
|
|
|---|
|
|
|---|
Diversity indices have been used in population biology and ecology as a comparative measure of genetic diversity across different populations. The haplotypic or nucleon diversity estimate (h) of Nei and Tajima (20) was initially developed to measure the heterozygosity of nuclear loci, but it has also been used to represent the diversity of clonal lineages, such as mitochondrial DNA genotypes (19). Since antibiotic susceptibility is genetically determined, h can also be used to estimate antibiotype diversity both within and between different countries and years. The low and high range of values for h are 0.0 and 1.0, respectively. Thus, instances in which all isolates in the population have the same antibiotype would have the lowest diversity (h = 0.0), while, conversely, if all isolates in a population had unique antibiotypes, the diversity would be the highest (h = 1.0). Since h is an aggregate estimate of diversity based on both the number of different antibiotypes and their frequencies of occurrence relative to those of other antibiotypes, h can be considered a comparative measure of the levels of multidrug resistance in a bacterial population.
In population genetics, relationships between populations are often constructed from samples of gene frequency and/or DNA haplotype frequency data, in which haplotype is equivalent to allele in classical genetics, except that it refers to any DNA segment (large or small), but not necessarily a gene proper. Antibiotic resistance is genetically determined, and thus, antibiotype (i.e., our scheme of 0s and 1s describing susceptibility and nonsusceptibility, respectively) can be considered analogous to a descriptor or at least representative of a haplotype. Thus, the frequencies of various haplotypes (antibiotypes) between populations (in different countries and years) can be used to assess relationships between the populations of species in the Alexander Project collection. From the frequency of each antibiotype for each country and year, a commonly used measure in population genetics known as genetic distance Neis (18) can be calculated. From a matrix of such genetic distances, for any particular species, phylogenies can be reconstructed depicting the relationships and relative changes between various countries and years for which surveillance data exist.
|
|
|---|
|
|
|---|
|
|
|---|
The 2000 NCCLS breakpoints were applied (17). Those antibiotics without NCCLS breakpoints were analyzed by using the NCCLS breakpoints for agents in the same class. It should be noted that these breakpoints were used for statistical purposes only. For this type of analysis, breakpoints can be assigned on the basis of various criteria, as long as the same criteria are used for the entire data set. For the analysis performed with S. pneumoniae, doxycycline and ciprofloxacin were analyzed by using the breakpoints for tetracycline and ofloxacin, respectively. For the analysis performed with H. influenzae, amoxicillin, erythromycin, and doxycycline were analyzed by using the breakpoints for ampicillin, azithromycin, and tetracycline, respectively. The percentage of isolates that were considered susceptible to all of the antimicrobials observed was determined for each year and country. In addition, strings of 0s and 1s were converted to five-digit antibiotype numbers. However, this approach was found to be too cumbersome, and most analyses were performed with a two- or three-digit number based on the frequency of nonsusceptible designations. The frequencies of unique antibiotypes were determined by year and country.
PCA. Distributions of MIC data are generally not symmetric. For modeling purposes, transformation of the MIC data to achieve a close-to-symmetric distribution is common. For example, MIC distributions are often summarized by using the geometric mean, which is based on log MIC, a close-to-symmetric transformation of the MIC data. Such transformations are essential for data modeling, mainly to make the models more efficient (reliable) and to remove an undue influence of high values (in this case, high MICs) on the model. For PCA, therefore, data were transformed to achieve a closer-to-symmetric distribution. A log transformation was used.
We examined the distribution of antimicrobial MICs for S. pneumoniae isolates in Spain from 1992 to 1997. PCA was carried out with the log MICs by using SIMCA software (version 8.0, 2000; Umetrics, AB, Umea, Sweden.). This software has no limitations in the allowable number of rows or columns of a data table and executes rapidly. The analyses conducted for the study described in this paper were run with SIMCA software in a few seconds. The mathematical complexities of PCA are beyond the scope of this paper; for introductory discussions, see the text of Morrison (16). Briefly, multidimensional data are projected into a lower-dimensional space, which allows a hidden structure to be revealed while at the same time conserving the variation in the data. By the methodology with SIMCA software, the principal components are summarized graphically by using score and loading plots. The coordinates of the observations in the lower-dimensional space are the scores. In order to interpret the score plot, it is necessary to know which variables are influential in the model and how they are correlated, and this information is provided by the loadings. Variables contributing similar information, for example, are correlated and are grouped together in the loading plot. Variables which are negatively correlated are located on opposite sides of the plot. The farther from the plot origin that a variable lies, the greater the impact that it has on the PCA model. By relating the score and loading plots, one is able to interpret the PCA model and gain understanding of the data set.
Population genetics.
Antibiotype diversity was quantified by using the h value of Nei and Tajima (20), where
![]() |
Evolution of antibiotic resistance. The principle of MP involves the identification of a topology that requires the smallest number of evolutionary changes to explain the observed differences among the entities under study. Parsimony can be conducted by a variety of methods. The simplest method and the one which imposes minimal constraints upon character state changes is known as the Wagner parsimony method (1, 7). This method allows free reversibility of characters. This means that the probability of a change from a 0 to a 1 is just as likely as the reverse. A consequence of this reversibility is that the length of the tree and the order of branching are independent of the position of the root. However, the assignment of the root (the ancestral branch) then provides an indication of the directionality of evolution for that particular data set. For the data in question, trees were rooted at the all-susceptible condition, i.e., the antibiotype represented by 0s for all antibiotics. This analysis therefore depicts the pattern of evolution of antibiotic resistance starting from an organism that is susceptible to all 15 antibiotics. We believe that this is reasonable, since the all-susceptible condition is the most frequent antibiotype for all countries and years, and thus, much of the evolution of multidrug resistance for a particular year and country must ultimately arise from this state. The data were divided into individual years and countries because it is not possible to analyze together data for all years for any particular country. This is because the number of possible antibiotypes over the period from 1992 to 1998 vastly exceeds the number of phylogenetically informative positions, which results in millions of equally most parsimonious trees and eliminates any meaningful interpretation. We argue that dividing the analysis into country and year is logical because it represents a snapshot in time that certainly must capture a significant portion of the development of multidrug resistance for that period. Our purpose here is to illustrate the possibilities of such an approach by analyzing a few examples; data for the collection of S. pneumoniae and H. influenzae isolates from the United Kingdom in 1998 were chosen to illustrate the approach. A more detailed report involving all countries and years will be presented elsewhere.
Wagner parsimony was conducted with the program PAUP* (version 4.0b4a) by using a heuristic search with stepwise random additions of the antibiotypes and branch swapping via nearest-neighbor interchanges. Majority-rule consensus trees were computed from the set of most parsimonious trees with the LE50 option in effect; this option retains groups on less than 50% of the trees as long as such groups are compatible with those already on the tree. Antibiotic characters were subsequently traced on the framework of the most parsimonious tree by using McClade software (15). The presence of phylogenetic signal in these matrices of 0s and 1s was assessed by relative apparent synapomorphy analysis (14) with RASA (version 2.2) software (J. Lyons-Weiler, RASA version 2.2 for the Mac [http://loco.biology.unr.edu/archives/rasa/rasa.html], 1998).
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Antibiotype variability for S. pneumoniae from 1992 to 1998a
|
|
View this table: [in a new window] |
TABLE 2. Antibiotype variability for H. influenzae from 1994 to 1998a
|
Variability peaked for the H. influenzae isolates obtained from the United Kingdom, Germany, Italy, France, and Spain in 1995 (Table 2). It is difficult to comment on the variability of the isolates in the United States because a large number of isolates that may have biased the results were tested in 1998. Except for the isolates obtained from Germany in 1998, variability among the H. influenzae isolates did correlate directly with the number of isolates tested, and no consistent trend toward an increase in a larger number of antibiotypes was observed.
One interesting observation that is not readily evident when surveillance data are presented in a traditional fashion (i.e., percent susceptibilities or MIC90s) is the number of isolates that are completely susceptible (all 0s) to all of the antimicrobials tested. When the data are analyzed by the conversion of the susceptibility pattern into a binary code, the percentage of isolates susceptible to all antimicrobials tested becomes readily evident. The percentages of S. pneumoniae and H. influenzae isolates that were susceptible to all drugs tested are shown in Table 3 and Table 4, respectively, by year and country. The data presented show that for the S. pneumoniae isolates (Table 3) there was a slight trend toward an increase in the percentage of isolates in France and Germany that were susceptible to all antimicrobials tested. For the isolates in Italy, the United Kingdom, and the United States, there was a trend toward a decrease in the percentage of isolates that were susceptible to all antimicrobials tested.
|
View this table: [in a new window] |
TABLE 3. Percentage of completely susceptible S. pneumoniae isolatesa
|
|
View this table: [in a new window] |
TABLE 4. Percentage of completely susceptible H. influenzae isolatesa
|
PCA. For Spain for the years from 1992 to 1997 there are 15 antimicrobials, thus representing 15 dimensions or columns in the data table and 1,447 observations (bacterial isolates). These 15 dimensions and 1,447 observations were represented by three principal components. These components describe 77% of the data; this is analogous to R2 in regression analysis and represents the percentage of the total variation of MICs explained by the PCA model. Therefore, the data from the 15 dimensions are well summarized and interpretable in only 3 dimensions.
The loadings plot for the first two principal components for S. pneumoniae in Spain from 1992 to 1997 is shown in Fig. 1. All ß-lactams are clustered in the upper right-hand corner; thus, their MICs are strongly correlated. The co-trimoxazole MICs are closely correlated with the ß-lactam MICs. The macrolide MICs are also highly correlated and are located in the lower part of the graph. The MICs of chloramphenicol and doxycycline are closely related to each other but are dissimilar from those of the other classes of antibiotics. The MICs of the quinolones are clearly unique and, being close to the origin, exert no influence on the first two components. From the loadings one can interpret that the first component separates low from high MICs and the second component separates isolates for which MICs are high by antibiotic class.
![]() View larger version (16K): [in a new window] |
FIG. 1. PCA loadings plot of log MICs for isolates from Spain from 1992 to 1997. p[1], component 1; p[2], component 2; cot, co-trimoxazole; chl, chloramphenicol; dox, doxycycline; the cluster at the upper right consists of ß-lactams; the cluster at the bottom consists of macrolides; the cluster at the upper left consists of the quinolones (ofl, ofloxacin; cip, ciprofloxacin).
|
![]() View larger version (22K): [in a new window] |
FIG. 2. PCA score plots of log MICs for isolates from Spain from 1992 to 1997 (a to f).
|
![]() View larger version (20K): [in a new window] |
FIG. 3. PCA score plots showing log MICs for major clusters of isolates from Spain from 1992 to 1997.
|
|
View this table: [in a new window] |
TABLE 5. MIC groupings over timea
|
The methodology described above can also be applied to the antibiotypes, and expectedly, the results are similar, with some notable exceptions. First, the loadings plot for the first two components illustrates that the clusters for the antibiotic groups are similar to those obtained by evaluation of log MICs, with the exception that the results for amoxicillin and amoxicillin-clavulanic acid do not correlate as strongly with those for the rest of the ß-lactam group (Fig. 4). Thus, at the antibiotype level, the results for amoxicillin and amoxicillin-clavulanic acid are distinct from those for the other ß-lactams. This distinction is due to increased susceptibility to both drugs.
![]() View larger version (15K): [in a new window] |
FIG. 4. PCA loadings plot for antibiotypes of isolates from Spain from 1992 to 1997. p[1], component 1; p[2], component 2; cot, co-trimoxazole; chl, chloramphenicol; ofl, ofloxacin; cip, ciprofloxacin; dox, doxycycline; amx, amoxicillin; aug, Augmentin (amoxicillin-clavulanic acid); axo, ceftriaxone; fur, cefuroxime; fac, cefaclor; pen, penicillin; fix, cefixime; the cluster at the bottom represents macrolides.
|
Population genetics. Estimates of haplotypic diversity (h) for S. pneumoniae ranged from 0.27 (United Kingdom, 1994) to 0.90 (France, 1995). The order of increasing diversity (mean ± standard deviation h) by country was as follows: United Kingdom (h = 0.49 ± 0.15), Germany (h = 0.50 ± 0.15), United States (h = 0.64 ± 0.13), Italy (h = 0.72 ± 0.14), France (h = 0.84 ± 0.06), and Spain (h = 0.86 ± 0.04). The order of increasing diversity (mean ± standard deviation h) by year is as follows: 1993 (h = 0.53 ± 0.21), 1992 (h = 0.64 ± 0.20), 1996 (0.65 ± 0.21), 1995 (h = 0.67 ± 0.22), 1997 (h = 0.71 ± 0.15), 1994 (h = 0.71 ± 0.16), and 1998 (h = 0.77 ± 0.18). Antibiotype diversity remained generally high among isolates from France, Italy, and Spain throughout the sampling period (Fig. 5A). A trend of annual increases in antibiotype diversity was clearly evident for isolates from the United Kingdom and the United States. Nearly all countries showed marked increases in antibiotype diversity in 1998.
![]() View larger version (85K): [in a new window] |
FIG. 5. Histograms of diversity estimates (h) for S. pneumoniae (A) and H. influenzae (B) antibiotypes. The method of Nei and Tajima (20) was used to calculate h, where 0 and 1 represent low and high levels of genetic diversity, respectively. For both species, no data were collected for Spain in 1998.
|
Population phylogenetic trees (Fig. 6 and 7) drawn from genetic distance values computed with the antibiotype frequency data resulted in certain clusters of years and countries. For example, in the S. pneumoniae data, Spain formed a monophyletic group composed of years 1992 to 1997 and was joined by a group composed of predominately French isolates of various years (Fig. 6). Isolates from Italy also formed a monophyletic group to the marked exclusion of isolates from all other countries. The other principal grouping in this S. pneumoniae population tree was a mixed clade of isolates from Germany, the United Kingdom, and the United States. Branch lengths for isolates from France, Spain, and Italy were much longer than those for isolates from the other countries, suggesting increased rates of antibiotic resistance evolution in those countries. The respective monophyletic clusters of the isolates from the various years from Spain and Italy suggest the possibility that S. pneumoniae isolates in these two countries comprise genetic races distinct from those in the rest of the world.
![]() View larger version (21K): [in a new window] |
FIG. 6. Fitch-Margoliash phylogenetic tree determined from the S. pneumoniae susceptibility data; branch lengths are drawn proportional to the amount of genetic change.
|
![]() View larger version (15K): [in a new window] |
FIG. 7. Fitch-Margoliash phylogenetic tree determined from the H. influenzae susceptibility data; branch lengths are drawn proportional to the amount of genetic change.
|
Evolution of antibiotic resistance. An MP analysis of the antibiotype data for S. pneumoniae isolates from the United Kingdom from 1998 resulted in 11,632 MP trees each with a length of 39, with a consistency index (CI) of 0.3846 and a retention index (RI) of 0.7551. A majority consensus tree of the set of MP trees (Fig. 8) resulted in several clades that appeared in the entire set of MP trees and several further groupings that appeared in at least 70% of the set of MP trees. This obvious structure to the data concomitant with the moderately high and very high values for CI and RI, respectively (7), indicates that the data provide phylogenetic signal. Additionally, more statistics-based measures, such as relative apparent synapomorphy analysis (14) and g1 (skewness) statistics (10), also support the presence of phylogenetic signal in these data (tRASA = 1.951 [P < 0.05]; g1 = -0.406671 [P < 0.01]). Furthermore, the score of the MP tree (score, 39) is much less than the mean score of a set of 100,000 random trees (score, 94.7), and there is a marked skewness to the distributions in tree lengths obtained with these data (the basis of the g1 statistic), which in turn is widely regarded as an indicator of a phylogenetic signal (5, 8, 10). This view is based primarily on the fact that such skewed distributions of tree length allow a much better discrimination among the near-optimal solutions.
![]() View larger version (50K): [in a new window] |
FIG. 8. Majority consensus of the set of most parsimonious trees determined from analysis of S. pneumoniae isolates from the United Kingdom from 1998. White boxes, susceptibility to the indicated antibiotic; black boxes, resistance to the indicated antibiotic. The numbers on the branches adjacent to the dashed markers refer to the origin or loss of characters, where the characters are the 15 antibiotics. A minus sign indicates the loss of a character, and a lack of a minus sign indicates the origin of that character. Underlined numbers in boldface type refer to the percentage of times that a particular node appears in the set of most parsimonious trees. Augmentin, amoxicillin-clavulanic acid.
|
MP analysis of H. influenzae isolates from the same country and the same year (United Kingdom, 1998) (Fig. 9) yielded three MP trees each with a length of 16, with a CI of 0.5625 and an RI of 0.6667. The small number of MP trees and the relatively high values for CI and RI support the presence of phylogenetic signal in these data. Relative apparent synapomorphy analysis and g1 statistics also support the presence of phylogenetic signal in this data set (tRASA= 4.34 [P < 0.0001]; g1 = -0.306106 [P < 0.01]). The mean score of a set of 100,000 random trees constructed from these data is 25.5 (which differs from the value for the MP tree by 9 steps), and the distribution of tree lengths is markedly skewed, providing yet further support for the presence of phylogenetic signal. In this case the first type of antibiotic resistance to evolve from the condition of susceptibility to all antimicrobials is resistance to co-trimoxazole. Other trends evident in the resulting tree include the evolution of amoxicillin resistance before ampicillin resistance, and the two of these were preceded in turn by the development of resistance to erythromycin.
![]() View larger version (42K): [in a new window] |
FIG. 9. Majority consensus of the set of most parsimonious trees determined from analysis of H. influenzae isolates from the United Kingdom from 1998. See the legend to Fig. 8 for an explanation of the various tree descriptors. Augmentin, amoxicillin-clavulanic acid.
|
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»