Previous Article | Next Article ![]()
Antimicrobial Agents and Chemotherapy, December 2002, p. 3892-3899, Vol. 46, No. 12
0066-4804/02/$04.00+0 DOI: 10.1128/AAC.46.12.3892-3899.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Fermalogic, Inc., Chicago, Illinois 60612,1 Biochemie, GmbH, R&D Microbiology, Kundl, Austria2
Received 20 May 2002/ Returned for modification 18 July 2002/ Accepted 15 August 2002
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Erythromycin is a clinically important macrolide antibiotic; it is also the parent molecule for several commercially successful semisynthetic macrolide and ketolide derivatives (6). As such, it would be of both great biological and commercial interest to understand the regulatory mechanisms operating in Saccharopolyspora erythraea that control the level of production of this important antibiotic.
The erythromycin gene cluster was cloned prior to the advent of automated DNA sequencing; consequently, it was analyzed in fragments by several different laboratories. This could explain how the cluster had come to be referred to as "entirely known" (8, 9) when, in fact, significant information regarding a 6-kb region immediately downstream of eryCI was lacking. This oversight persisted, despite reports in the earlier literature that suggested that this region contained a sought-after pathway-specific regulatory gene(s) (12, 30). Furthermore, this region might have been predicted to contain additional early-stage deoxysugar biosynthesis genes, for example, a dTDP-glucose synthase or a dTDP-glucose dehydratase, that had been found in the tylosin biosynthetic pathway gene cluster but not yet in the erythromycin gene cluster (19, 28).
In this study, then, we sought to determine the probable identities of the genes contained in this region contiguous with the erythromycin gene cluster and whether they are required for erythromycin biosynthesis. In so doing we completed the analysis of the erythromycin gene cluster, a project that had its beginnings 20 years ago with the cloning of the erythromycin resistance gene (29).
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Plasmids constructed for DNA sequencing. The eryCI-flanking region was originally cloned in plasmid pMW3 during an earlier study (32). A 6.7-kb HindIII-BglII fragment from pMW3 was subcloned into BamHI-HindIII-digested pUC19 to make pFL2000.
The remaining portion of the eryCI-flanking region was cloned from pMW3 by PCR to produce a fragment covering the region from the unique BglII site at the 3' end of open reading frame (ORF) ORFL2 (Fig. 1, site 11) to the 3' end of ORFL4. The reaction was performed according to the protocol of the manufacturer with the Expand Long Template PCR kit (Roche Molecular Biochemicals, Indianapolis, Ind.) by using kit buffer 3. The amplification reaction involved a heating step at 98°C for 4 min, followed by 30 cycles at 98°C for 30 s, 55°C for 1 min, and 68°C for 10 min. The PCR fragment was cloned into pCRtopo (Invitrogen, Carlsbad, Calif.) to form pFB46.
|
Both strands of the inserts from plasmids pFL2000 and pFB46 were sequenced by Microsynth GmbH (Balgach, Switzerland). The insert from plasmid pFL2070 was sequenced at ACGT, Inc. (Northbrook, Ill.).
Plasmids for deletion of eryCI-flanking region.
Plasmid pFL8, which facilitates integrative transformation into S. erythraea and autonomous replication in Escherichia coli and Streptomyces lividans (which allows screening for clones in E. coli according to the blue or white color obtained with 5-bromo-4-chloro-3-indolyl-ß-D-galactopyranoside), was constructed through the ligation of MseI-digested pIJ487 (31) to NdeI-digested pBS+ (Stratagene, Inc., La Jolla, Calif.). MseI and NdeI have compatible overlapping ends. Two smaller MseI fragments were eliminated from pIJ487 during this step, with a portion of the unneeded fd terminator and a portion of the unneeded melC2 gene removed. The ligation mixture was transformed into E. coli DH5
-e, selecting for ampicillin resistance (Apr), and plasmid DNA was prepared from a collection of primary transformants. This mixture of plasmid DNA was transformed into S. lividans TK21, selecting for thiostrepton resistance (Thr). DNA from the Thr Streptomyces lividans transformants was transformed back into E. coli, and the resulting plasmid was designated pFL6. To create pFL8, plasmid pFL6 was digested with BglII and then religated; this removed the polylinker that was originally carried on pIJ487.
Plasmid pFL2067 was constructed to delete a 6.0-kb portion of the eryCI-flanking region from the chromosome. pFL2063 was first constructed from one 1.8-kb EcoRI-BglII DNA fragment (clone 1; Fig. 2A and C) carrying ORFL4 and part of ORFL3 generated by PCR with primer pair K16 (5'-CGGAATTCCACCGAGTAGTCGTTGGC-3' [the EcoRI site is indicated in boldface]) and K17 (5'-GAAGATCTGCCACGATGAAGTACGT-3' [the BglII site is indicated in boldface]). Another HindIII-BglII DNA fragment (clone 2; Fig. 2A and C) carried DNA from the entire eryCI gene and a part of ermE and was generated by PCR with primer pair K18 (5'-GTAGATCTCCGCGGGACCGTATCGA-3' [the BglII site is indicated in boldface]) and K19 (5'-GAAAGCTTCTCCCGCAACGACTTCG-3' [the HindIII site is indicated in boldface]) (Fig. 2).
|
After the amplification steps, the PCR products were run out on a 1% agarose gel and purified with the Geneclean II kit (Bio 101). The purified DNA was digested with EcoRI-BglII (from primer pair K16-K17) and HindIII-BglII (from primer pair K18-K19) in double digestions. After another purification step, the two PCR products were ligated in a three-component ligation reaction to EcoRI-HindIII-digested pFL8
G (pFL8 with the unique BglII site removed) to form pFL2063. In a subsequent step, the 1.2-kb kanamycin resistance (Kmr) gene from pUC4K (Amersham Pharmacia Biotech, Piscataway, N.J.) was ligated as a BamHI fragment into the BglII site between the two cloned PCR fragments in pFL2063. The ligation mixture was transformed into E. coli DH5
-e, and the cells were spread on Luria-Bertani plates containing ampicillin (100 µg/ml) and kanamycin (40 µg/ml). Kmr and Apr colonies were screened for the presence of the cloned PCR products by ENase digestion. The plasmids were digested with EcoRI, BglII, and HindIII in single and double digestions to verify the presence of the correct inserts. Several recombinant plasmids were obtained and were designated pFL2067 (Fig. 2A).
One-step integration and eviction of pFL2067. Plasmid pFL2067 was chromosomally integrated into S. erythraea by standard procedures (34), except that primary transformants were selected by using kanamycin (10 µg/ml) instead of thiostrepton. Primary transformants appearing on R2T2 plates with kanamycin were patched onto a series of three E20A plates containing thiostrepton at 15 µg/ml, kanamycin at 10 µg/ml, and no added antibiotic, respectively. Of 96 colonies tested, all 96 grew on the kanamycin-supplemented plates and the plates containing no drug. Ninety-one colonies grew on the thiostrepton-supplemented plates. The five putative Kmr thiostrepton-susceptible (Ths) strains with gene replacements were designated FB50a to FB50e.
Southern blotting verification of deletion in FB50 strains. Chromosomal DNA from strains FB50a to FB50e was prepared (13) and digested with NotI. The Southern blotting procedure (Roche Molecular Biochemicals) was performed, with the prehybridization and hybridization steps carried out at 60°C (Fig. 2). Hybridizations were performed in the presence of an eryCI probe (for the size and the location of the probe, see Fig. 2C). The 4.4-kb hybridizing band, band A, which corresponds to the native eryCI-flanking region and which extends into the ermE gene, appears only in the parent strain, BC1. In FB50 deletion candidates, band A is changed to a 5.1-kb band, band B.
PCR verification of deletion in FB50 strains. The approach described above produces a unique PCR fragment that would be created only if the Kmr gene was substituted for the deleted region. PCR primer pair K68-K69 (sequences below) was designed to amplify a 1,656-bp fragment that comprised the 1,268-bp Kmr gene and 388 bp of DNA located on both sides of the resistance gene (Fig. 2). Primer K68 (5'-GGAATTCTGG CTCAACTGCGCAGACAA-3' [where the boldface indicates EcoRI]) anneals in the eryCI-flanking region 175 bp upstream of the Kmr gene. Primer K69 (5'-CAGAGCTCTTGATCTGCGTTCGGCTC-3' [where the boldface indicates SacI]) anneals in the eryCI gene 213 bp downstream of the Kmr gene. Both primer sequences are present in the strain BC1 chromosome, but no amplified product is expected in the parent strain because the 2-min extension reaction used would not amplify a PCR fragment that large. The estimated size of each of the fragments in all three lanes is 1,692 bp (extrapolated from a standard curve of the 1-kb ladder), which is very close to the known size of 1,656 bp. No amplified product of any size is observed in the lane containing strain BC1 DNA (lane 4). The sizes and the presence of the amplification products in lanes 1 to 3 confirm that the deletion has occurred in strains FB50a to FB50c. PCR conditions were as follows: total chromosomal DNA was isolated from three deletion candidate strains (strains FB50a to FB50c) and BC1 as described previously. The final Mg2+ concentration was 1.5 mM. All other reaction components were added as described above. Five units of Taq polymerase (MBI Fermentas, Vilnius, Lithuania) was used in all reactions. The cycling program consisted of a heating step without Taq polymerase for 5 min at 95°C, followed by 25 cycles of 1 min at 95°C, 1 min at 60°C, and 2 min at 72°C. A final extension cycle at 72°C for 7 min was performed to fill in protruding ends.
Shake-flask fermentation for production of erythromycin. Fermentations were performed on a Lab-line model 3525 incubator-shaker. Working stock spore suspensions were maintained at -80°C in 20% glycerol. Samples (25 µl) of spores from working stocks were transferred aseptically to capped test tubes (16 by 125 mm) containing 4 ml of sterile SCM broth (18). Test tube (seed) cultures were grown for 2 days at 32°C, with shaking at a 10° angle at 375 rpm (calibrated with a strobe light) on a 1-in. rotary displacement. Soluble complete medium (SCM) cultures (25 ml) containing 1.3 ml of soybean oil were started from 2.5 ml of the seed cultures and were allowed to grow for 5 days at 32°C and 375 rpm with a 1-in. rotary displacement. After a 5-day fermentation, the flasks were reweighed, and their weights were adjusted to the original weights by the addition of water to compensate for evaporation. The cultures were streaked onto agar plates to check for contamination.
Bioassay for erythromycin. A double-agar layer system on a large plate (245-mm2 bioassay dish; catalog no. 431111; Corning Costar, Cambridge, Mass.) was used for the bioassay for erythromycin. The bottom agar layer consisted of 100 ml of tryptic soy broth (TSB) agar. Once the agar solidified (after it was allowed to sit for 30 min to 1 h at room temperature), a top agar layer was poured. Top agar consisted of 100 ml of TSB agar containing 250 µl of 1% triphenyl-tetrazolium red and a sufficient quantity of Thr Bacillus subtilis spores to produce a confluent lawn of growth. The upper layer was allowed to solidify at room temperature for 1 h. Broth samples (15 µl) from red variant strains FL1347 and FL2067 were spotted directly onto 1/4-in. bioassay discs (Schleicher & Schuell, Keene, N.H.) and allowed to dry for 30 min. Broth samples from strains producing larger amounts (strains BC1 and FB50) were first diluted 1:10 before being bioassayed to keep the erythromycin concentrations within the values on the standard curves. Stock erythromycin A solution (100 mg/ml) was prepared in 95% ethanol. Standard erythromycin A solutions to be used for the bioassay were prepared at 25, 50, 100, and 250 µg/ml in Tris-EDTA (pH 8.0) buffer. The bioassay plate was incubated overnight at 37°C. After incubation, the diameters of the inhibition zones were measured and converted to concentrations by using the standard curve produced for each plate.
Thin-layer chromatography. Thin-layer chromatography was performed by a previously described method (34), except that only a 1-ml sample of broth, with the appropriate scaling down of the amounts of the reagents used, was assayed.
HPLC. For determination of erythromycin A and by-products, a method based on high-performance liquid chromatography (HPLC)-mass spectrometry (MS) was used. Measurements were carried out on an Agilent Technologies 1100 series LC/MSD system that comprised a capillary pump, a micro-vacuum degasser, an autosampler, a thermostatted column compartment, and an MSD detector. The LC-MSD system was used with an electrospray ionization source. Complete system control and data evaluation were done on an Agilent Technologies ChemStation for LC-MS. The HPLC separation was carried out on an Asahipak ODP-50, 5-µm column (125 by 2.0 mm; Agilent Technologies) with an ammonium formate buffer-acetonitrile mobile phase mixture. Electrospray ionization-MS in single-ion monitoring mode was used for detection.
For calibration, an erythromycin A solution of known concentration was used.
Database searches and sequence alignments. The DNA sequence of the eryCI-flanking region was subjected to searches by use of the BLASTN, BLASTP, and BLASTX programs (1). Searches based on conserved protein family motifs were also performed.
Nucleotide sequence accession number. The nucleotide sequence of the eryCI-flanking region has been submitted to the GenBank database and has been given accession number AF487998.
| RESULTS |
|---|
|
|
|---|
DNA sequence analysis of eryCI-flanking region. The eryCI-flanking region (Table 2) begins with a 3,037-bp region next to eryCI for which the BLASTX and BLASTN programs found no significant sequence similarities. FramePlot analysis (3, 17) of this region does not depict clearly defined ORFs, in contrast to the region to the right, within the ery cluster, where ORFs are densely packed and can be clearly visualized (Fig. 1). One potentially significant hairpin-loop structure (12-bp stem, two mismatches) was found between nt 7670 and 7703, and a 12-bp perfect direct repeat was between nt 7789 and 7825; both of these were close to the 3' end of the eryCI gene.
|
The GNAT family also includes the histone acetyltransferases of eukaryotes. These enzymes catalyze the acetylation of lysine residues in histones, leading to transcriptional activation; however, the analysis with the BLASTP program did not identify this specific type of protein as homologous to ORFL1. Considering the close proximity of ORFL1 to the erythromycin gene cluster, it might have been thought to be a tailoring enzyme, but to the best of our knowledge, no acetylated tailored products of erythromycin have been reported, and our results show that ORFL1 plays no role in the biosynthesis of erythromycin A. ORFL1 is followed by a gap of 169 bp of noncoding DNA until an att site is reached at nt 4470 to 4503, and then another 881 bp separates the att site from the start codon for ORFL2.
ORFL2, a precursor protein of 148 aa residues with a molecular weight of 15,869 and a pI of 9.48, showed significant sequence similarity to subtilisin inhibitor-like (SSIL) precursor proteins (proteinaceous inhibitors of proteolytic enzymes) from a wide variety of actinomycetes. The predicted mature protein of ORFL2 would have 112 aa, a molecular weight of 11,841, and a pI of 6.77. The strongest match for ORFL2 (40% identity over 242 aa) was to another SSIL protein previously identified in S. erythraea by Pereda et al. (24) as ORF4 (referred to in this report as ORFR4) in the eryK-flanking region of the ery gene cluster. Analysis with the BLAST program revealed that ORFL2 has significant homologies to many other SSIL precursor proteins from other actinomycetes. SSIL precursor proteins are widely distributed in nature; however, little is known about their biological role or significance. One SSIL protein-coding sequence from Streptomyces venezuelae has been used commercially to provide the signal sequence for the large-scale production of secreted tumor necrosis factor alpha (25).
ORFL2 (precursor form) is the reverse complement of nt 3144 to 3587. It begins with an ATG start codon and is followed by a 36-aa signal peptide, as determined by SignalP computer analysis (21). SignalP analysis of ORFR4, the subtilisin-like inhibitor from the study of Pereda et al. (24), shows that it also has a classic signal sequence. No ATG or GTG start codon was found close to the ORFR4 signal sequence, indicating that the gene is either not translated or not translated by the usual mechanisms. We also determined the nucleotide sequence for the 5' flanking region of ORFR4 and found it to be in agreement with the results of Pereda et al. (24). Our interpretation of ORFR4, if it is translated, indicates that an additional 6 aa would exist on the 5' end of the mature protein described by Pereda et al. (24), which would then be preceded by a 37-aa signal peptide. Another similarity between these two genes is the stem-loop structures following both of these ORFs. The stem-loop structure (nt 2543 to 2573) is farther away from the end of ORFL2 than it is from the end of ORFR4. A large gap of 673 bp follows the end of ORFL2 before the gap reaches the 3' end of ORFL3.
The predicted amino acid translation for ORFL3 contains 271 aa and has a molecular weight of 29,247 and a pI of 7.19. From the analysis with the BLASTP program, the sequences with the two highest degrees of homology to ORFL3 (nt 1654 to 2466) were for uncharacterized proteins: the putative S-adenosyl-L-methionine methyltransferase from Sinorhizobium meliloti, (38% identity over 231 aa; the protein with GenBank accession no. CAC49637) and the probable O-methyltransferase from Mesorhizobium loti (34% identity over 199 aa; the protein with GenBank accession no. q98g94). The analysis with the BLASTP program showed a domain in the predicted amino acid sequence of ORFL3 that is common to the UbiE and COQ5 family C-methyltransferases and a domain common to the PCMT family of O-methyltransferases.
O-Methyltransferases are known to play a role in antibiotic biosynthesis, either through the methylation of deoxysugars or through the polyketide portion of the structure (see reference 22 and references therein). One of the high-scoring ORFL3 homologs found by analysis with the BLASTP program was to a C-5-O-methyltransferase, the product of the aveD gene of the avermectin biosynthetic pathway in Streptomyces avermitilis (38% identity over 283 aa; GenBank accession no. JC6531 and PIR accession no. JC6531) (15). The protein encoded by aveD, in turn, shows significant homology to that encoded by eryG, the O-methyltransferase in the ery gene cluster (23, 33). Only one methyltransferase, the product of the eryG gene, has been found to be required for erythromycin A biosynthesis. Although ORFL3 is close to the ery gene cluster, it is not required for erythromycin A biosynthesis (see "Mutational analysis of eryCI-flanking region" below).
The deduced product of the ORFL4 gene, a protein of 293 aa residues with a molecular weight of 33,026 and the reverse complement of nt 92 to 970, shows a very high degree of sequence similarity to a probable transposase for IS1533 from Mycobacterium tuberculosis (57% identity over 264 aa; PIR accession no. D70668). ORFL4 has a homolog, ORFR6, in the eryK-flanking region. Interestingly, both transposases are located one ORF downstream of a subtilisin-like inhibitor and are transcribed in the same direction. The sequences of the ORFL4 and ORFR6 transposases, however, do not share a significant degree of homology to one another, and only the sequence of ORFR6 shows similarity to the transposase in the region between eryAI and eryAII (5).
Mutational analysis of eryCI-flanking region. Plasmid pFL2067 (Fig. 2) was constructed so that a 6.0-kb portion of the eryCI-flanking region encompassing the area between -1.5 and -7.5 kb (between nt 1925 and 7888) could be deleted. The deletion removed the 3' half of ORFL3, all of ORFL2 and ORFL1, and the 3-kb noncoding region next to eryCI (Fig. 1; see the grey bar labeled 6.0-kb deletion). Deletion strains FB50 and FL2067, for which the deletions were confirmed by PCR and Southern blot analysis (Fig. 2), were grown in shake flasks for erythromycin production (see Materials and Methods). The results of the agar plate bioassay (see Materials and Methods) revealed that the ability of the strains to make antibiotic was not noticeably affected, indicating that ORFs 1, 2, and 3 are not required for erythromycin production. Further analysis of fermentation broth extracts by thin-layer chromatography and HPLC showed that the bioactive material produced by the deletion mutants migrated and eluted identically to erythromycin A (data not shown).
In this study we also determined that ORFL4, the presumptive transposon homolog, would have been deleted in an earlier study (deletion +457; see Fig. 1 in reference 34). Since the +457 deletion was reported to have no effect on erythromycin production, we can now conclude that ORFL4 is also not required for erythromycin production.
| DISCUSSION |
|---|
|
|
|---|
The only information available regarding the eryCI-flanking region was presented in two reports that mapped erythromycin biosynthetic genes to this region by complementation analysis (30) and suggested the presence of pathway-specific regulatory genes through plasmid insertion experiments (12, 30). Since publication of those reports, no mutational analyses concerning this region have been reported and no DNA sequence information had been published or deposited.
The earlier results were not further developed, although any gene in this region that played a regulatory or structural role in erythromycin biosynthesis would have been considered part of the cluster. Also, there was significant potential commercial value in knowledge of this region. The potential was heightened because two genes predicted to play a role in the early steps of erythromycin deoxysugar biosynthesis were still unidentified (27, 28) and because no pathway-specific regulatory genes had yet been found for Em.
Ironically, we found that the erythromycin gene cluster does in fact end with eryCI and that there are no additional erythromycin biosynthetic or regulatory genes in the 8.1-kb eryCI-flanking region. Hence, references to eryCI as the boundary gene stand without need for correction. In this study it was shown that ORFs in this region code for enzymes that are not known to be required for erythromycin biosynthesis or resistance. Furthermore, deletion analysis showed that this region is not required for production of erythromycin.
Interestingly, pathway-specific regulatory genes have recently been reported for two other macrolide antibiotics; pikromycin has one regulator (35), and tylosin has five regulators (2). In each case it was reported that these regulatory genes are contiguous with or contained within their respective clusters. Therefore, the results reported here show that there are further differences between macrolide antibiotic gene clusters with respect to the number of regulatory genes that they contain.
In an earlier study dealing with this region, the eryD24 locus, which defined a putative regulatory gene controlling deoxysugar biosynthesis, was mapped by complementation analysis (30) immediately downstream of eryCI (Fig. 1). The evidence presented in this report shows clearly that a deoxysugar biosynthesis or regulatory gene is not located in or near this area.
Two other previous results are relevant to this report. They concern reports of significant strain improvement effects caused by plasmid insertions in this area of the chromosome (12, 30). When the first plasmid, plasmid pWHM14 (30), is inserted into the eryCI-flanking region, it was reported to cause significant overproduction of erythromycin (Fig. 1). The 1-kb cloned fragment (the Vara fragment) responsible for the insertion overlapped portions of ORFL1 and ORFL2 and the region between them. The second plasmid, plasmid pWHM55, contained a 5.3-kb fragment (the Hanel fragment) that was also reported to cause a significant increase in the level of erythromycin production. The Hanel fragment overlapped the complete eryCI gene, the 3-kb noncoding region next to eryCI, and all of ORFL1 (Fig. 1). The DNA sequence information reported here does not provide any further insights into the strain improvement effects reported earlier, for example, how the effects might be rationally explained or further exploited.
Some of the results from earlier studies that also dealt with the eryCI-flanking region were not further clarified by this study. For example, the eryB locus that Vara et al. (30) mapped downstream of eryCI was beyond the region analyzed in this study. If the mapping for the eryB locus done by Vara et al. (30) is correct, it will represent a rare case in which an essential antibiotic biosynthesis gene falls well outside the bounds of the antibiotic biosynthesis gene cluster. Additional DNA sequence analysis and mutational analysis (up to -20 kb) will be needed to answer this question.
Finally, the eryA34 mutation, originally genetically mapped downstream of eryCI (32), was not found within the 8.1-kb region analyzed in this study. It could conceivably fall within the uncharacterized region farther to the left, up to -22 kb, but since the only known structural eryA locus is now known to be on the other (right) side of ermE, this earlier conclusion, based on less definitive genetic mapping data, is likely to be incorrect.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Clin. Vaccine Immunol. | Clin. Microbiol. Rev. |
|---|---|
| J. Clin. Microbiol. | ALL ASM JOURNALS |