|
|
||||||||
Bioinformatics Laboratory, International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland
Reprint requests to: Janusz M. Bujnicki, Bioinformatics Laboratory, International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02109 Warsaw, Poland; e-mail: iamb{at}bioinfo.pl; fax: 48-22-668-5288.
(RECEIVED September 6, 2000; FINAL REVISION November 27, 2000; ACCEPTED December 7, 2000)
Article and publication are at www.proteinscience.org/cgi/doi/10.1110/ps.37101.
| Abstract |
|---|
|
|
|---|
/ß domains, the N-terminal domain (NTD) and the C-terminal domain (CTD) containing the RNase A-like active site. Comparison of the EndA coordinates with the publicly available protein structure database revealed the similarity of both domains to site-specific deoxyribonucleases: the NTD to the LAGLIDADG family and the CTD to the PD-(D/E)XK family. Superposition of the NTD on the catalytic domain of LAGLIDADG homing endonucleases allowed a suggestion to be made about which amino acid residues of the tRNA splicing nuclease might participate in formation of a presumptive cryptic deoxyribonuclease active site. On the other hand, the CTD and PD-(D/E)XK endonucleases, represented by restriction enzymes and a phage
exonuclease, were shown to share extensive similarities of the structural framework, to which entirely different active sites might be attached in two alternative locations. These findings suggest that EndA evolved from a fusion protein with at least two distinct endonuclease activities: the ribonuclease, which made it an essential "antitoxin" for the cells whose RNA genes were interrupted by introns, and the deoxyribonuclease, which provided the means for homing-like mobility. The residues of the noncatalytic CTDs from the positions corresponding to the catalytic side chains in PD-(D/E)XK deoxyribonucleases map to the surface at the opposite side to the tRNA binding site, for which no function has been implicated. Many restriction enzymes from the PD-(D/E)XK superfamily might have the potential to maintain an additional active or binding site at the face opposite the deoxyribonuclease active site, a property that can be utilized in protein engineering. Keywords: Protein evolution; endonuclease; homing; intron splicing; restriction-modification
| Introduction |
|---|
|
|
|---|
The tRNA splicing endonuclease cleaves the pre-tRNA substrate, leaving 5'-hydroxyl and 2',3'-cyclic phosphate termini. Then the exons are joined by a nucleoside triphosphate-dependent RNA ligase. Finally, the splice junction 2'-phosphate is removed by an NAD-dependent phosphotransferase (Abelson et al. 1998).
The crystal structure of the tRNA splicing endoribonuclease (RNase) EndA from Methanococcus jannaschii was recently determined (Li et al. 1998). The EndA monomer consists of two distinct
/ß domains: the N-terminal domain (NTD) takes the shape of a saddle formed by a five-stranded mixed ß-sheet flanked on one side by three
-helices. The C-terminal domain (CTD) folds into a topologically different five-stranded mixed ß-sheet cradled between two
-helices. Two pairs of subunits in the EndA tetramer are nonequivalent, suggesting that only two of the four proposed active sites are functional. Three amino acid residues, corresponding to Y115, H125, and K156 in M. jannaschii enzyme, are conserved in catalytically active EndA/Sen family members (Trotta et al. 1997). The corresponding side chains can be spatially superimposed with the catalytic triad of RNase A, an enzyme that is unrelated to EndA, although it seems to carry out a mechanistically similar reaction, arguing for a case of convergent evolution (Li et al. 1998).
LAGLIDADG and PD-(D/E)XK are mutually unrelated families of deoxyribonucleases (DNases), of which biological functions have been regarded as overlapping to only a limited degree, and which show remarkable dissimilarities of function at the molecular level (Belfort and Roberts 1997; Aggarwal and Wah 1998). The LAGLIDADG family comprises relatively closely related proteins, of which some are composed of two tandemly fused versions of the conserved domain (Dalgaard et al. 1997). In contrast, the PD-(D/E)XK superfamily groups together extremely diverged enzymes and exhibit strong variations in sequences, structures, and biological functions (Bujnicki 2000).
Traditionally, the LAGLIDADG family has been grouped together with functionally similar, although evolutionarily unrelated HNH/His-Cys (vel ßß
Me) and GIYYIG families, and juxtaposed against the largest and best characterized functional division of the PD-(D/E)XK superfamily, namely type II restriction endonucleases (ENases) (Belfort and Roberts 1997; Jurica and Stoddard 1999). In the presence of divalent metal ions, all the abovementioned enzymes perform a site-specific cleavage of phosphodiester bonds in two strands of DNA at fixed locations within a specific nucleotide sequence, yielding 5'-phosphate and 3'-hydroxyl termini. In contrast to type II restriction enzymes, which recognize 48 base pair palindromic sequences (Pingoud and Jeltsch 1997), the homing ENases recognize extended sequences of up to 20 bp, which are generally nonpalindromic (Jurica and Stoddard 1999). This is indicative of their different molecular associations: The homing ENases are usually encoded by introns and inteins, and by making a double-strand (ds) break in the intronless (or inteinless) allele, they promote a gene conversion process that duplicates the intron or intein in the so-called homing event. This event is analogous to the transposition of a transposon, in which, however, in contrast to the transposase, the homing ENase does not interact with its parental DNA, but acts only at the target site (Jurica and Stoddard 1999). Homing ENase genes abound in all three domains of life, with the largest number encoded in organellar genomes. These elements are highly invasive. It is believed that they colonized self-splicing introns and inteins (genetically silent loci, whose disruption is not usually dangerous for the cell), providing them with the potential for genetic mobility (Belfort et al. 1995; Derbyshire et al. 1997).
Traditionally, restriction enzymes have been implicated in the defense against invading genetic elements like plasmids or phages because they might function as weapons-destroying foreign DNA by cleaving it at multiple sites (Bickle and Kruger 1993). They are usually accompanied by DNA methyltransferases (MTases) of similar specificity, which methylate the DNA of the host, thereby rendering it resistant to the cleavage (Wilson and Murray 1991). Although these ENase-MTase pairs (termed restriction-modification or RM systems) might be advantageous to their Prokaryotic hosts, there is growing evidence that many of them exhibit "selfish" behavior (Kusano et al. 1995). RM systems commonly undergo horizontal transfer using plasmids, phages, and other mobile genetic elements as carriers (Jeltsch and Pingoud 1996) and act as toxin-antitoxin systems to become fixed in the new hosts (Kobayashi et al. 1999). It is worth emphasizing that the ds breaks introduced in the unmodified DNA by the restriction enzyme (for example, in the yet not fully methylated chromosome of the new host) might serve a recombinogenic role similar to that of the ds breaks introduced by the intron-encoded homing ENase, promoting integration of the DNA encoding the selfish element into the cut site at some frequency (Eddy and Gold 1992).
| Results: Sequence and structure analysis |
|---|
|
|
|---|
Both algorithms recognized structural similarity of the EndA CTD (residues 83179) to various PD-(D/E)XK enzymes, including the phage
exonuclease (
-exo, PDB entry: 1avq) and the cleavage domain of the FokI restriction enzyme (2fok) among the best hits reported (VAST score 11; DALI Z-score 5.0; rmsd for the fit of the C-
atoms is 2.9 å over 78 residues for
-exo and VAST score 10.1; DALI Z-score 5.1; rmsd 2.6 å over 66 residues for FokI). The NTD (residues 982) showed similarity to the LAGLIDADG enzymes, with I-CreI (1af5) reported as the most similar structure by both methods (VAST score 9.7; DALI Z-score 3.1; rmsd 2.2 å over 42 residues). To the best of our knowledge, these similarities have not been analyzed previously.
In the EndA-NTD, the core of the open curved ß-sheet structure used by I-CreI and presumably all LAGLIDADG ENase as the DNA-binding interface (Jurica et al. 1998), is perfectly conserved (Fig. 1A
). However, the NTD of EndA does not have the long variable loops that extend the recognition sites of the homing ENases. This suggests that the putative DNA-binding site, even if it were present in EndA, would be minimal compared to the typical LAGLIDADG ENases, unless the second domain contributed to the binding surface, as has been shown for the protein splicing domain of PI-SceI intein (Duan et al. 1997; He et al. 1998). Considering the alpha helices of EndA-NTD that shield the opposite face of the ß-sheet, the orientations of
1 and
2 (nomenclature adapted from Li et al. 1998) are somewhat different from their equivalents in LAGLIDADG Enases.
3 has a spatial counterpart only in I-CreI, so the reported similarity might be dismissed as a chance convergence on a small folding motif. However, close examination of the structures reveals that EndA-NTD possesses a triad of residues (Thr11, Asp23, and Lys62), which superimposes quite well with the catalytic triads of dimeric I-CreI (Asp20, Gln47, and Lys98) (Jurica et al. 1998) and pseudodimeric PI-SceI (Asp218, Asp229, and Lys301; and Asp326, Thr341, and Lys403) (Duan et al. 1997; Christ et al. 1999). The only exception is Asp229 in one of the two PI-SceI active sites. We suspect that this residue takes part in the catalysis using Ser227 as a part of a charge relay system (Fig. 1A
). It is therefore possible that EndA possesses cryptic ENase activity. However, it is not obvious whether (and how) the NTD could dimerize to put the two intramolecular putative active sites together. Remarkably, the counterpart of the LAGLIDADG motif, which forms a dimerization interface in the structurally characterized family members (Heath et al. 1997), and which is not conserved in the tRNA splicing enzymes, is partially disordered in the crystal structure of EndA (Li et al. 1998). It would be interesting to determine if EndA (or the separately expressed NTD) binds DNA and exhibits any kind of DNase activity, and if it shows any sequence specificity or preference, because it might indicate that the function of the NTD is (or was before it degenerated) to mobilize the EndA gene or the intron itself.
|
The closest structural similarities of EndA-CTD are to
-exo, which represents an outgroup, branching early in the evolutionary history of the PD-(D/E)XK ENase superfamily, and to FokI, which is also believed to be quite ancient (Bujnicki 2000), suggesting that the EndA family is very old. In contrast, whereas the PD-(D/E)XK ENases are highly divergent at the sequence and structural levels, the sequences of EndA family members are relatively similar to one another (Trotta et al. 1997), suggesting that their divergence began relatively late. The question of mutual relationship of the two nuclease families with alternative active sites must wait a more extensive phylogenetic study of all proteins assuming the PD-(D/E)XK fold, which would considerably exceed the limits of this paper.
| Discussion |
|---|
|
|
|---|
Association of homing and splicing activities within a single protein has precedent. It has been shown in vitro and in vivo that several LAGLIDADG DNases encoded by self-splicing group I introns also promote the splicing process as maturases (Schafer et al. 1994), and that both activities are closely associated (Ho et al. 1997). Much is known about the structure/function relationships in the LAGLIDADG family, but maturase activity is relatively poorly studied. Group II introns also encode proteins with multiple activities, including reverse transcriptase, DNase, and maturase (Moran et al. 1992). Therefore, it is possible that the EndA protein, which is composed of domains implicated in a variety of nucleolytic and nucleic acid binding activities in more or less selfish contexts, might also exert functions other than tRNA splicing, and are probably connected with genetic mobility.
The residues of the noncatalytic CTDs from the positions corresponding to the catalytic side chains in PD-(D/E)XK ENases map to the surface on the side opposite to the tRNA binding site, for which no function has yet been implicated. This raises an intriguing possibility that historically, these sites participated in some kind of DNase activity not necessarily connected with the earlier proposed homing-like function of the NTD. Analogous to our suggestion that the EndA protein might possess several cryptic activities, it can be concluded that many restriction enzymes from the PD-(D/E)XK superfamily, which have a three-dimensional fold essentially identical to that of EndA, might have the potential to develop an additional active or binding site on the face opposite the ENase active site.
Because the evolution of the sequence specificity of restriction enzymes is believed to be strongly influenced by their potential to develop different quaternary structures (Newman et al. 1998), the possibility of introducing new sites for intermolecular or intramolecular interactions at their surface might be utilized in protein engineering.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Aggarwal, A.K. and Wah, D.A. 1998. Novel site-specific DNA endonucleases. Curr. Opin. Struct. Biol. 8: 1925.[CrossRef][Medline]
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 33893402.
Belfort, M. and Roberts, R.J. 1997. Homing endonucleases: Keeping the house in order. Nucleic Acids Res. 25: 33793388.
Belfort, M., Reaban, M.E., Coetzee, T., and Dalgaard, J.Z. 1995. Prokaryotic introns and inteins: A panoply of form and function. J. Bacteriol. 177: 38973903.
Bernstein, F.C., Koetzle, T.F., Williams, G.J., Meyer, E.F., Jr., Brice, M.D., Rodgers, J.R., Kennard, O., Shimanouchi, T., and Tasumi, M. 1977. The Protein Data Bank: A computer-based archival file for macromolecular structures. J. Mol. Biol. 112: 535542.[Medline]
Bickle, T.A. and Kruger, D.H. 1993. Biology of DNA restriction. Microbiol. Rev. 57: 434450.
Bujnicki, J.M. 2000. Phylogeny of the restriction endonuclease-like superfamily inferred from comparison of protein structures. J. Mol. Evol. 50: 3944.[Medline]
Cech, T.R. 1990. Self-splicing of group I introns. Annu. Rev. Biochem. 59: 543568.[CrossRef][Medline]
Christ, F., Schoettler, S., Wende, W., Steuer, S., Pingoud, A., and Pingoud, V. 1999. The monomeric homing endonuclease PI-SceI has two catalytic centres for cleavage of the two strands of its DNA substrate. EMBO J. 18: 69086916.[CrossRef][Medline]
Dalgaard, J.Z., Garrett, R.A., and Belfort, M. 1993. A site-specific endonuclease encoded by a typical Archaeal intron. Proc. Natl. Acad. Sci. USA 90: 54145417.
Dalgaard, J.Z., Klar, A.J., Moser, M.J., Holley, W.R., Chatterjee, A., and Mian, I.S. 1997. Statistical modeling and analysis of the LAGLIDADG family of site-specific endonucleases and identification of an intein that encodes a site-specific endonuclease of the HNH family. Nucleic Acids Res. 25: 46264638.
Derbyshire, V., Wood, D.W., Wu, W., Dansereau, J.T., Dalgaard, J.Z., and Belfort, M. 1997. Genetic definition of a protein-splicing domain: Functional mini-inteins support structure predictions and a model for intein evolution. Proc. Natl. Acad. Sci. USA 94: 1146611471.
Duan, X., Gimble, F.S., and Quiocho, F.A. 1997. Crystal structure of PI-SceI, a homing endonuclease with protein splicing activity. Cell 89: 555564.[CrossRef][Medline]
Eddy, S.R. and Gold, L. 1992. Artificial mobile DNA element constructed from the EcoRI endonuclease gene. Proc. Natl. Acad. Sci. USA 89: 15441547.
Gibrat, J.F., Madej, T., and Bryant, S.H. 1996. Surprising similarities in structure comparison. Curr. Opin. Struct. Biol. 6: 377385.[CrossRef][Medline]
He, Z., Crist, M., Yen, H., Duan, X., Quiocho, F.A., and Gimble, F.S. 1998. Amino acid residues in both the protein splicing and endonuclease domains of the PI-SceI intein mediate DNA binding. J. Biol. Chem. 273: 46074615.
Heath, P.J., Stephens, K.M., Monnat, R.J.J., and Stoddard, B.L. 1997. The structure of I-Crel, a group I intron-encoded homing endonuclease. Nat. Struct. Biol. 4: 468476.[CrossRef][Medline]
Ho, Y., Kim, S.J., and Waring, R.B. 1997. A protein encoded by a group I intron in Aspergillus nidulans directly assists RNA splicing and is a DNA endonuclease. Proc. Natl. Acad. Sci. USA 94: 89948999.
Holm, L. and Sander, C. 1993. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233: 123138.[CrossRef][Medline]
Jeltsch, A. and Pingoud, A. 1996. Horizontal gene transfer contributes to the wide distribution and evolution of type II restriction-modification systems. J. Mol. Evol. 42: 9196.[CrossRef][Medline]
Jurica, M.S. and Stoddard, B.L. 1999. Homing endonucleases: Structure, function and evolution. Cell Mol. Life Sci. 55: 13041326.[CrossRef][Medline]
Jurica, M.S., Monnat, R.J.J., and Stoddard, B.L. 1998. DNA recognition and cleavage by the LAGLIDADG homing endonuclease I-CreI. Mol. Cell 2: 469476.[CrossRef][Medline]
Kobayashi, I., Nobusato, A., Kobayashi-Takahashi, N., and Uchiyama, I. 1999. Shaping the genome-restriction-modification systems as mobile genetic elements. Curr. Opin. Genet. Dev. 9: 649656.[CrossRef][Medline]
Kusano, K., Naito, T., Handa, N., and Kobayashi, I. 1995. Restriction-modification systems as genomic parasites in competition for specific sequences. Proc. Natl. Acad. Sci. USA 92: 1109511099.
Li, H., Trotta, C.R., and Abelson, J. 1998. Crystal structure and evolution of a transfer RNA splicing enzyme. Science 280: 279284.
Lykke-Andersen, J., Aagaard, C., Semionenkov, M., and Garrett, R.A. 1997. Archaeal introns: Splicing, intercellular mobility and evolution. Trends Biochem. Sci. 22: 326331.[CrossRef][Medline]
Michel, F. and Ferat, J.L. 1995. Structure and activities of group II introns. Annu. Rev. Biochem. 64: 435461.[CrossRef][Medline]
Moran, J.V., Wernette, C.M., Mecklenburg, K.L., Butow, R.A., and Perlman, P.S. 1992. Intron 5 alpha of the COXI gene of yeast mitochondrial DNA is a mobile group I intron. Nucleic Acids Res. 20: 40694076.
Newman, A. 1997. RNA splicing: Out of the loop. Curr. Biol. 7: R418R420.[CrossRef][Medline]
Newman, M., Lunnen, K., Wilson, G., Greci, J., Schildkraut, I., and Phillips, S.E. 1998. Crystal structure of restriction endonuclease BglI bound to its interrupted DNA recognition sequence. EMBO J. 17: 54665476.[CrossRef][Medline]
Phizicky, E.M. and Greer, C.L. 1993. Pre-tRNA splicing: Variation on a theme or exception to the rule? Trends Biochem. Sci. 18: 3134.[CrossRef][Medline]
Pingoud, A. and Jeltsch, A. 1997. Recognition and cleavage of DNA by type-II restriction endonucleases. Eur. J. Biochem. 246: 122.[Medline]
Schafer, B., Wilde, B., Massardo, D.R., Manna, F., Del Giudice, L., and Wolf, K. 1994. A mitochondrial group-I intron in fission yeast encodes a maturase and is mobile in crosses. Curr. Genet. 25: 336341.[CrossRef][Medline]
Trotta, C.R., Miao, F., Arn, E.A., Stevens, S.W., Ho, C.K., Rauhut, R., and Abelson, J.N. 1997. The yeast tRNA splicing endonuclease: A tetrameric enzyme with two active site subunits homologous with the archaeal tRNA endonucleases. Cell 89: 849858.[CrossRef][Medline]
Wilson, G.G. and Murray, N.E. 1991. Restriction and modification systems. Annu. Rev. Genet. 25: 585627.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
J. M. Bujnicki, P. Rotkiewicz, A. Kolinski, and L. Rychlewski Three-dimensional modeling of the I-TevI homing endonuclease catalytic domain, a GIY-YIG superfamily member, using NMR restraints and Monte Carlo dynamics Protein Eng. Des. Sel., October 1, 2001; 14(10): 717 - 721. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |