|
|
||||||||
1 Division of Structural Biology and Biochemistry, Research Institute, Hospital for Sick Children, Toronto M5G 1X8, Ontario, Canada
2 Bioinformatics Supercomputing Centre, Research Institute, Hospital for Sick Children, Toronto M5G 1X8, Ontario, Canada
3 Department of Biochemistry, University of Toronto, Toronto M5S 1A8, Ontario, Canada
Reprint requests to: Dr. Charles M. Deber, Structural Biology and Biochemistry, Research Institute, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada; e-mail: deber{at}sickkids. on.ca; fax (416) 813-5005.
(RECEIVED July 18, 2000; FINAL REVISION October 2, 2000; ACCEPTED October 6, 2000)
Article and publication are at www.proteinscience.org/cgi/doi/10.1110/ps.30301.
| Abstract |
|---|
|
|
|---|
Keywords: TM Finder; transmembrane segments; threshold hydrophobicity; nonpolar phase helicity; membrane protein prediction
| Introduction |
|---|
|
|
|---|
In the absence of high-resolution structural data, the approximate positions of the membrane-spanning segments within a TM domain can, in principle, be proposed from the amino acid sequence alone with the aid of hydropathy plots (Kyte and Doolittle 1982), which select TM segments based on a moving average of segment hydrophobicity. The development of such hydrophobicity scales traditionally has been based on a combination of statistical analysis (Munoz and Serrano 1994), and the partitioning properties of individual amino acids both as monomers and in globular proteins (Kyte and Doolittle 1982), and in short hydrophilic peptides (Hodges et al. 1994). We have previously reported the experimental determination of apparent hydrophobicity using a series of TM-mimetic model peptides of prototypic sequence KKAAAXAAAAAXAAWAAXAAAKKKK-amide (the underlined region represents the hydrophobic core of sufficient length to span a lipid bilayer, and the guest residues X = each of the 20 common amino acids) (Liu and Deber 1998a). This work led to a hydrophobicity scale for the 20 X-residues based on the individual high-pressure liquid chromatography (HPLC) retention time measurements of the model peptides. When corresponding peptide secondary structures were characterized in aqueous versus micellar environments, peptides exhibited random or partially helical structures in water, but adopted X-residuedependent full helical structure only upon integration into the micellar membrane. These experiments revealed the existence of a threshold hydrophobicity approximately equivalent to that of a strand of poly-alanine, which, once met or exceeded, dictates the spontaneous insertion of the peptides into micelles. When hydropobicities of natural protein sequences in SWISS-PROT and TMbase were examined using the HPLC-based scale, and the threshold applied, it was found that close to 97% of protein TM segments have hydrophobicity above the threshold value, while nearly 80% of non-TM helices (>19 residues, derived from soluble proteins) fail to meet the same minimum requirement (Liu and Deber 1998a). Therefore, threshold hydrophobicity is an attribute that can be used to distinguish TM segments from helices in soluble proteins.
While transmembrane segments generally are
-helical, peptide helicity itself is influenced fundamentally by molecular environment, and accordingly, designations of particular residues as helix formers or breakers will differ significantly in membrane versus soluble protein domains. This circumstance is epitomized by the fact that ß-branched residues (Val, Ile, Thr) and Gly residues predicted to disfavor
-helices by algorithms evolved from statistical analysis of residues in soluble proteins (Chou and Fasman 1978) among them can account for
40% of TM segment amino acid composition (Li and Deber 1994). Thus, a helicity scale developed from globular proteins cannot be applied to membrane proteins. We addressed this issue by using circular dichroism (CD) measurements to examine the series of TM-mimetic model peptides in a nonpolar organic phase (n-butanol), from which a scale of relative helical propensity for the 20 amino acids was constructed (Liu and Deber 1998b). When this scale was applied to protein databases, we observed a clear segregation between TM and non-TM helices; for example, a helicity value set at midscale selects correctly for 98% of the TM helices in a large database (Wang et al. 1999). These results suggested that in addition to the threshold hydrophobicity, there also exists a de facto minimum residue-dependent helicity requirement, which will promote and stabilize the folding of the nascent polypeptide chain within the membrane environment.
We have developed the program TM Finder, which automates and implements the concept of dual threshold hydrophobicity and nonpolar phase helicity requirements to locate protein TM segments from primary sequences alone. Here, we present the algorithm used for the TM Finder program, and discuss the refinements made to optimize the program for various protein sequences.
| Construction of the TM Finder program |
|---|
|
|
|---|
![]() |
t represent hydrophobicity and HPLC retention time, respectively. An exception was made for Cys, which was synthesized with only one Cys in the middle ``X,'' and two Leu residues at the other two guest positions, to avoid excessive disulfide formation. the hydropathy of Cys then was calculated according to
![]() |
|
![]() |
= 1 corresponds to -30,000 deg cm2/dmol, a typical helicity value for a protein to adopt
50%
-helical conformation in a nonpolar environment (Liu and Deber 1998b; Parker et al. 1992). This helicity scale, with relative residue rankings is given in Table 1
TM Finder parameters
The original TM Finder program was prototyped using Microsoft Visual Basic and Excel. A web-enabled version of the program written in C and perl is available free to all users at http://www.bioinformatics-canada.org/TM/. Figure 1
illustrates the operation of the web version of the TM Finder program. The program accepts protein sequences in SWISS-PROT format, and then outputs TM segments based on the combined prediction from hydrophobicity and helicity. Hydrophobicity and helicity values are assigned to each amino acid, and then a sliding window is applied to calculate the moving average. TM segments are selected on the basis of residues, which exceed both the imposed threshold hydrophobicity and helicity levels. A gap joining operation (vide infra) was implemented to ligate segments within short breaks from the main segments. Users may adjust five parameters depending on the size and nature of the protein to produce the best prediction. These five parameters are: N-terminal window size, C-terminal window size, minimum core length, closed-gap length, and minimum segment length. The default settings of these parameters are values obtained from optimizing the five parameters against a collection of solved membrane protein structures (training sets) which are collected on the TM Finder website. It is emphasized that the program is designed to evaluate proteins that contain transmembrane
-helices; membrane proteins such as the porins, which contain transmembrane ß-strands as components of ß-barrel structures, should be treated through algorithms developed specifically from this protein subset (Liu and Deber 1998a).
|
| Results and discussion |
|---|
|
|
|---|
To a significant extent, hydrophobicity rankings of the 20 residues (Table 1
, column 1) parallel helicity rankings (column 2). Inspection of the rankings reveals that five of the top six helix-promoting residues (F, L, I, M, and V) also are among the top six hydrophobes in the Liu-Deber scale. Yet a few residues do move drastically in their relative rankings, for example, the rankings of Gly and Trp in hydrophobicity (16th and 2nd, respectively) interchange versus their nonpolar phase helicity rankings (7th and 10th, respectively). As well, Ala is relatively less hydrophobic than it is a helix former in nonpolar phases. These circumstances suggest that hydrophobicity and helicity should be considered as uncoupled entities, and therefore the outcome of the prediction should improve when the two factors work in concert.
Gap-joining operation: Elimination of window size bias
Strict conformity to threshold hydrophobicity and helicity values can produce a lengthy list of predicted segments including many spikes. With the application of a small window size, we noted that the peaks tended to become narrow and break into several regions within one TM segment; such apparent shorter segments may arise in regions of local hydrophilicity (common to membrane transport proteins). In light of these observations, we modified TM Finder in the following manner. First, a variable m (usually set at 10) was set up, which allows the elimination of small segments with length m AA; and second, to resolve the problem of fragmentation when smaller window sizes are used, the program allows the fusion of neighboring smaller segments to a larger hydrophobic nucleus.
The importance of the gap-joining operation is demonstrated in Table 2
. Cytochrome C oxidase had been chosen as the example because its crystal structure (Tsukihara et al. 1996) provides a good reference to determine the accuracy of the TM Finder prediction. As seen from Table 2
, without the gap-joining operation (gap size = 0), two to three segments will be missed from the output. In addition, the prediction accuracy is highly dependent on the size of the sliding window chosen, i.e., the prediction accuracy is raised 7% simply by increasing the window size from 11 AA to 19 AA. With the application of gap size = 2 AA, all 12 segments of the protein are identified by the program, the prediction accuracy is improved by 6%12%, and essentially the same level of accuracy is achieved regardless of the choice of window size. However, it must be noted that overuse of gap-joining operations can produce fused segments and result in reduced accuracy, as demonstrated in the case of gap size = 4 AA, and window size of 11 AA. The corrective action for fused segments is to reduce the gap size while at the same time increase the window size; when a larger window size is chosen, the resulting segments are more concentrated and have better separation. The eventual optimized choice (3 AA gap, 15 AA window) between gap size and window size that would allow the prediction of well-separated and reasonably sized segments is the choice that produces the best accuracy (81%).
|
|
-helix could readily traverse at both N- and C-termini, it remains problematical to designate the precise residue membrane entry/exit points of a given TM segment. Although the helicity requirement used in the current work helps to refine prediction of entry/exit points (Liu and Deber 1999), it may ultimately become useful to distinguish between a core TM helix in the hydrocarbon phase, and a full TM helix, with the latter defined in terms of inclusion of the adjacent (usually hydrophilic) N- and C-capping helical regions.
Globular protein database used to refine the prediction
In order to further codify the power of TM Finder to distinguish between membrane and nonmembrane proteins, we constructed a database composed of helices from globular proteins and from aqueous-based domains of membrane proteins, restricting the analysis to segments of comparable length to TM segments. Application of the threshold hydrophobicity criterion to this latter database of non-TM helices produced a few outliers containing 34 charged residues, but for which hydropathy is raised to just above the threshold value because of (1) an increased occurrence of aromatic residues (Tyr, Phe, Trp); and (2) an increased amount of Leu residues as compared to the majority of globular protein helices (i.e., those below the threshold). We have termed such helices
-regions (Wang et al. 2000), which we speculate may occur as helices that function to bridge core-to-surface regions in globular protein structures. To eliminate false positives from this source, TM Finder has been programmed to flag potential TM segments, which contain three or more charged residues per 19 AA's.
Application of TM Finder to databases of membrane and nonmembrane proteins
To assess the generic ability of TM Finder to locate TM segments of membrane proteins, we constructed two databases. We first applied the program to a database of 21 protein subunits (containing 75 reported TM segments) in a set of membrane proteins considered to be solved to high resolution (Table 3
) (see references herein and the TM Finder website). Recognizing that segment entry/exit points will vary from the specific positions reported from crystallography (data not shown), TM Finder correctly locates 67 of 75 TM segments, while mispredicting three segments, for a predictive value of a positive test of 67/70 = 96%. The segments underpredicted by TM Finder likely result from its stringency in that both hydrophobicity and helicity profiles must exceed the threshold for a positive prediction; thus, the program will occasionally bypass a Gly-rich and/or hydrophilic residue-rich (e.g., His) TM segment for which it computes insufficient hydrophobicity (example from Table 3
: TM 3 of photosystem I [PsaA], residues 197221: LNHHLAGLLGLGSLAWAGHQIHVSL). Comparison with three other hydropathy scales (Table 3
) indicates roughly comparable performance for TM segment prediction in membrane proteins, viz., the KD scale correctly locates 70 segments while mispredicting eight (predictive value = 90%); the GES scale found 66 while mispredicting five (93%), and the Eisenberg scale found 73 while mispredicting 15 (83%).
|
|
|
| Conclusion |
|---|
|
|
|---|
90% of TM segments from the training set of crystallized membrane proteins containing transmembrane
-helices, with a predictive value of 96%. Applied to membrane proteins not yet solved to high resolution, TM Finder constitutes a useful screening operation for identifying their membrane-based segments. The program also has the power to distinguish essentially unequivocally between membrane and nonmembrane proteins, e.g., in a representative database of soluble proteins, no protein (with one minor exception) is mispredicted as a membrane protein. This latter feature of TM Finder would be of particular value, for example, in functional genomics research, where open reading frames present may code for as-yet-uncharacterized proteins.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked ``advertisement'' in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Deisenhofer, J., Epp, O., Miki, K., Huber, R., and Michel, H. 1984. X-ray structure analysis of a membrane protein complex. Electron density map at 3 Å resolution and a model of the chromophores of the photosynthetic reaction center from R. viridis. J. Mol. Biol. 180: 385398.
Doyle, D.A., Cabral, J.M., Pfuetzner, R.A., Kuo, A., Gulbis, J.M., Cohen, S.L., Chait, B.T., and MacKinnon, R. 1998. The structure of the potassium channel: Molecular basis of K+ conduction and selectivity. Science 280: 6977.
Eisenberg, D., Schwarz, E., Komaromy, M., and Wall, R. 1984. Analysis of membrane and surface protein sequences with the hydrophobic moment plot. J. Mol. Biol. 179: 125142.[CrossRef][Medline]
Engelman, D.M., Steitz, T.A., and Goldman, A. 1986. Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. Annu. Rev. Biophys. Biophys. Chem. 15: 321353.[CrossRef][Medline]
Fazio, M.J., Olsen, D.R., Kauh, E.A., Baldwin, C.T., Indik, Z., Ornstein-Goldstein, N., Yeh, H., Rosenbloom, J., and Uitto, J. 1988. Cloning of full-length elastin cDNAs from a human skin fibroblast recombinant cDNA library: further elucidation of alternative splicing utilizing exon-specific oligonucleotides. J. Invest. Dermatol. 91: 458464.[CrossRef][Medline]
Henderson, R., Baldwin, J.M., Ceska, T.A., Zemlin, F., Beckmann, E., and Downing, K.H. 1990. Model for the structure of bacteriorhodopsin based on high-resolution electron cryo-microscopy. J. Mol. Biol. 213: 899929.[Medline]
Hodges, R.S., Zhu, B.Y., Zhou, N.E., and Mant, C.T. 1994 Reversed-phase liquid chromatography as a useful probe of hydrophobic interactions involved in protein folding and protein stability. J. Chromatography 676: 315.
Kyte, J. and Doolittle, R.F. 1982. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157: 105132.[CrossRef][Medline]
Li, S.C. and Deber, C.M. 1994. A measure of helical propensity for amino acids in membrane environments. Nat. Struct. Biol. 1: 368373.[CrossRef][Medline]
Liu, L.P. and Deber, C.M. 1998a. Guidelines for membrane protein engineering derived from de novo designed model peptides. Biopolymers 47: 4162.[CrossRef][Medline]
Liu, L.P. and Deber, C.M. 1998b. Uncoupling hydrophobicity and helicity in transmembrane segments:
-helical propensities of the amino acids in non-polar environments. J. Biol. Chem. 273: 2364523648.
Liu, L.P. and Deber, C.M. 1999. Combining hydrophobicity and helicity: A novel approach to membrane protein structure prediction. Bioorg. Med. Chem. 7: 17.[CrossRef][Medline]
Munoz, V. and Serrano, L. 1994. Intrinsic secondary structure propensities of the amino acids, using statistical phi-psi matrices: Comparison with experimental scales. Proteins 20: 301311.[CrossRef][Medline]
Papavoine, C.H., Christiaans, B.E., Folmer, R.H., Konigs, R.N., and Hilbers, C.W. (1998). Solution structure of the M13 major coat protein in detergent micelles: A basis for a model of phage assembly involving specific residues. J. Mol. Biol. 282: 401419.[CrossRef][Medline]
Park, K., Perczel, A., and Fasman, G.D. 1992. Differentiation between transmembrane helices and peripheral helices by the deconvolution of circular dichroism spectra of membrane proteins. Protein Sci. 1: 10321049.[Abstract]
Stowell, M.H., McPhillips, T.M., Rees, D.C., Soltis, S.M., Abresch, E., and Feher, G. 1997. Light-induced structural changes in photosynthetic reaction center: Implications for mechanism of electron-proton transfer. Science 276: 812816.
Tsukihara, T., Aoyama, H., Yamashita, E., Tomizaki, T., Yamaguchi, H., Shinzawa-Itoh, K., Nakashima, R., Yaono, R., and Yoshikawa, S. 1996. The whole structure of the 13-subunit oxidized cytochrome C oxidase at 2.8 Å. Science 272: 11361144.[Abstract]
Wang, C., Liu, L.P., and Deber, C.M. 1999. Helicity of hydrophobic peptides in polar vs. non-polar environments. Phys. Chem. Chem. Phys. 1: 15391542.[CrossRef]
Wang, C., Liu, L.P., and Deber, C.M. 2000.
-Regions in proteins: Helices mispredicted as transmembrane segments by the threshold hydrophobicity requirement. Proc. 16th Amer. Peptide Symp. (Fields, G.B., Tam, J.P., and Barany, G., eds.), pp. 367369.
Wimley, W.C . and White, S.M. 1996. Experimentally determined hydrophobicity scale for proteins at membrane interfaces. Nat. Struct. Biol. 3: 842848.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
G. Zhao and E. London An amino acid "transmembrane tendency" scale that approaches the theoretical limit to accuracy for prediction of transmembrane helices: Relationship to biological hydrophobicity. Protein Sci., August 1, 2006; 15(8): 1987 - 2001. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Rath, R. A. Melnyk, and C. M. Deber Evidence for Assembly of Small Multidrug Resistance Proteins by a "Two-faced" Transmembrane Helix J. Biol. Chem., June 2, 2006; 281(22): 15546 - 15553. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Aisenbrey, R. Kinder, E. Goormaghtigh, J.-M. Ruysschaert, and B. Bechinger Interactions Involved in the Realignment of Membrane-associated Helices: AN INVESTIGATION USING ORIENTED SOLID-STATE NMR AND ATTENUATED TOTAL REFLECTION FOURIER TRANSFORM INFRARED SPECTROSCOPIES J. Biol. Chem., March 24, 2006; 281(12): 7708 - 7716. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. N. Amin, B. L. Taylor, and M. S. Johnson Topology and Boundaries of the Aerotaxis Receptor Aer in the Membrane of Escherichia coli J. Bacteriol., February 1, 2006; 188(3): 894 - 901. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Glukhov, M. Stark, L. L. Burrows, and C. M. Deber Basis for Selectivity of Cationic Antimicrobial Peptides for Bacterial Versus Mammalian Membranes J. Biol. Chem., October 7, 2005; 280(40): 33960 - 33967. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Cuthbertson, D. A. Doyle, and M. S.P. Sansom Transmembrane helix prediction: a comparative evaluation and analysis Protein Eng. Des. Sel., June 1, 2005; 18(6): 295 - 308. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Choi, A. W. Partridge, C. Daniels, K. Du, G. L. Lukacs, and C. M. Deber Destabilization of the Transmembrane Domain Induces Misfolding in a Phenotypic Mutant of Cystic Fibrosis Transmembrane Conductance Regulator J. Biol. Chem., February 11, 2005; 280(6): 4968 - 4974. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Mathews, F. Wang, D. J. Eide, and M. Van Doren Drosophila fear of intimacy Encodes a Zrt/IRT-like Protein (ZIP) Family Zinc Transporter Functionally Related to Mammalian ZIP Proteins J. Biol. Chem., January 7, 2005; 280(1): 787 - 795. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Chan, L. L. Burrows, and C. M. Deber Helix Induction in Antimicrobial Peptides by Alginate in Biofilms J. Biol. Chem., September 10, 2004; 279(37): 38749 - 38754. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhou and Y. Zhou Predicting the topology of transmembrane helical proteins using mean burial propensity and a hidden-Markov-model-based method Protein Sci., July 1, 2003; 12(7): 1547 - 1555. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. W. Partridge, R. A. Melnyk, D. Yang, J. U. Bowie, and C. M. Deber A Transmembrane Segment Mimic Derived from Escherichia coli Diacylglycerol Kinase Inhibits Protein Activity J. Biol. Chem., June 6, 2003; 278(24): 22056 - 22060. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Stark, L.-P. Liu, and C. M. Deber Cationic Hydrophobic Peptides with Antimicrobial Activity Antimicrob. Agents Chemother., November 1, 2002; 46(11): 3585 - 3590. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Bertaccini and J.R. Trudell Predicting the transmembrane secondary structure of ligand-gated ion channels Protein Eng. Des. Sel., June 1, 2002; 15(6): 443 - 453. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Ott and V. R. Lingappa Integral membrane protein biosynthesis: why topology is hard to predict J. Cell Sci., May 15, 2002; 115(10): 2003 - 2009. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. White, A. S. Ladokhin, S. Jayasinghe, and K. Hristova How Membranes Shape Protein Structure J. Biol. Chem., August 24, 2001; 276(35): 32395 - 32398. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |