Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Protein Science (2004), 13:278-294. Published by Cold Spring Harbor Laboratory Press. Copyright © 2004 The Protein Society
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shapiro, J.
Right arrow Articles by Brutlag, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shapiro, J.
Right arrow Articles by Brutlag, D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

FoldMiner: Structural motif discovery using an improved superposition algorithm

Jessica Shapiro1 and Douglas Brutlag1,2

1 Biophysics Program and
2 Department of Biochemistry, Stanford University, Stanford, California 94305-5307, USA

Reprint requests to: Douglas Brutlag, B400 Beckman Center, Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305-5307, USA; e-mail: brutlag{at}stanford.edu; fax: (650) 723-6783.

We report an unsupervised structural motif discovery algorithm, FoldMiner, which is able to detect global and local motifs in a database of proteins without the need for multiple structure or sequence alignments and without relying on prior classification of proteins into families. Motifs, which are discovered from pairwise superpositions of a query structure to a database of targets, are described probabilistically in terms of the conservation of each secondary structure element’s position and are used to improve detection of distant structural relationships. During each iteration of the algorithm, the motif is defined from the current set of homologs and is used both to recruit additional homologous structures and to discard false positives. FoldMiner thus achieves high specificity and sensitivity by distinguishing between homologous and nonhomologous structures by the regions of the query to which they align. We find that when two proteins of the same fold are aligned, highly conserved secondary structure elements in one protein tend to align to highly conserved elements in the second protein, suggesting that FoldMiner consistently identifies the same motif in members of a fold. Structural alignments are performed by an improved superposition algorithm, LOCK 2, which detects distant structural relationships by placing increased emphasis on the alignment of secondary structure elements. LOCK 2 obeys several properties essential in automated analysis of protein structure: It is symmetric, its alignments of secondary structure elements are transitive, its alignments of residues display a high degree of transitivity, and its scoring system is empirically found to behave as a metric.

Keywords: structural motif discovery; core fold; structural superposition; structural alignment; structural similarity score; statistical significance score; expectation


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Brief Funct Genomic ProteomicHome page
P. F. Gherardini and M. Helmer-Citterich
Structure-based function prediction: approaches and applications
Brief Funct Genomic Proteomic, July 3, 2008; (2008) eln030v1.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Shi, Y. Zhong, I. Majumdar, S. Sri Krishna, and N. V. Grishin
Searching for three-dimensional secondary structural patterns in proteins with ProSMoS
Bioinformatics, June 1, 2007; 23(11): 1331 - 1338.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Ebert and D. Brutlag
Development and validation of a consistency based multiple structure alignment algorithm
Bioinformatics, May 1, 2006; 22(9): 1080 - 1087.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
Y. Chen and G. M. Crippen
A novel approach to structural alignment using realistic structural and environmental information
Protein Sci., December 1, 2005; 14(12): 2935 - 2946.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
H. Cheng and N. V. Grishin
DOM-fold: A structure with crossing loops found in DmpA, ornithine acetyltransferase, and molybdenum cofactor-binding domain
Protein Sci., July 1, 2005; 14(7): 1902 - 1910.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. G. Kann, P. A. Thiessen, A. R. Panchenko, A. A. Schaffer, S. F. Altschul, and S. H. Bryant
A structure-based method for protein sequence alignment
Bioinformatics, April 15, 2005; 21(8): 1451 - 1456.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Shapiro and D. Brutlag
FoldMiner and LOCK 2: protein structure comparison and motif discovery on the web
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W536 - W541.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2004 by The Protein Society.