|
|
||||||||
1 Bioinformatics Laboratory, International Institute of Molecular and Cell Biology (IIMCB), 02-109 Warsaw, Poland
2 Stockholm Bioinformatics Center, Stockholm University, 10691 Stockholm, Sweden
3 Department of Bioinformatics and Computer Science, Ben Gurion University, Beer-Sheva 84105, Israel
Reprint requests to: Dr. Leszek Rychlewski, Bioinformatics Laboratory International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland; e-mail: leszek{at}bioinfo.pl; fax: 48-22-668-5288.
We present a novel, continuous approach aimed at the large-scale assessment of the performance of available fold-recognition servers. Six popular servers were investigated: PDB-Blast, FFAS, T98-lib, GenTHREADER, 3D-PSSM, and INBGU. The assessment was conducted using as prediction targets a large number of selected protein structures released from October 1999 to April 2000. A target was selected if its sequence showed no significant similarity to any of the proteins previously available in the structural database. Overall, the servers were able to produce structurally similar models for one-half of the targets, but significantly accurate sequence-structure alignments were produced for only one-third of the targets. We further classified the targets into two sets: easy and hard. We found that all servers were able to find the correct answer for the vast majority of the easy targets if a structurally similar fold was present in the server's fold libraries. However, among the hard targetswhere standard methods such as PSI-BLAST failthe most sensitive fold-recognition servers were able to produce similar models for only 40% of the cases, half of which had a significantly accurate sequence-structure alignment. Among the hard targets, the presence of updated libraries appeared to be less critical for the ranking. An "ideally combined consensus" prediction, where the results of all servers are considered, would increase the percentage of correct assignments by 50%. Each server had a number of cases with a correct assignment, where the assignments of all the other servers were wrong. This emphasizes the benefits of considering more than one server in difficult prediction tasks. The LiveBench program (http://BioInfo.PL/LiveBench) is being continued, and all interested developers are cordially invited to join.
Keywords: Automated protein structure prediction; benchmarking; meta server; CAFASP; LiveBench
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
D. Przybylski and B. Rost Powerful fusion: PSI-BLAST and consensus sequences Bioinformatics, September 15, 2008; 24(18): 1987 - 1993. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Rangwala and G. Karypis Incremental window-based protein sequence alignment algorithms Bioinformatics, January 15, 2007; 23(2): e17 - e23. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Romanyuk, D. J. Rigden, O. K. Vatamaniuk, A. Lang, R. E. Cahoon, J. M. Jez, and P. A. Rea Mutagenic Definition of a Papain-Like Catalytic Triad, Sufficiency of the N-Terminal Domain for Single-Site Core Catalytic Enzyme Acylation, and C-Terminal Domain for Augmentative Metal Activation of a Eukaryotic Phytochelatin Synthase Plant Physiology, July 1, 2006; 141(3): 858 - 869. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. S. Pettitt, L. J. McGuffin, and D. T. Jones Improving sequence-based fold recognition by using 3D model quality assessment Bioinformatics, September 1, 2005; 21(17): 3509 - 3515. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhou, C. Zhang, S. Liu, and Y. Zhou Web-based toolkits for topology prediction of transmembrane helical proteins, fold recognition, structure and binding scoring, folding-kinetics analysis and comparative analysis of domain combinations Nucleic Acids Res., July 1, 2005; 33(suppl_2): W193 - W197. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Jaroszewski, L. Rychlewski, Z. Li, W. Li, and A. Godzik FFAS03: a server for profile-profile sequence alignments Nucleic Acids Res., July 1, 2005; 33(suppl_2): W284 - W288. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Bogan, G. R. Agnes, F. Pio, and R. B. Cornell Interdomain and Membrane Interactions of CTP:Phosphocholine Cytidylyltransferase Revealed via Limited Proteolysis and Mass Spectrometry J. Biol. Chem., May 20, 2005; 280(20): 19613 - 19624. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ginalski, N. V. Grishin, A. Godzik, and L. Rychlewski Practical lessons from protein structure prediction Nucleic Acids Res., April 1, 2005; 33(6): 1874 - 1891. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhang, S. Kochhar, and M. G. Grigorov Descriptor-based protein remote homology identification Protein Sci., February 1, 2005; 14(2): 431 - 444. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Rychlewski and D. Fischer LiveBench-8: The large-scale, continuous assessment of automated protein structure prediction Protein Sci., January 1, 2005; 14(1): 240 - 245. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Zhang, S. Kochhar, and M. Grigorov Exploring the sequence-structure protein landscape in the glycosyltransferase family Protein Sci., October 1, 2003; 12(10): 2291 - 2302. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. L. Franco and D. J. Rigden Fold recognition analysis of glycosyltransferase families: further members of structural superfamilies Glycobiology, October 1, 2003; 13(10): 707 - 712. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ginalski and L. Rychlewski Detection of reliable and unexpected protein fold predictions using 3D-Jury Nucleic Acids Res., July 1, 2003; 31(13): 3291 - 3292. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Y. Y. Koh, V. A. Eyrich, M. A. Marti-Renom, D. Przybylski, M. S. Madhusudhan, N. Eswar, O. Grana, F. Pazos, A. Valencia, A. Sali, et al. EVA: evaluation of protein structure prediction servers Nucleic Acids Res., July 1, 2003; 31(13): 3311 - 3315. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ginalski, J. Pas, L. S. Wyrwicz, M. v. Grotthuss, J. M. Bujnicki, and L. Rychlewski ORFeus: detection of distant homology using sequence profiles and predicted secondary structure Nucleic Acids Res., July 1, 2003; 31(13): 3804 - 3807. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Wallner and A. Elofsson Can correct protein models be identified? Protein Sci., May 1, 2003; 12(5): 1073 - 1086. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Pieper, N. Eswar, A. C. Stuart, V. A. Ilyin, and A. Sali MODBASE, a database of annotated comparative protein structure models Nucleic Acids Res., January 1, 2002; 30(1): 255 - 259. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lundstrom, L. Rychlewski, J. Bujnicki, and A. Elofsson Pcons: A neural-network-based consensus predictor that improves fold recognition Protein Sci., November 1, 2001; 10(11): 2354 - 2362. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Friedhoff, R. Lurz, G. Luder, and A. Pingoud Sau3AI, a Monomeric Type II Restriction Endonuclease That Dimerizes on the DNA and Thereby Induces DNA Loops J. Biol. Chem., June 22, 2001; 276(26): 23581 - 23588. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |