Publicado 2013-03-09
Palabras clave
- secondary structure prediction,
- consensus strategy,
- neural networks
Cómo citar
Resumen
En este artículo se propone un nuevo programa para la predicción de la estructura secundaria de la proteína. El programa, llamado NN-Pred, recibe como entrada múltiples secuencias de ADN y utiliza cinco algoritmos existentes para la predicción de la estructura secundaria de la proteína. Además, NN-Pred calcula una predicción consenso que se basa en una estrategia de redes neuronales y que se plantea en este artículo para mejorar la exactitud en la predicción. NN-Pred se obtuvo usando una metodología conocida como estrategia consenso que intenta obtener un modelo de predicción integrando algunos de los mejores métodos existentes. El programa NN-Pred provee una predicción de tres estados (hélices alfa, hojas beta, y otro) para la estructura secundaria de la proteína. De acuerdo a los resultados de las pruebas realizadas, NN-Pred alcanza una exactitud de predicción de al menos 70.0%, sobrepasando la mayoría de los métodos existentes. Los experimentos realizados mostraron que la técnica de redes neuronales se puede usar como una estrategia consenso para obtener modelos precisos para la predicción de la estructura secundaria.
Descargas
Referencias
V. Lim, “Structural principles of the globular organisation of protein chains. A stereochemical theory of globular protein secondary structure,” Journal of Molecular Biology, vol. 88, no. 1, 1974, pp. 857-872.
J. Garnier, D. Osguthorpe and B. Robson, “Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins,” Journal of Molecular Biology, vol, 120, no 1, 1978, pp. 97-120.
B. Rost and C. Sander, “Improved prediction of protein secondary structure by use of sequence profiles and neuronal networks,” Proceedings of the National Academy of Science, vol. 90, no. 1, 1990, pp. 7558-7562.
B. Rost, “PHD: predicting one-dimensional protein structure by profile based neural networks,” Methods Enzimol, vol. 266, no. 1, 1996, pp. 525-539.
C. Cole, J. Barber and G. Barton, “The JPred 3 secondary structure prediction server,” Nucleic Acids Res, vol. 36, no. 1, 2008, pp. 197-201.
M. Osman, M. Abdullah and R. AbdulRashid, “RNA secondary structure prediction using dynamic programming algorithm - A review and proposed work,” Information Technology, vol. 2, 2010, pp. 551-556.
D. Mojie, Z. Yanhong, H. Huiyan, “A Protein Secondary Structure Prediction Tool Using Two-Level Strategy to Improve the Prediction Accuracy of Secondary Structures and Structure Boundaries,” Information Engineering and Computer Science, vol. 1, 2009, pp.1-4.
H. Tsang and K. Wiese, “SARNA-Predict: Accuracy Improvement of RNA Secondary Structure Prediction Using Permutation-Based Simulated Annealing,” Computational Biology and Bioinformatics, vol. 7, no. 4, 2010, pp. 727-740.
B. Tang, X. Wang and X. Wang, “Protein Secondary Structure Prediction Using Large Margin Methods,” Computer and Information Science, vol. 1, 2009, pp. 142-146.
D. Kneller, F. Cohen and R. Langridge, “Improvements in protein secondary structure prediction by an enhanced neural network,” Journal of Molecular Biology, vol. 214, no. 1, 1990, pp. 171-182.
D. Jones, “Protein secondary structure prediction based on position-specific scoring matrices,” Journal of Molecular Biology, vol. 292, 1999, pp. 195-202.
C. Geourjon and G. Deleage, “SOPM: a self-optimized method for protein secondary structure prediction,” Protein Eng, vol. 7, no. 2, 1994, pp. 157-164.
C. Geourjon and G. Deleage, “SOPMA: significant improvements in protein secondary structure prediction by consensus prediction from multiple alignments,” Comput. Appl. Biosci, vol. 1, no. 6, 1995, pp. 681-684.
D. Frishman and P. Argos, “75% accuracy in protein secondary structure prediction,” Proteins, vol. 27, 1997, pp. 329-335.
R. King and M. Sternberg, “Identification and application of the concepts important for accurate and reliable protein secondary structure prediction,” Protein Science, vol. 5, 1996, pp. 2298-2310.
A. Thomas and Y. Zheng, “Improved Prediction of HIV-1 Protease Genotypic Resistance Testing Assays using a Consensus Technique,” Neural Networks, vol. 1, 2006, pp. 2308-2314.
Y. Zhao and W. Zhengzhi, ”Consensus RNA Secondary Structure Prediction Based on SVMs,“ Bioinformatics and Biomedical Engineering, vol. 1, 2008, pp. 101-104.
C. Mazo and O. Bedoya, “PESPAD: una nueva herramienta para la predicción de la estructura secundaria de la proteína basada en árboles de decisión,” Ingeniería y Competitividad, vol. 12, no. 2, 2010, pp. 9-22.
Q. Wang, Y. Shang and D. Xu, “Protein structure selection based on consensus,” Evolutionary Computation, vol. 1, 2010, pp.1-7.
M. Siek, “Nonlinear multi-model ensemble prediction using dynamic Neural Network with incremental learning,” The 2011 International Joint Conference on Neural Networks (IJCNN), vol. 1, 2011, pp. 2873-2880.
Liyu Lin, Shuanqiang Yang, and Ruijuan Zuo, “Protein secondary structure prediction based on multi-SVM ensemble,” International Conference on Intelligent Control and Information Processing(ICICIP), vol. 1, 2010, pp. 356-358.
Bidargaddi, N., Chetty M., and Kamruzzaman, J, “An Architecture Combining Bayesian segmentation and Neural Network Ensembles for Protein Secondary Structure Prediction,” Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), vol. 1, 2005, pp. 1-8.
J. De Haan and J. Leunissen, “Protein Secondary Structure Prediction: Comparison of Ten Common Prediction Algorithms Using a Neural Network,” Nato Science Series Sub Series I Life and behavioural sciences, vol. 368, 2005, pp. 149-161.
J. Allen, M. Pertea and S. Salzberg, “Computational Gene Prediction Using Multiple Sources of Evidence,” Genome Res, vol. 14, 2004, pp. 142-148.
A. Lukashin and M. Bordovsky, “GeneMark.hmm: New solutions for gene finding,” Nucleic Acids Res, vol. 26, 1998, pp. 1107-1115.
M. Pertea and S. Salzberg, “Computational gene finding in plants,” Plant Mol. Biol, vol. 48, 2002, pp. 39-48.
C. Burge and S. Karlin, “Prediction of complete gene structures in human genomic DNA,” J. Mol. Biol, vol. 268, 1997, pp. 78-84.
J. Garnier, J. Gibrat and B. Robson, “GOR method for predicting protein secondary structure from amino acid sequence,” Methods in Enzymology, vol. 266, 1996, pp. 540-553.
J. Levin, “Exploring the limits of nearest neighbour secondary structure prediction,” Protein Engineering, vol. 7, 1997, pp. 771-776.
C. Combet, C. Blanchet, C. Geourjon and G. Deléage, “NPS@: Network Protein Sequence Analysis,“ TIBS, vol. 25, no. 3, 2000, pp. 147-150.
B. Matthews, “Comparison of the predicted and observed secondary structure of T4 phage lysozyme,” Biochimica Biophysica, vol. 405, 1975, pp. 442-451.
W. Kabsch and C. Sander, “Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features,” Biopolymers, vol. 22, 1983, pp. 2577-637.