Amino acid sequence autocorrelation vectors and Bayesian-regularized genetic neural networks for modeling protein conformational stability:: Gene V protein mutants

Fernandez, Leyden; Caballero, Julio; Abreu, Jose Igancio; Fernandez, Michael

Abstract

Development of novel computational approaches for modeling protein properties from their primary structure is the main goal in applied proteomics. In this work, we reported the extension of the autocorrelation vector formalism to amino acid sequences for encoding protein structural information with modeling purposes. Amino acid sequence autocorrelation (AASA) vectors were calculated by measuring the autocorrelations at sequence lags ranging from 1 to 15 on the protein primary structure of 48 amino acid/residue properties selected from the AAindex data base. A total of 720 AASA descriptors were tested for building predictive models of the change of thermal unfolding Gibbs free energy change (Delta Delta G) of gene V protein upon mutation. In this sense, ensembles of Bayesian-regularized genetic neural networks (BRGNNs) were used for obtaining an optimum nonlinear model for the conformational stability. The ensemble predictor described about 88% and 66% variance of the data in training and test sets respectively. Furthermore, the optimum AASA vector subset not only helped to successfully model unfolding stability but also well distributed wild-type and gene V protein mutants on a stability self-organized map (SOM), when used for unsupervised training of competitive neurons. Proteins 2007;67:834-852. (C) 2007 Wiley-Liss, Inc.

Más información

Título según WOS: ID WOS:000246415700006 Not found in local WOS DB
Título de la Revista: PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS
Volumen: 67
Número: 4
Editorial: Wiley
Fecha de publicación: 2007
Página de inicio: 834
Página final: 852
DOI:

10.1002/prot.21349

Notas: ISI