Abstract
The prediction of protein secondary structure (alpha-helices, beta-sheets and coil) is improved by 9% to 66% using the information available from a family of homologous sequences. The approach is based both on averaging the Garnier et al. (1978) secondary structure propensities for aligned residues and on the observation that insertions and high sequence variability tend to occur in loop regions between secondary structures. Accordingly, an algorithm first aligns a family of sequences and a value for the extent of sequence conservation at each position is obtained. This value modifies a Garnier et al. prediction on the averaged sequence to yield the improved prediction. In addition, from the sequence conservation and the predicted secondary structure, many active site regions of enzymes can be located (26 out of 43) with limited over-prediction (8 extra). The entire algorithm is fully automatic and is applicable to all structural classes of globular proteins.
Original language | English |
---|---|
Pages (from-to) | 957-961 |
Number of pages | 5 |
Journal | Journal of Molecular Biology |
Volume | 195 |
Issue number | 4 |
DOIs | |
Publication status | Published - 20 Jun 1987 |
Keywords
- Algorithms
- Amino acid sequence
- Binding sites
- Protein conformation