Multiple Sequence Alignment I

Rex A. Dwyer

doi:10.1017/CBO9781139164764.010

9 - Multiple Sequence Alignment I

Published online by Cambridge University Press: 05 June 2012

Rex A. Dwyer

Show author details

Rex A. Dwyer: Affiliation:
The BioAlgorithmic Consultancy

Book contents

Get access

Summary

Once a family of homologous proteins has been identified, it is often useful to arrange their sequences in a multiple alignment such as the one in Figure 9.1.

A multiple alignment is useful for constructing a so-called consensus sequence, which – while probably differing from every individual sequence in the family – is nonetheless a better representative of the family than any of its actual members. Multiple alignments can also form the basis of more abstract statistical models of the protein family called profiles.

By examining which elements of the consensus are present in most or all family members and which exhibit a greater degree of variability, we can also find clues to the protein's function. Highly conserved regions are likely to have been conserved because they form active sites crucial to function, while more variable regions are more likely to have merely structural roles.

We have already seen in Chapter 3 that the number of ways in which a mere two sequences of only moderate length can be aligned is comparable to current estimates of the number of atoms in the observable universe. The addition of more sequences only increases the number of possibilities. We need both a criterion for evaluating multiple alignments and a computational strategy that will allow us to eliminate large sets of alignments at one stroke.

To describe our evaluation criterion, we will rely on the notion of projection of a multiple alignment.

Type: Chapter
Information: Genomic Perl
From Bioinformatics Basics to Working Code
, pp. 127 - 140

DOI: https://doi.org/10.1017/CBO9781139164764.010 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2002

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

9 - Multiple Sequence Alignment I

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive