Published online by Cambridge University Press: 05 August 2012
Great fleas have little fleas upon their backs to bite ’em, And little fleas have lesser fleas, and so ad infinitum.
(Augustus De Morgan, 1806–1871)
For this chapter, as well as Chapters 12 and 13, we turn to the important genomics and bioinformatics problem of identifying biological function based on nucleotide and amino acid sequences.
Assigning function based on sequence similarity
A common problem in molecular biology is that you are faced with a gene or a gene product and have no clue from experimental studies as to its function. In this context a critical contribution of bioinformatics is to attribute a function to the sequence of a gene or gene product. As one example, a genome sequencing project may give rise to tens of thousands of predicted protein sequences. In such a case we want to assign a biological function to as many of these as possible using computational tools. In this manner we avoid many laborious wet-lab experiments. In addition to genome sequencing projects, there are other, more specialized situations where we want to find the functions of genes. For instance, we could have identified genes as being related to a specific genetic trait or disease, or a set of genes as being expressed under certain conditions.
A number of computational tools are available to predict a biological function associated with a protein sequence. In this chapter we will see an example in which we assign a function to a protein based on sequence similarity. Consider the human gene encoding the protein BRCA1, originally sequenced in 1994 (Miki et al., 1994). It was found to be related in sequence to the yeast protein RAD9, which is involved in cell cycle control. This observation gave scientists a hint about possible roles of the BRCA1 gene. We see here an example of inferring a function based on a homology relationship to a protein that has already been functionally characterized. We will see another example of this situation later in this chapter, where we will make use of BLAST to identify a homology relationship. We already encountered BLAST in the context of the BCR–ABL fusion protein in Chapter 7.
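The idea of transferring a function from a characterized protein to a similar uncharacterized one can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used in this chapter: it assumes the BLAST results are available in the standard 12-column tabular format (query, subject, percent identity, alignment length, mismatches, gap openings, query start and end, subject start and end, E-value, bit score), and the sequence names, the annotation table and the E-value cutoff of 1e-5 are all hypothetical values chosen for the example.

```python
from typing import Dict, List, NamedTuple


class Hit(NamedTuple):
    query: str
    subject: str
    percent_identity: float
    evalue: float
    bitscore: float


def parse_tabular(text: str) -> List[Hit]:
    """Parse BLAST tabular output (12 tab-separated columns per hit)."""
    hits = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        hits.append(
            Hit(
                query=fields[0],
                subject=fields[1],
                percent_identity=float(fields[2]),
                evalue=float(fields[10]),
                bitscore=float(fields[11]),
            )
        )
    return hits


def assign_functions(
    hits: List[Hit],
    known_functions: Dict[str, str],
    max_evalue: float = 1e-5,  # assumed cutoff, not a recommendation
) -> Dict[str, str]:
    """For each query, copy the annotation of its best-scoring annotated hit."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        if hit.evalue > max_evalue or hit.subject not in known_functions:
            continue  # too weak, or the subject has no known function
        current = best.get(hit.query)
        if current is None or hit.bitscore > current.bitscore:
            best[hit.query] = hit
    return {query: known_functions[hit.subject] for query, hit in best.items()}


# Made-up hits, for illustration only (not real BLAST output).
EXAMPLE_TABLE = "\n".join(
    [
        "query_1\tknown_A\t42.0\t180\t100\t4\t1\t180\t5\t184\t1e-30\t120.0",
        "query_1\tknown_B\t30.0\t90\t60\t3\t10\t99\t20\t109\t1e-8\t55.0",
        "query_2\tknown_B\t25.0\t60\t45\t0\t1\t60\t1\t60\t0.5\t22.0",
    ]
)

# Hypothetical annotations of already characterized proteins.
KNOWN_FUNCTIONS = {
    "known_A": "cell cycle checkpoint control",
    "known_B": "protein kinase",
}

if __name__ == "__main__":
    print(assign_functions(parse_tabular(EXAMPLE_TABLE), KNOWN_FUNCTIONS))
```

In this toy table the first query receives the annotation of its strongest hit, while the second query is left unassigned because its only hit has an E-value above the cutoff. An assignment made this way is a hypothesis about function, in the same sense that the RAD9 similarity was a hint about BRCA1, and it still needs experimental confirmation.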