Hostname: page-component-6766d58669-vgfm9 Total loading time: 0 Render date: 2026-05-16T11:23:21.745Z Has data issue: false hasContentIssue false

Functional genome annotation of Drosophila seminal fluid proteins using transcriptional genetic networks

Published online by Cambridge University Press:  22 December 2011

JULIEN F. AYROLES
Affiliation:
Department of Genetics, North Carolina State University, Raleigh, NC 27695, USA W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, NC 27695, USA
BROOKE A. LAFLAMME
Affiliation:
Departments of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853-2703, USA
ERIC A. STONE
Affiliation:
Department of Genetics, North Carolina State University, Raleigh, NC 27695, USA W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, NC 27695, USA
MARIANA F. WOLFNER
Affiliation:
Departments of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853-2703, USA
TRUDY F. C. MACKAY*
Affiliation:
Department of Genetics, North Carolina State University, Raleigh, NC 27695, USA W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, NC 27695, USA
*
*Corresponding author: North Carolina State University, Raleigh, NC, USA.
Rights & Permissions [Opens in a new window]

Summary

Predicting functional gene annotations remains a significant challenge, even in well-annotated genomes such as yeast and Drosophila. One promising, high-throughput method for gene annotation is to use correlated gene expression patterns to annotate target genes based on the known function of focal genes. The Drosophila melanogaster transcriptome varies genetically among wild-derived inbred lines, with strong genetic correlations among the transcripts. Here, we leveraged the genetic correlations in gene expression among known seminal fluid protein (SFP) genes and the rest of the genetically varying transcriptome to identify 176 novel candidate SFPs (cSFPs). We independently validated the correlation in gene expression between seven of the cSFPs and a known SFP gene, as well as expression in male reproductive tissues. We argue that this method can be extended to other systems for which information on genetic variation in gene expression is available.

Information

Type
Research Papers
Copyright
Copyright © Cambridge University Press 2011
Figure 0

Fig. 1. Graphical representation of the correlation among known SFPs. Each node represents a gene and each edge the correlation between two genes. The thickness of each edge is scaled proportional to the strength of the correlation between two genes. The absolute value of all correlations depicted is greater than 0·5 (P<0·001).

Figure 1

Fig. 2. Correlation of qRT-PCR estimates of gene expression between cSFP genes and positive and negative SFP control genes (y-axis) to a known ACP gene (CG9997, x-axis) among males of 20 inbred lines. All estimates of gene expression are normalized to that of actin5C. The linear regression line is shown, along with the t-test P-value and the estimate of the correlation coefficient, r. (a) CG11828, r=0·68, P=0·001. (b) CG31413, r=0·81, P=0·000014. (c) CG31493, r=0·77, P=0·000063. (d) CG31496, r=0·51, P=0·022. (e) CG32985, r=0·53, P=0·015. (f) CG34002, r=0·66, P=0·0017. (g) CG9720, r=0·66, P=0·0016. (h) Dup99B (positive control), r=0·55, P=0·012. (i) CG34422 (negative control), r=0·12, P=0·61.

Figure 2

Table 1. Genes selected for experimental validation. The SFP score is the fraction of known SFPs with which the gene had correlated expression. Sprob is the predicted probability of a secretion signal sequence as given by SignalP. Tissue of expression is given from the FlyAtlas compilation and our RT-PCR data from the male reproductive tract and carcass. ED and bulb are not represented in FlyAtlas. Bold font denotes tissues of predominant expression. AG, accessory glands; ED, ejaculatory duct; EB, ejaculatory bulb; T, testis; LSG, larval salivary glands; HT, heart; HD, head. ED and EB are not represented in FlyAtlas.

Figure 3

Fig. 3. RT-PCR analysis of gene expression for cSFP genes and positive and negative SFP control genes in five male tissues: AG, testes (T), EB, ED and carcass (C, non-reproductive tissues). The subscripts denote the two biological replicates of each tissue. The number of PCR cycles for each gene was normalized to give non-saturation results. Actin5C was used as a control for cDNA synthesis. Whole male cDNA was used as a positive (+) PCR control, and no DNA template was used as a negative (−) PCR control.

Supplementary material: File

Ayroles Supplementary Material

Table.xls

Download Ayroles Supplementary Material(File)
File 813.6 KB
Supplementary material: File

Ayroles Supplementary Material

Table.xls

Download Ayroles Supplementary Material(File)
File 127.5 KB