Hostname: page-component-77f85d65b8-v2srd Total loading time: 0 Render date: 2026-03-28T13:41:38.663Z Has data issue: false hasContentIssue false

Parasites, proteomes and systems: has Descartes’ clock run out of time?

Published online by Cambridge University Press:  25 July 2012

J. M. WASTLING*
Affiliation:
Department of Infection Biology, Institute of Infection and Global Health, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 7ZJ, UK
S. D. ARMSTRONG
Affiliation:
Department of Infection Biology, Institute of Infection and Global Health, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 7ZJ, UK
R. KRISHNA
Affiliation:
Department of Infection Biology, Institute of Infection and Global Health, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 7ZJ, UK Department of Functional and Comparative Genomics, Institute of Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, L69 7ZB, UK
D. XIA
Affiliation:
Department of Infection Biology, Institute of Infection and Global Health, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 7ZJ, UK
*
*Corresponding author: Department of Infection Biology, Institute of Infection and Global Health, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 7ZJ, UK. Tel: 0151 794 4262. E-mail: j.wastling@liv.ac.uk
Rights & Permissions [Opens in a new window]

Summary

Systems biology aims to integrate multiple biological data types such as genomics, transcriptomics and proteomics across different levels of structure and scale; it represents an emerging paradigm in the scientific process which challenges the reductionism that has dominated biomedical research for hundreds of years. Systems biology will nevertheless only be successful if the technologies on which it is based are able to deliver the required type and quality of data. In this review we discuss how well positioned is proteomics to deliver the data necessary to support meaningful systems modelling in parasite biology. We summarise the current state of identification proteomics in parasites, but argue that a new generation of quantitative proteomics data is now needed to underpin effective systems modelling. We discuss the challenges faced to acquire more complete knowledge of protein post-translational modifications, protein turnover and protein-protein interactions in parasites. Finally we highlight the central role of proteome-informatics in ensuring that proteomics data is readily accessible to the user-community and can be translated and integrated with other relevant data types.

Information

Type
Research Article
Copyright
Copyright © Cambridge University Press 2012. The online version of this article is published within an Open Access environment subject to the conditions of the Creative Commons Attribution-NonCommercial-ShareAlike licence <http://creativecommons.org/licenses/by-nc-sa/2.5/>. The written permission of Cambridge University Press must be obtained for commercial re-use.
Figure 0

Fig. 1. A schematic diagram for a proteomics workflow. A typical workflow for a high-throughput identification and quantification proteomics experiment. The workflow consists of four major steps: sample collection, protein extraction and purification, sample fractionation (in either protein or peptide space) and mass spectrometry analysis. Various entry points for quantitative proteomics are shown by arrowed boxes. The collected raw mass spectra data are then analysed by various bioinformatics pipelines illustrated in Fig. 2.

Figure 1

Table 1. Current proteomics projects in protozoan parasites. There are over 15 protozoan species with on-going proteomics projects, half of which have greater than 30% coverage. EuPathDB (www.eupathdb.org) (Aurrecoechea et al., 2010) acts as one of the main portals for eukaryotic pathogen proteomics

Figure 2

Fig. 2. A bioinformatics pipeline for large-scale proteomics data storage, querying and interpretation. The involvement of various bioinformatics tasks in processing and interpreting proteomics data is summarised. Raw mass spectra data collected from an experimental pipeline are subjected to proteomics identification packages, where protein identifications are acquired based on the comparison between raw data and protein sequence databases. The results are then analysed by appropriate quantification packages where relative or absolute quantifications of the analyte proteins are calculated. Protein function and localization prediction and pathway analysis tools have been developed to infer the biological meaning of the identification and quantification data. Proteogenomic and database integration pipelines are available to facilitate data integration with online databases and improve genome annotation using alternate annotations.

Figure 3

Fig. 3. Screenshot of protein expression data of individual genes displayed on ToxoDB. A screenshot illustrating protein expression data for gene a Toxoplasma gondii gene TGME49_086420 viewed on an individual gene record page. Colour coded peptides are mapped to the gene sequence according to the various experiments and laboratories that identified them.

Figure 4

Fig. 4. Workflows for two important tools for proteomics informatics research on ToxoDB . Starting on ToxoDB the frontpage, protein expression evidence can be queried according to the experiments and samples using ‘Identify Genes based on Mass Spec. Evidence.’ tool. The results can be compared with other genome information using ‘Add Step’ tool in the result page. The example shown is the comparison of proteomics results with one of the RNA sequencing datasets (RNA Seq Evidence). ‘Genome Browser’ can also be accessed from ToxoDB frontpage, where various genome information (named tracks) can be selected to be aligned in a defined genomic region. Individual genes from proteomics results can also be viewed directly in GBrowse from the result page. The example shown is proteomics evidence of gene TGME49_100100 aligned with RNA Seq evidence acquired.