Data visualization and the DDP process

doi:10.1017/CBO9780511989421.007

6 - Data visualization and the DDP process

Published online by Cambridge University Press: 05 February 2016

Ke Xu

Edited by

William T. Loging

Show author details

Ke Xu: Affiliation:
Bristol-Myers-Squibb Inc
William T. Loging: Affiliation:
Mount Sinai School of Medicine, New York

Book contents

Get access

Summary

Data visualization denotes the techniques of visually presenting complex data sets to achieve goals such as displaying multiple data dimensions simultaneously, connecting related data points from data sets, or showing data distribution patterns. They are of great value for data processing, data analysis, and data presentation activities.

Genomics and functional genomics are the major driving forces for the development and utilization of visualization tools in biological fields. Following the completion of genomic sequencing projects of human and other model organisms around the beginning of this century, our knowledge of genes has jumped to the tens of thousands per species. Expression profiling microarray can generate millions of data points per experiment. The challenge of the huge data set size and the need to integrate different data sources in analyses prompted significant research and development work by both academic and industrial bioinformaticians. As a result, many visualization methods, proposals, and tools for biological data have been developed thus far. This chapter will describe the problems and solutions for the visualization of three basic and largest (thus, most challenging) genomics/functional genomics data types. More specifically, the first two sections will discuss visualization of sequence data and pathway/gene network data, which are two data types specific to genomics and other biology fields. In the third section, we will review visualization methods of numeric data, such as expression profiling data, proteomic data, and genotyping data. Most of the techniques in the section can also be applied to other areas. However, some topics, such as viewing numeric data in the context of genome or pathways, are still biology-specific.

Sequence and genomes

The genome is the complete set of genetic materials for an organism, which includes genes, regulatory and replication-related sequences, as well as non-functional intergenic regions. For most organisms other than RNA viruses, long linear or circular DNA molecules form the biochemical basis of the genome that stores all the genetic information. Visualization of the genome refers to the visual display of the DNA sequences and associated annotations. Depending on the visualization purposes, genome visualization tools can be classified into two categories: sequence viewer for visualizing sequence and annotations, and genome alignment viewer, for comparing different genomes.

Information

Type: Chapter
Information: Bioinformatics and Computational Biology in Drug Discovery and Development , pp. 114 - 136

DOI: https://doi.org/10.1017/CBO9780511989421.007 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2016

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book purchase

Temporarily unavailable

References

Berriman, M. and Rutherford, K.Viewing and annotating sequence data with Artemis. Briefings in Bioinformatics. 2003;4(2):124–132.CrossRef Google Scholar PubMed

Frazer, K. A., Pachter, L., Poliakov, A., Rubin, E. M. and Dubchak, I.VISTA: Computational tools for comparative genomics. Nucleic Acids Research. 2004;32: W273–W279.CrossRef Google Scholar PubMed

Junker, B. H., Klukas, C. and Schreiber, F.VANTED: A system for advanced data analysis and visualization in the context of biological networks. BMC Bioinformatics. 2006;7:109–121.Google Scholar PubMed

Le Novère, N. L., Hucka, M., Mi, H., et al. The systems biology graphical notation. Nature Biotechnology, 2009;27(8):735–741.Google Scholar PubMed

Lewis, S. E., Searle, S. M., Harris, N., et al. Apollo: A sequence annotation editor. Genome Biology. 2002;3(12):research0082.1-0082.14.CrossRef Google Scholar PubMed

Schwartz, S., Zhang, Z., Frazer, K. A., et al. PipMaker – A web server for aligning two genomic DNA sequences. Genome Research. 2000;10:577–586.CrossRef Google Scholar PubMed

Accessibility standard: Unknown

Why this information is here

This section outlines the accessibility features of this content - including support for screen readers, full keyboard navigation and high-contrast display options. This may not be relevant for you.

Accessibility Information

Accessibility compliance for the PDF of this book is currently unknown and may be updated in the future.