Hostname: page-component-77f85d65b8-9nbrm Total loading time: 0 Render date: 2026-04-18T04:32:30.615Z Has data issue: false hasContentIssue false

Anticipating innovations in structural biology

Published online by Cambridge University Press:  02 July 2018

Helen M. Berman*
Affiliation:
Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, 174 Frelinghuysen Road, Piscataway, New Jersey 08854, USA
Catherine L. Lawson
Affiliation:
Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, 174 Frelinghuysen Road, Piscataway, New Jersey 08854, USA
Brinda Vallat
Affiliation:
Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, 174 Frelinghuysen Road, Piscataway, New Jersey 08854, USA
Margaret J. Gabanyi
Affiliation:
Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Institute for Quantitative Biomedicine, 174 Frelinghuysen Road, Piscataway, New Jersey 08854, USA
*
Author for correspondence: Helen M. Berman, E-mail: berman@rcsb.rutgers.edu
Rights & Permissions [Opens in a new window]

Abstract

In this review, we describe how the interplay among science, technology and community interests contributed to the evolution of four structural biology data resources. We present the method by which data deposited by scientists are prepared for worldwide distribution, and argue that data archiving in a trusted repository must be an integral part of any scientific investigation.

Information

Type
Review
Copyright
Copyright © Cambridge University Press 2018 
Figure 0

Fig. 1. Growth chart of structures in the PDB with indicators of each decade. (a) The number of structures released per year (blue) and the cumulative number of structures (orange). (b) The same information, using a log scale. The number of structures released at the end of each decade is indicated by black brackets.

Figure 1

Fig. 2. Examples of structures determined in the 1970's. Ribbon representations were generated using UCSF Chimera (Pettersen et al., 2004). (a) Myoglobin (Watson, 1969), PDB ID: 1MBN. First protein structure determined using X-ray crystallography. (b) Lysozyme (Blake et al., 1965; Kelly et al., 1979), PDB ID: 9LYZ. First enzyme structure determined using X-ray crystallography. (c) Yeast phenylalanine transfer RNA (Rich & Kim, 1978; Robertus et al., 1974), PDB ID: 4TNA. First RNA structure determined using X-ray crystallography.

Figure 2

Fig. 3. Examples of structures determined in the 1980's. (a) A, B and Z DNA (Dickerson et al., 1982). This representation of the three canonical forms of DNA is taken from the Molecule of the Month (Goodsell, 2001). (b) Rhinovirus (Arnold & Rossmann, 1988), PDB ID: 4RHV. This was one of the early virus structures determined using X-ray crystallography. Three unique chains (grey, pink, orange surfaces) are repeated 60-fold to create a virus capsid with icosahedral symmetry.

Figure 3

Fig. 4. Examples of structures determined in the 1990's. (a) The structure of a regulator of transcription called the TATA-binding protein bound to DNA. The binding of beta sheets into the minor groove of DNA causes a profound bend in the DNA (Patikoglou et al., 1999), PDB ID: 1QN6. (b) Nucleosome (Luger et al., 1997). The DNA is shown in orange wraps around the histone proteins shown in blue. Taken from the Molecule of the Month (Goodsell, 2000a).

Figure 4

Fig. 5. Ribosome subunits. The small subunit is shown on the left and the large on the right (Ban et al., 2000; Carter et al., 2000; Schluenzen et al., 2000). The protein is shown in blue and the RNA in orange and yellow. Taken from the Molecule of the Month (Goodsell, 2000b).

Figure 5

Table 1. Summary statistics of structures and other research products produced by the Protein Structure Initiative (PSI), 2000–2017

Figure 6

Fig. 6. Cumulative growth of 3DEM Structures. The number of structures available in EMDB for each recent year is indicated in purple (resolution better than 5·0 Å, dark purple); the number of EM-derived models available in PDB is indicated in green (resolution better than 5·0 Å, dark green).

Figure 7

Fig. 7. Sampling of 3DEM structures recently released in EMDB: (a) GroEL (Roh et al., 2017), EMD-8750 (b) DNA Protein Kinase (Sharif et al., 2017), EMD-8751 (c) Heterotrimeric Gs protein complex (Liang et al., 2017), EMD-8623 (d) Glutamate A2 receptor (Twomey et al., 2017), EMD-8823 (e) Spliceosome (Wan et al., 2016), EMD-9525 (f) Rhinovirus/Fab complex (Dong et al., 2017), EMD-8763.

Figure 8

Fig. 8. The Data Processing Pipeline, from Data Creation through Distribution. Each component of the PDB pipeline is described in the section The current PDB pipeline.

Figure 9

Table 2. Experimental metadata requirements for the methods currently supported by the PDB

Figure 10

Fig. 9. OneDep System. Deposition is provided for X-ray, 3DEM and NMR. The annotation pipeline is made up of several modules that check the chemistry of the components, add new annotations and validate the structural model against standard geometries and the experimental data.

Figure 11

Fig. 10. I/H model of the Nup84 sub-complex from the Nuclear Pore Complex (Shi et al., 2014) available from PDB-Dev (Burley et al., 2017; Vallat et al., 2018), PDB-Dev ID: PDBDEV_00000001. Multi-scale structural model of the heptameric Nup84 sub-complex is shown (colored ribbons and spheres) along with the localization densities of the sampled structures (colored contoured surfaces). The model is obtained using the Integrative Modeling Platform (IMP) software (Russel et al., 2012) and visualized using ChimeraX software (Goddard et al., 2018).

Figure 12

Fig. 11. Conceptual diagram of the I/H Methods Federation. At the center are the three structural biology model repositories: the PDB archives experimentally determined structures of macromolecules (Berman et al., 2000); the Model Archive (MA), part of the Protein Model Portal (PMP), archives in silico structural models (Bordoli and Schwede, 2012; Haas and Schwede, 2013; Haas et al., 2013); and PDB-development (PDB-Dev) is the prototype system for archiving I/H models (Vallat et al., 2018; Burley et al., 2017). The outer circle consists of experimental data repositories that contribute to structural biology. Only a limited set of experimental data archives have been identified at present and many others may be included as the field evolves and the respective research communities build their own repositories.