Hostname: page-component-6766d58669-76mfw Total loading time: 0 Render date: 2026-05-20T22:51:05.681Z Has data issue: false hasContentIssue false

A molecular engineering toolbox for the structural biologist

Published online by Cambridge University Press:  02 May 2017

Galia T. Debelouchina
Affiliation:
Department of Chemistry, Princeton University, Princeton, NJ 08540, USA
Tom W. Muir*
Affiliation:
Department of Chemistry, Princeton University, Princeton, NJ 08540, USA
*
*Author for correspondence: T. W. Muir, Department of Chemistry, Princeton University, Princeton, NJ 08540, USA. Email: muir@princeton.edu
Rights & Permissions [Opens in a new window]

Abstract

Exciting new technological developments have pushed the boundaries of structural biology, and have enabled studies of biological macromolecules and assemblies that would have been unthinkable not long ago. Yet, the enhanced capabilities of structural biologists to pry into the complex molecular world have also placed new demands on the abilities of protein engineers to reproduce this complexity into the test tube. With this challenge in mind, we review the contents of the modern molecular engineering toolbox that allow the manipulation of proteins in a site-specific and chemically well-defined fashion. Thus, we cover concepts related to the modification of cysteines and other natural amino acids, native chemical ligation, intein and sortase-based approaches, amber suppression, as well as chemical and enzymatic bio-conjugation strategies. We also describe how these tools can be used to aid methodology development in X-ray crystallography, nuclear magnetic resonance, cryo-electron microscopy and in the studies of dynamic interactions. It is our hope that this monograph will inspire structural biologists and protein engineers alike to apply these tools to novel systems, and to enhance and broaden their scope to meet the outstanding challenges in understanding the molecular basis of cellular processes and disease.

Information

Type
Review
Copyright
Copyright © Cambridge University Press 2017 
Figure 0

Fig. 1. Molecular engineering toolbox for the structural biologist.

Figure 1

Fig. 2. Chemical modification of cysteine residues.

Figure 2

Fig. 3. Chemical modification of natural amino acids. (a) Modification of lysine ε-amines with activated esters such as N-hydroxysuccinimide. (b) Modification of terminal α-amines with 2-pyridinecarboxyaldehydes. (c) Three-component Mannich reaction for tyrosine modification at the ortho-position. (d) Coupling of carboxyls and amines with carbodiimides such as EDC.

Figure 3

Fig. 4. Native chemical ligation at cysteine followed by desulfurization to alanine for the construction of larger polypeptide chains without any ‘scars’.

Figure 4

Fig. 5. Examples of constructs prepared by NCL and EPL for X-ray crystallography studies. (a) D-alanine was introduced at position 77 in the sequence of the potassium channel KcsA to elucidate its ion selectivity mechanism (Valiyaveetil et al. 2006) (PDB ID: 2IH3). (b) Acetylated lysine (Ac) was incorporated at postions 401 and 408 in S-Adenosylhomocysteine hydrolase (SAHH) to evaluate the structural basis of enzyme inhibition (Wang et al. 2014b) (PDB ID: 4PFJ). (c) Chemical synthesis of HIV protease afforded the site-specific incorporation of unnatural amino acids such as 2-aminoisobutyric acid to modulate conformational dynamics and catalysis (Torbeev et al. 2011) (PDB ID: 3IAW). (d) Semi-synthesis of Mxe GyrA and the installation of β-thienyl-alanine instead of the native histidine at position 187 provided a route to trap the branched intermediate of the intein (Liu et al. 2014b) (PDB ID: 4OZ6).

Figure 5

Fig. 6. Intein structure and mechanism. (a) Intein/extein residues important for splicing. (b) Protein splicing mechanism of contiguous inteins. In some cases, the hydroxyl groups of Ser/Thr act as nucleophiles in the first two steps.

Figure 6

Fig. 7. Protein engineering with inteins. (a) Expressed protein ligation. (b) Tagless protein purification. (c) Protein trans-splicing and recombinant production of segmentally isotopically labeled proteins.

Figure 7

Table 1. Intein toolbox for protein semi-synthesis

Figure 8

Fig. 8. C-terminal protein labeling with sortase. The acyl donor requires the LPXTG recognition motif, while the acyl acceptor often contains a pentaglycine sequence.

Figure 9

Fig. 9. Unnatural amino acid (UAA) incorporation by amber suppression. (a) An orthogonal aminoacyl tRNA synthetase charges a matching tRNA with the UAA of interest. The ribosome incorporates the UAA into a growing polypeptide chain by decoding the amber stop codon (UAG) on the messenger RNA. The UAA toolbox includes UAAs that represent (b) protein post-translational modifications, (c) spectroscopic probes, (d) cross-linkers, (e) bio-orthogonal reactive handles.

Figure 10

Fig. 10. Chemical modification of unnatural amino acids.

Figure 11

Fig. 11. Examples of bioenzymatic conjugation approaches. (a) Site-specific modification of cysteine with formylglycine generating enzyme, followed by oxime ligation to attach a chemical or optical probe. (b) Introduction of ‘click’ handles into proteins using lipoic acid ligase and lipoic acid analogs.

Figure 12

Fig. 12. Principle of racemic crystallography. The L- and D- forms of the polypeptide chains are prepared separately by solid-phase peptide synthesis and native chemical ligation. The proteins are subsequently mixed and co-crystallized, thus gaining access to a much larger set of crystallographic space groups.

Figure 13

Fig. 13. Strategies for segmental isotopic labeling of proteins for NMR analysis. (a) Intein-based segmental labeling. (b) Strategy for segmental labeling of proteins in cells. (c) Labeling with sortases.

Figure 14

Fig. 14. Dual labeling of proteins with fluorophores. (a) A labeling strategy based on the combination of cysteine chemistry and amber suppression (Brustad et al. 2008). (b) The FlAsH labeling system can be used for the selective modification of a genetically encoded peptide tag, in combination with amber suppression (Perdios et al. 2017). (c) A dual labeling strategy based on native chemical ligation and amber suppression (Wissner et al. 2013). (d) Genetic incorporation of two UAAs using orthogonal ribosomes that can decode the AGTA quadruplet codon (Sachdeva et al. 2014).