Hostname: page-component-89b8bd64d-shngb Total loading time: 0 Render date: 2026-05-06T10:47:58.171Z Has data issue: false hasContentIssue false

From covalent transition states in chemistry to noncovalent in biology: from β- to Φ-value analysis of protein folding

Published online by Cambridge University Press:  20 March 2024

Alan R. Fersht*
Affiliation:
MRC Laboratory of Molecular Biology, Cambridge, UK Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK Gonville and Caius College, University of Cambridge, Cambridge, UK
*
Rights & Permissions [Opens in a new window]

Abstract

Solving the mechanism of a chemical reaction requires determining the structures of all the ground states on the pathway and the elusive transition states linking them. 2024 is the centenary of Brønsted’s landmark paper that introduced the β-value and structure-activity studies as the only experimental means to infer the structures of transition states. It involves making systematic small changes in the covalent structure of the reactants and analysing changes in activation and equilibrium-free energies. Protein engineering was introduced for an analogous procedure, Φ-value analysis, to analyse the noncovalent interactions in proteins central to biological chemistry. The methodology was developed first by analysing noncovalent interactions in transition states in enzyme catalysis. The mature procedure was then applied to study transition states in the pathway of protein folding – ‘part (b) of the protein folding problem’. This review describes the development of Φ-value analysis of transition states and compares and contrasts the interpretation of β- and Φ-values and their limitations. Φ-analysis afforded the first description of transition states in protein folding at the level of individual residues. It revealed the nucleation-condensation folding mechanism of protein domains with the transition state as an expanded, distorted native structure, containing little fully formed secondary structure but many weak tertiary interactions. A spectrum of transition states with various degrees of structural polarisation was then uncovered that spanned from nucleation-condensation to the framework mechanism of fully formed secondary structure. Φ-analysis revealed how movement of the expanded transition state on an energy landscape accommodates the transition from framework to nucleation-condensation mechanisms with a malleability of structure as a unifying feature of folding mechanisms. Such movement follows the rubric of analysis of classical covalent chemical mechanisms that began with Brønsted. Φ-values are used to benchmark computer simulation, and Φ and simulation combine to describe folding pathways at atomic resolution.

Information

Type
Review
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Transition state is at a maximum for free energy, G, versus reaction coordinate, r.

Figure 1

Figure 2. Transition state for the general-base-catalysed attack of water on an ester.

Figure 2

Figure 3. Illustration of one type of origin for a LFER. In the plot of G versus reaction coordinate, r, the energy function of the starting material S crosses that of the products P at the transition state. To an approximation, if the structure and energetics are perturbed such the energy of P is increased relatively by $ \Delta \Delta {G}^0 $ to S, the energy of the transition state will be increased by a value of $ \Delta \Delta {G}^{\ddagger } $ that is less than $ \Delta \Delta {G}^0 $ and determined by the angles and so forth at the point of intersection. Apart from the extreme values of the position of the transition $ {r}^{\ddagger }=0 $ or 1, $ {r}^{\ddagger } $ does not generally = $ \Delta \Delta {G}^{\ddagger }/\Delta \Delta {G}^0 $ that is, ≠ α or β (Fersht, 2004b). The small change in r with changes in energetics is the basis of the Hammond Postulate (Hammond, 1955) whereby as the energy of the high energy state increases, the transition state structure moves closer to it.

Figure 3

Figure 4. Difference energy plot for mutations of side chains of the tyrosyl-tRNA synthetase. The values of $ \Delta \Delta {G}_{\mathrm{mut}-\mathrm{wt}} $ (mutant – wild type) for the $ \Delta G $ of binding Tyr, ATP, [T-A], T-A.PPi and T-A in the formation of tyrosyl-adenylate (Eq. (9)) on mutation of residues Cys35 and His48 (data from Wells and Fersht, 1986; Fersht et al.,1987).

Figure 4

Figure 5. A linear free energy relationship for the reaction E.Tyr.ATP → E.Tyr.ATP.PPi of the tyrosyl-tRNA synthetase (k3 and k3./k−3 in Eq. (9)) (Fersht et al.,1987).

Figure 5

Figure 6. Difference energy diagrams for residues in the binding site of the tyrosyl-tRNA synthetase that bind to the charged oxygens of α-phosphate of ATP primarily in the pentacovalent transition state on the nucleophilic attack of the carboxylate of tyrosine (Fersht, 1987).

Figure 6

Figure 7. Classical mechanisms of folding. Left: the framework/diffusion-collision model; middle, nucleation-growth; right, hydrophobic collapse/molten globule.

Figure 7

Figure 8. Reduction of an energy landscape to a conventional reaction coordinate diagram. This reconciles the classical view of a pathway with the ‘new view’ of an energy landscape with an ensemble of conformations (after Eaton et al.,1996). Q is the relative number of pairwise native contacts in the landscape description and r is the conventional overall reaction coordinate. The number and heterogeneity of individual states decreases as the protein folds. (A, cross-section through a folding funnel (courtesy of P.G. Wolynes); B, reducing the landscape to a collection of ensembles moving along a pathway for the folding of a two-state protein such as CI2; and C, folding of a protein with a more structured denatured state.

Figure 8

Figure 9. Thermodynamic cycles for the basis of Φ-value analysis (relabelled from Matouschek et al.,1989).

Figure 9

Figure 10. Free energy profiles for mutations giving $ \varPhi =0 $ when the mutated residue A is in disordered region (left) or 1 in a fully native (right). The energy profiles are simplified with the energies of the denatured states D for wild-type and D’ for mutant being set at the same level.

Figure 10

Figure 11. Double-Mutant cycles. X and Y are mutated individually and as a pair, and the values of $ \Delta {G}_{\mathrm{D}-\mathrm{N}} $ or $ \Delta {G}_{\ddagger -\mathrm{N}} $ measured. Interaction energies of X and Y with other residues cancel in the $ \Delta \Delta {G}_{\mathrm{int}} $ cycles and are perturbed only by $ \Delta \Delta {G}_{\mathrm{reorg}} $ terms in the folded state. For the denatured state, $ \Delta \Delta {G}_{\mathrm{int}}=0 $ when the residues X and Y do not interact with each other. Accordingly, the measured values of $ \Delta \Delta {G}_{\mathrm{int}}=\Delta \Delta {G}_{\mathrm{E}\mathrm{Y}-\mathrm{EX}\mathrm{Y}}-\Delta {G}_{\mathrm{E}-\mathrm{EX}}=\Delta {G}_{\mathrm{E}\mathrm{X}-\mathrm{EX}\mathrm{Y}}-\Delta {G}_{\mathrm{E}-\mathrm{EY}} $ give the interaction energies between X and Y in the native state at equilibrium or in the transition state for kinetics.

Figure 11

Figure 12. Possible parallel pathways of folding (Fersht et al.,1994).

Figure 12

Figure 13. Chevron plot for the folding of CI2 determined by stopped-flow kinetics (Jackson and Fersht, 1991a) and, inset, barnase (Matouschek et al.,1990). Rate constants are in units of s−1. For CI2, the plot is for a perfect two-state transition and the arms are linear. For barnase, there is deviation at low denaturant concentration from the perfect theoretical two-state (solid line) because of a change in the structure of the denatured state or presence of a folding intermediate.

Figure 13

Figure 14. Chevron plots for folding of FBP28, which nicely illustrate $ \varPhi =0 $ , B, where the refolding limbs overlap, or 0, A, where the unfolding limbs overlap, and C and D for fractional values. T-jump was required for the rate constants in the range of 10,000 s−1 (Petrovich et al.,2006).

Figure 14

Figure 15. Barnase folding from experiment and simulation. An MD unfolding simulation from the native state N to the denatured state D at 225 C, is shown in reverse. The structures are coloured from red at the N-terminus to blue at the C-terminus. The denatured state is an ensemble of structures whose overall topology resembles that of the native state. Τhe hairpin at the centre of the antiparallel β-sheet is present in the denatured state, albeit with some non-native interactions. The N-terminal helix is partly structured, stabilised by hydrophobic interactions. The final transition state consists of the largely formed N-terminal helix docked onto the β-sheet, which is strongly formed in the central regions, with the hydrophobic core in the process of being formed and other interactions consolidated (Fersht and Daggett, 2002).

Figure 15

Figure 16. Brønsted (Leffler) plots of $ \Delta \Delta {G}_{\ddagger -\mathrm{N}} $ versus $ \Delta \Delta {G}_{\mathrm{D}-\mathrm{N}} $ for CI2 which has a diffuse transition state and barnase which has a polarised one.

Figure 16

Figure 17. CI2 folding from experiment and simulation. An MD unfolding simulation from the native state N to the denatured state(s) D at 225°C shown in reverse. The structures are coloured from red at the N terminus to blue at the C terminus. The transition state is built around an extended nucleus, in which L49 and I57 pack against Ala16 (shown in magenta), towards the N terminus of the α-helix. There is flickering structure around Ala-16 in the denatured state.

Figure 17

Figure 18. Hammond and anti-Hammond behaviour for the folding of a protein. Left top: Conventional Hammond behaviour as the transition state moves closer to the folded state (F) along the reaction coordinate with increasing destabilisation of F. Left: bottom Cross-section of the energy profile perpendicular to the reaction coordinate at the transition state. Anti-Hammond behaviour as the transition state moves closer to the unfolded state in a direction perpendicular to the reaction coordinate on destabilisation of F see (Jencks, 1985). Right: Correlation diagrams of the average degree of folding, say $ {\beta}_{\mathrm{T}} $, for the whole protein and Φ, the degree of formation of the helix, in the transition state. Top right: Average degree of folding in the transition state increases as the transition state moves along the reaction coordinate closer to F as the protein is destabilised by a mutation. Bottom right: Concurrent with the movement of the transition state along the reaction coordinate in the direction of F as the protein is destabilised by a mutation, there is anti-Hammond movement perpendicular to the reaction coordinate that leads to the helix becoming less folded and Φ decreases (Matthews and Fersht, 1995).

Figure 18

Figure 19. Folding pathway of Engrailed Homeodomain (EnHD) from experiment and simulation. From right to left: native state (NS) structure solved by nuclear magnetic resonance and X-ray crystallography; transition state (TS) by Φ-analysis of secondary structure (colour-coded from $ \varPhi =0 $, red, to $ \varPhi =1 $, blue); the folding intermediate (I) stably generated by protein engineering and solved by NMR; the denatured state (U), under conditions that favour folding, simulated using molecular dynamics; and the entire unfolding pathway was simulated by molecular dynamics.

Figure 19

Figure 20. (a) Structures and (b) secondary structure prediction for En-HD (o), c-Myb (×), hRAP1 (♦), and hTRF1 (☐) (Gianni et al.,2003).

Figure 20

Figure 21. The slide from framework to nucleation-condensation (Gianni et al.,2003).

Figure 21

Figure 22. Calculated helical propensities of BBL (red), E3BD (blue), and POB (green) sequences (Neuweiler et al.,2009).

Figure 22

Figure 23. Φ-value analysis of PSBD family members. Top: $ {\varPhi}_{\mathrm{F}} $-values for BBL (red bars), E3BD (blue bars) and POB (green bars). Middle: Sequences aligned with similar residues in boldface. $ {\varPhi}_{\mathrm{F}} $-values are indicated using the colour code at bottom left grouped into ‘low’ (0.0< $ {\varPhi}_{\mathrm{F}} $ <0.3), ‘medium’ (0.3< $ {\varPhi}_{\mathrm{F}} $ <0.6), and ‘high’ (0.6< $ {\varPhi}_{\mathrm{F}} $ ≤1.0), and bottom mapped onto the sequences and native-state structures (modified from Neuweiler et al.,2009).

Figure 23

Figure 24. $ \varPhi \hbox{-} \varPhi $ plots demonstrating the robustness of Φ-values (Gianni and Jemth, 2014). (a) PDZ domains (Gianni et al.,2007; Calosci et al.,2008), (b) Circularly permuted PDZ domain (Ivarsson et al.,2009), (c) circularization of LysM domain (Nickson et al.,2008), (d) tryptophan as a fluorescence probe inserted in turn into each of the three helices of the B-domain of Protein A (Sato et al.,2006), and (e) the spectrin R16 domain with different neighbouring domains (Batey and Clarke, 2008). The P-value is the probability that the two variables are not correlated.

Figure 24

Figure 25. A transition state that is an expanded, distorted, native structure being common to framework and nucleation-condensation mechanisms.

Figure 25

Figure 26. Combining elements of Figures 19 and 21 to illustrate how movement of the expanded transition state on an energy landscape according to the classical principles of physical-organic chemistry unifies the slide between a diffuse nucleation-condensation transition state and the framework mechanism via a polarised transition state. Top: Reaction coordinate diagram for a framework mechanism with preformed secondary structure in a low energy intermediate that slides to nucleation-condensation as the secondary structure becomes less stable and requires tertiary interactions to stabilise it. The transition state can move along and perpendicular to the reaction coordinate according to Hammond and anti-Hammond effects, respectively. Both mechanisms involve an extended network of long-range native-like tertiary interactions in the expanded transition state. Bottom: Correlation diagram of formation of native secondary and tertiary interactions illustrating the above.