Hostname: page-component-89b8bd64d-rbxfs Total loading time: 0 Render date: 2026-05-06T13:04:14.368Z Has data issue: false hasContentIssue false

Refinement of AlphaFold2 models against experimental and hybrid cryo-EM density maps

Published online by Cambridge University Press:  20 September 2022

Maytha Alshammari
Affiliation:
Department of Computer Science, Old Dominion University, Norfolk, VA, USA
Willy Wriggers*
Affiliation:
Department of Mechanical and Aerospace Engineering, Old Dominion University, Norfolk, VA, USA
Jiangwen Sun
Affiliation:
Department of Computer Science, Old Dominion University, Norfolk, VA, USA
Jing He
Affiliation:
Department of Computer Science, Old Dominion University, Norfolk, VA, USA
*
*Author for correspondence: Willy Wriggers, E-mail: wriggers@biomachina.org
Rights & Permissions [Opens in a new window]

Abstract

Recent breakthroughs in deep learning-based protein structure prediction show that it is possible to obtain highly accurate models for a wide range of difficult protein targets for which only the amino acid sequence is known. The availability of accurately predicted models from sequences can potentially revolutionise many modelling approaches in structural biology, including the interpretation of cryo-EM density maps. Although atomic structures can be readily solved from cryo-EM maps of better than 4 Å resolution, it is still challenging to determine accurate models from lower-resolution density maps. Here, we report on the benefits of models predicted by AlphaFold2 (the best-performing structure prediction method at CASP14) on cryo-EM refinement using the Phenix refinement suite for AlphaFold2 models. To study the robustness of model refinement at a lower resolution of interest, we introduced hybrid maps (i.e. experimental cryo-EM maps) filtered to lower resolutions by real-space convolution. The AlphaFold2 models were refined to attain good accuracies above 0.8 TM scores for 9 of the 13 cryo-EM maps. TM scores improved for AlphaFold2 models refined against all 13 cryo-EM maps of better than 4.5 Å resolution, 8 hybrid maps of 6 Å resolution, and 3 hybrid maps of 8 Å resolution. The results show that it is possible (at least with the Phenix protocol) to extend the refinement success below 4.5 Å resolution. We even found isolated cases in which resolution lowering was slightly beneficial for refinement, suggesting that high-resolution cryo-EM maps might sometimes trap AlphaFold2 models in local optima.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NoDerivatives licence (http://creativecommons.org/licenses/by-nd/4.0), which permits re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press
Figure 0

Table 1. Accuracy of models before and after refinement using high-resolution cryo-EM maps and hybrid density maps of 5, 6, and 8 Å resolutions

Figure 1

Fig. 1. Models obtained from AlphaFold2 and the refinements using Cryo-EM map 23274–7LCI-R (EMDB-PDB-chain ID) and hybrid density maps at 6 and 8 Å resolutions. (a) Superposition of the protein structure (red, chain R of 7LCI) and the model obtained from AlphaFold2. (b1) The box-cropped region of cryo-EM map 23274 (EMDB ID, cyan) superimposed with the model (blue) refined using Phenix and the cryo-EM map. (b2) Superposition of the structure (red, chain R of 7LCI) and the refined model (blue) using Phenix and the box-cropped cryo-EM map in b1. Hybrid density maps of 6 Å (grey in c1) and 8 Å (yellow in d1) resolutions are superimposed with the corresponding models refined from the maps, respectively. The 6 Å-map-refined model (Cyan ribbon in c1, c2) and 8 Å-map-refined model (green in d1, d2) are superimposed with the structure (red) in c2 and d2. The superposition of two atomic models was performed with TM-align (Zhang and Skolnick, 2005) in all figures. The superposition of a density map and a model was performed using Phenix (Terwilliger et al.,2022) in all figures. An example of a weaker density region in the cryo-EM map and in the corresponding hybrid maps is indicated by an ellipse in b1, c1, and d1.

Figure 2

Fig. 2. Models obtained from AlphaFold2 and the refinements using Cryo-EM map 12183–7BGL-A-T1047S1D1 (EMDB-PDB-chain ID-CASP14 target ID) and hybrid density maps at 6 and 8 Å resolutions. (a) Superposition of the protein structure (red, chain A of 7BGL) and the model obtained from AlphaFold2 (yellow). This chain is one of the free modelling targets in CASP14 with ID T1047S1D1. (b1) The box-cropped region of cryo-EM map 12183 (EMDB ID, cyan) superimposed with the model (blue) refined using Phenix and the cryo-EM map. (b2) Superposition of the structure (red, chain A of 7BGL) and the refined model (blue) using Phenix and the box-cropped cryo-EM map in b1. Hybrid maps of 6 Å (grey in c1) and 8 Å (yellow in d1) resolutions are superimposed with the model refined from the corresponding map. The 6 Å-map-refined model (Cyan ribbon in c1, c2) and 8 Å-map-refined model (green in d1, d2) are superimposed with the structure (red) in c2 and d2.

Figure 3

Fig. 3. Predicted models using AlphaFold2 for 7LV9-B and 7L6U-A (PDB ID–Chain ID). The structures (red) and models predicted using AlphaFold2 (yellow) are superimposed for chain B of 7LV9 (a) and chain A of 7L6U (b). See the Supplementary Material for more details about the two cases.

Figure 4

Fig. 4. Accuracy of models measured using TM-align. The TM score of each model was calculated against the protein structure downloaded from the PDB for 13 cases. In each case, the accuracy is shown from left to right for the model obtained using AlphaFold2 (black), refinement using Phenix and cryo-EM maps (green), refinement using hybrid map of 5 Å (red), 6 Å (blue), 8 Å (yellow), 10 Å (grey), and 12 Å (light blue) resolutions.

Figure 5

Fig. 5. The intersecondary structure geometry for long helices in the predicted and refined models of 7KZZ chain B. (a1) The superposition of the protein structure (red, chain B of 7KZZ) and the model obtained from AlphaFold2 (yellow). (a2) Secondary structures of those superimposed models in a1 are represented by their central axes using AxisComparison (Haslam et al.,2018); The central axes of helices (red) and beta-strand (green) in the structure; the central axes of helices (yellow) and beta-strands (black) in the model obtained from AlphaFold2. (a3) The axes of three consecutive long helices (H5, H6, and H7) of the structure are overlaid with the corresponding axes of three helices (H4, H5, and H6) of the model using two vectors, the vector of the central axes between Trp168 and Ala196, and the vector of the turn between Trp168 and Tyr165. Amino acids are labelled at the start and end of a helix. (b1) The box-cropped region of cryo-EM map 23093 (EMDB ID, yellow) superimposed with the model (blue) refined using Phenix and the cryo-EM map. (b2) Superposition of the structure (red) and the refined model (blue) obtained using Phenix and the box-cropped cryo-EM map in b1. (c1) Box-cropped hybrid density map of 6 Å resolution (grey in c1) superimposed with the model refined from it. (c2) Superposition of the structure (red) and the model (cyan) refined using Phenix and the box-cropped cryo-EM map of 6 Å resolution. Annotation of secondary structures and molecular graphics was conducted with ChimeraX (Pettersen et al.,2021).

Figure 6

Fig. 6. Secondary structure regions detected from the box-cropped hybrid map of 8 Å resolution for 23274–7LCI-R (EMDB-PDB-chain ID). The helix (yellow) and β-sheet (cyan) regions in (a) and (b) were segmented from the hybrid map at 8 Å using the DeepSSETracer (Mu et al.,2021). The model obtained from AlphaFold2 is coloured by the secondary structure type for helices (orange), β-sheet (cyan), coil (grey) in (c, b), and superimposed in (b). The atomic structure (red) and the model refined (green) from the hybrid map are shown in (d) and (e), respectively.

Supplementary material: PDF

Alshammari et al. supplementary material

Alshammari et al. supplementary material

Download Alshammari et al. supplementary material(PDF)
PDF 1.3 MB

Review: Refinement of AlphaFold2 Models against Experimental and Hybrid Cryo-EM Density Maps — R0/PR1

Conflict of interest statement

Reviewer declares none.

Comments

Comments to Author: Alshammari et al. set to test whether AlphaFold2 models can be refined in the medium resolution cryo-EM maps (5-12 A). The test follows a recent work by Terwilliger et al. who implemented a refinement protocol in the Phenix program, which successfully refines AlphaFold2 models in high-resolution maps (better than 4.5 A). The Phenix protocol applies an iterative procedure of four cycles, in each AlphaFold2 models are fitted to the cryo-EM map, refined using a Phenix refinement, and then the resulting refined models are fed back to AlphaFold2 for another round of modeling. For their test, Alshammari et al. have run a single cycle of the procedure. To address the limited number of accurate structures for low-resolution EM maps, the authors used a good idea of "hybrid density maps", which are high-resolution maps blurred with a Gaussian, lowering the resolution but, supposedly (see below), preserving some of the noise and imperfections present in the original maps. Altogether, they found that the single cycle of the Phenix refinement can improve some models up to 6 A resolution. They explore examples and point out some of the specific reasons why refinement can fail.

The manuscript reads well and the study has been overall well conduced (but see below). For detailed claims like 4 A vs 6A resolution limit, one would like to see a bigger benchmark that 13 cases.

In contrast to the original Phenix procedure, where refined intermediate models are fed back to Alphafold2 so that the EM map and AlphaFold2 work together to improve the models, here only one cycle is run. With one iteration - so no moving back to AlphaFold - the authors test only the Phenix refinement procedure but not new procedure by Terwilliger et al. With four cycles, results might be even better pushing the resolution limit beyond 6 A or increasing success rate at 5-6 A. That said, this would not change the conclusions of this study, so repeating with four cycles does not seem necessary.

Overall, while the study represents merely a test of an existing method (or one fourth of it) on a small benchmark, the addressed question is important for the field and the manuscript can be useful and timely reference, and can inspire new developments.

Major comments:

- The authors simulate densities by first converting a high-resolution map to pseudo-atomic beads using vol2pdb and then convert the beads to a map at desired resolution. Why not blurring the map directly with a Gaussian? The authors say that their procedure "incorporates any quality variation within the parent high-resolution cryo-EM map into the hybrid map, resulting in a more realistic low-resolution density model" but are those "quality variations" preserved in the intermediate bead model? It must be tested or explained that the intermediate bead models preserve "quality variations" or those statements should be removed from the manuscript.

- I am not sure from the provided description whether the blurring with the intermediate bead models produces good estimation of the resolution of the resulting maps - is 6A from this "bead" procedure equivalent to a 6A map blurred directly? This equivalence must be demonstrated if authors want to claim absolute thresholds. Or, the work could be repeated with maps blurred directly, which might be better in general.

Minor comments:

- Abstract: "Resolutions better than 4.5A were reached" - resolution of models? What does this sentence mean?

- Introduction: The authors list how many PDB structures have been deposited in different resolution ranges but it would be much more informative to assess the impact of their work if they list how many cryoEM maps were deposited in these ranges, even if, or especially if, the PDB model was not deposited.

- In 2.2, the (letters?) in the formulas rendered as squares in my PDF, I assume they are fine and I could understand them, but should be made sure by the editors that it is corrected.

- Table 1: could you add columns with average pLDDT and pTM for comparison. Does the success correlate with pLDDT and pTM as it does with TM? It would be useful for the readers to assess to what extent they can rely on the predicted scores in assessing whether the refinement might succeed.

Figure 3 - why the EM map and refined model are not shown like in the other figures?

Review: Refinement of AlphaFold2 Models against Experimental and Hybrid Cryo-EM Density Maps — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

Comments to Author: In the presented work, the authors have reported the utility of deep learning-based protein structure prediction approaches, precisely of AlphaFold2, in the refinement of medium-to-low resolution cryo-EM maps. This work thoroughly discussed the dependencies, as well as the strengths and limitations, of integrating the experimental & the AI-based approach to building accurate models even from not-so-well resolved density maps (even as low as 6Å). Interestingly, the authors have suggested that refinement using medium-to-low resolution cryo maps might be beneficial towards escaping the risk of conformational trapping at local minima. In doing so, they have proposed a new approach to generate low-resolution maps from high-resolution experimentally reported maps instead of generating maps from atomic structures as has been traditionally done. Overall, this work presents a insightful discussion for the community actively investigating efficient approaches to mitigate the scarcity of high-resolution structural models.

However, a few significant concerns are not clarified in the present manuscript.

1. (i) Here the authors have proposed "a novel hybrid experimental-simulated density map" generation approach to get low-resolution maps from the high-resolution experimental map as the basis instead of using atomic models. However, the manuscript states "the high-resolution cryo-EM map was first converted to an intermediate PDB file using the Situs vol2pdb tool, with each voxel represented as a pseudoatom. Each pseudoatom was then convoluted with a Gaussian filter using a modified version of pdb2vol, with a filter size determined by the desired resolution of the hybrid map". So, here also the high-resolution cryo maps are first converted to an atomic model and then converted back to density maps of desired resolution. How is this different than generating a density map directly from an atomistic model as has been practiced traditionally? The benefits of the newly proposed hybrid approach, as has been claimed in the present manuscript, are unclear by the current writing.

(ii) Unavailability of notations in section 2.2 "Hybrid Experimental-Simulated Density Maps", makes it hard to understand the current implementation. Please elaborate as required.

2. (i) The present work has reported that the benefits of AlphaFold2 can still be obtained in the "twilight zone" (4-6Å) of experimentally resolved maps. However, refining low-resolution maps with satisfactory accuracy is not achieved. A comment on how this problem can be handled whether by integrating additional experimental data or by some other hybrid statistical approaches would be an interesting discussion.

(ii) It seems that for 8Å structures, without density refined AF2 model qualities are better than refined ones. Can authors comment?

There are a few minor concerns listed below:

1. In the method section, the last paragraph explains the generation of the density map for refinement. However, the write-up is more like a reference to the Phenix documentation rather than a comprehensive general description of the workflow. A more inclusive approach would be much appreciated by a broader/curious audience.

2. "... a more exhaustive sampling of conformations might be required, and that lowering resolution could be part of an annealing strategy to escape local traps. This is yet another argument as to why it could make sense to develop a lower resolution refinement strategy even for high-resolution maps". The motivation of refining an AF2 model with low-resolution maps even though high-resolution maps are available is better to be introduced or emphasized in the beginning as a clear motivation for the present work.

3. Please make the reference list complete and consistent.

Recommendation: Refinement of AlphaFold2 Models against Experimental and Hybrid Cryo-EM Density Maps — R0/PR3

Comments

Comments to Author: Reviewer #1: Alshammari et al. set to test whether AlphaFold2 models can be refined in the medium resolution cryo-EM maps (5-12 A). The test follows a recent work by Terwilliger et al. who implemented a refinement protocol in the Phenix program, which successfully refines AlphaFold2 models in high-resolution maps (better than 4.5 A). The Phenix protocol applies an iterative procedure of four cycles, in each AlphaFold2 models are fitted to the cryo-EM map, refined using a Phenix refinement, and then the resulting refined models are fed back to AlphaFold2 for another round of modeling. For their test, Alshammari et al. have run a single cycle of the procedure. To address the limited number of accurate structures for low-resolution EM maps, the authors used a good idea of "hybrid density maps", which are high-resolution maps blurred with a Gaussian, lowering the resolution but, supposedly (see below), preserving some of the noise and imperfections present in the original maps. Altogether, they found that the single cycle of the Phenix refinement can improve some models up to 6 A resolution. They explore examples and point out some of the specific reasons why refinement can fail.

The manuscript reads well and the study has been overall well conduced (but see below). For detailed claims like 4 A vs 6A resolution limit, one would like to see a bigger benchmark that 13 cases.

In contrast to the original Phenix procedure, where refined intermediate models are fed back to Alphafold2 so that the EM map and AlphaFold2 work together to improve the models, here only one cycle is run. With one iteration - so no moving back to AlphaFold - the authors test only the Phenix refinement procedure but not new procedure by Terwilliger et al. With four cycles, results might be even better pushing the resolution limit beyond 6 A or increasing success rate at 5-6 A. That said, this would not change the conclusions of this study, so repeating with four cycles does not seem necessary.

Overall, while the study represents merely a test of an existing method (or one fourth of it) on a small benchmark, the addressed question is important for the field and the manuscript can be useful and timely reference, and can inspire new developments.

Major comments:

- The authors simulate densities by first converting a high-resolution map to pseudo-atomic beads using vol2pdb and then convert the beads to a map at desired resolution. Why not blurring the map directly with a Gaussian? The authors say that their procedure "incorporates any quality variation within the parent high-resolution cryo-EM map into the hybrid map, resulting in a more realistic low-resolution density model" but are those "quality variations" preserved in the intermediate bead model? It must be tested or explained that the intermediate bead models preserve "quality variations" or those statements should be removed from the manuscript.

- I am not sure from the provided description whether the blurring with the intermediate bead models produces good estimation of the resolution of the resulting maps - is 6A from this "bead" procedure equivalent to a 6A map blurred directly? This equivalence must be demonstrated if authors want to claim absolute thresholds. Or, the work could be repeated with maps blurred directly, which might be better in general.

Minor comments:

- Abstract: "Resolutions better than 4.5A were reached" - resolution of models? What does this sentence mean?

- Introduction: The authors list how many PDB structures have been deposited in different resolution ranges but it would be much more informative to assess the impact of their work if they list how many cryoEM maps were deposited in these ranges, even if, or especially if, the PDB model was not deposited.

- In 2.2, the (letters?) in the formulas rendered as squares in my PDF, I assume they are fine and I could understand them, but should be made sure by the editors that it is corrected.

- Table 1: could you add columns with average pLDDT and pTM for comparison. Does the success correlate with pLDDT and pTM as it does with TM? It would be useful for the readers to assess to what extent they can rely on the predicted scores in assessing whether the refinement might succeed.

Figure 3 - why the EM map and refined model are not shown like in the other figures?

Reviewer #3: In the presented work, the authors have reported the utility of deep learning-based protein structure prediction approaches, precisely of AlphaFold2, in the refinement of medium-to-low resolution cryo-EM maps. This work thoroughly discussed the dependencies, as well as the strengths and limitations, of integrating the experimental & the AI-based approach to building accurate models even from not-so-well resolved density maps (even as low as 6Å). Interestingly, the authors have suggested that refinement using medium-to-low resolution cryo maps might be beneficial towards escaping the risk of conformational trapping at local minima. In doing so, they have proposed a new approach to generate low-resolution maps from high-resolution experimentally reported maps instead of generating maps from atomic structures as has been traditionally done. Overall, this work presents a insightful discussion for the community actively investigating efficient approaches to mitigate the scarcity of high-resolution structural models.

However, a few significant concerns are not clarified in the present manuscript.

1. (i) Here the authors have proposed "a novel hybrid experimental-simulated density map" generation approach to get low-resolution maps from the high-resolution experimental map as the basis instead of using atomic models. However, the manuscript states "the high-resolution cryo-EM map was first converted to an intermediate PDB file using the Situs vol2pdb tool, with each voxel represented as a pseudoatom. Each pseudoatom was then convoluted with a Gaussian filter using a modified version of pdb2vol, with a filter size determined by the desired resolution of the hybrid map". So, here also the high-resolution cryo maps are first converted to an atomic model and then converted back to density maps of desired resolution. How is this different than generating a density map directly from an atomistic model as has been practiced traditionally? The benefits of the newly proposed hybrid approach, as has been claimed in the present manuscript, are unclear by the current writing.

(ii) Unavailability of notations in section 2.2 "Hybrid Experimental-Simulated Density Maps", makes it hard to understand the current implementation. Please elaborate as required.

2. (i) The present work has reported that the benefits of AlphaFold2 can still be obtained in the "twilight zone" (4-6Å) of experimentally resolved maps. However, refining low-resolution maps with satisfactory accuracy is not achieved. A comment on how this problem can be handled whether by integrating additional experimental data or by some other hybrid statistical approaches would be an interesting discussion.

(ii) It seems that for 8Å structures, without density refined AF2 model qualities are better than refined ones. Can authors comment?

There are a few minor concerns listed below:

1. In the method section, the last paragraph explains the generation of the density map for refinement. However, the write-up is more like a reference to the Phenix documentation rather than a comprehensive general description of the workflow. A more inclusive approach would be much appreciated by a broader/curious audience.

2. "... a more exhaustive sampling of conformations might be required, and that lowering resolution could be part of an annealing strategy to escape local traps. This is yet another argument as to why it could make sense to develop a lower resolution refinement strategy even for high-resolution maps". The motivation of refining an AF2 model with low-resolution maps even though high-resolution maps are available is better to be introduced or emphasized in the beginning as a clear motivation for the present work.

3. Please make the reference list complete and consistent.

Recommendation: Refinement of AlphaFold2 Models against Experimental and Hybrid Cryo-EM Density Maps — R1/PR4

Comments

No accompanying comment.

Recommendation: Refinement of AlphaFold2 Models against Experimental and Hybrid Cryo-EM Density Maps — R2/PR5

Comments

No accompanying comment.