Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-07T03:06:21.311Z Has data issue: false hasContentIssue false

Algebraic constraints and algorithms for common lines in cryo-EM

Published online by Cambridge University Press:  16 May 2024

Tommi Muller*
Affiliation:
Mathematical Institute, University of Oxford, Oxford, UK
Adriana L. Duncan
Affiliation:
Department of Mathematics, University of Texas at Austin, Austin, TX, USA
Eric J. Verbeke
Affiliation:
Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA
Joe Kileel
Affiliation:
Department of Mathematics, University of Texas at Austin, Austin, TX, USA Oden Institute, University of Texas at Austin, Austin, TX, USA
*
Corresponding author: Tommi Muller; Email: tommi.muller@maths.ox.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

We revisit the topic of common lines between projection images in single-particle cryo-electron microscopy (cryo-EM). We derive a novel low-rank constraint on a certain 2n × n matrix storing properly scaled basis vectors for the common lines between n projection images of one molecular conformation. Using this algebraic constraint and others, we give optimization algorithms to denoise common lines and recover the unknown 3D rotations associated with the images. As an application, we develop a clustering algorithm to partition a set of noisy images into homogeneous communities using common lines, in the case of discrete heterogeneity in cryo-EM. We demonstrate the methods on synthetic and experimental datasets.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. A heterogeneous common lines matrix and rank test for simulated rotations from two distinct molecules. Block-diagonals comparing rotations from the same molecule show rank-3 structure.

Figure 1

Figure 2. Algorithm for separating images of distinct molecules using algebraic constraints on common lines. The common lines matrix is first computed from an input set of images or class averages. We then apply Algorithm 4, our clustering algorithm. After clustering, images corresponding to the same molecule can then be used for 3D reconstruction.

Figure 2

Figure 3. 3-D structures and example projection images for the three structures used for simulation. (a) 80S ribosome (EMD-2858) and example projection images. (b) 60S ribosome (EMD-2811) and example projection images. (c) 40S ribosome (EMD-4214) and example projection images.

Figure 3

Table 1. The average rotation recovery error from 30 simulated images of macromolecules at various SNR, 50 runs each

Figure 4

Table 2. The average denoising error of recovered pure common lines matrices from 30 simulated images of macromolecules at various SNR, 50 runs each

Figure 5

Figure 4. Clustering results for $ n=50 $ simulated images with SNR = 5. Images are size $ 128\times 128 $ with pixel size of 3 Å, and are colored according to the ground truth labels. Using Clusters achieves ARI = 0.8581 and only one pair of images are incorrectly clustered.

Figure 6

Figure 5. Clustering results for $ n=75 $ 2D class averages from EMPIAR-10268 computed as described in (7). Images are size $ 96\times 96 $ with a pixel size of 4.4 Å, and are colored according to the ground truth labels. Using Clusters achieves ARI = 0.8440 and only three images are incorrectly clustered.