Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-03-28T11:19:55.145Z Has data issue: false hasContentIssue false

Annotation-free learning of a spatio-temporal manifold of the cell life cycle

Published online by Cambridge University Press:  06 October 2023

Kristofer delas Peñas*
Affiliation:
Department of Engineering Science, University of Oxford, Oxford, United Kingdom Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom Department of Computer Science, University of the Philippines, Quezon City, Philippines
Mariia Dmitrieva
Affiliation:
Department of Engineering Science, University of Oxford, Oxford, United Kingdom Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
Dominic Waithe
Affiliation:
WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, United Kingdom
Jens Rittscher
Affiliation:
Department of Engineering Science, University of Oxford, Oxford, United Kingdom Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
*
Corresponding author: Kristofer delas Peñas; Email: kristofer.delaspenas@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

The cell cycle is a complex biological phenomenon, which plays an important role in many cell biological processes and disease states. Machine learning is emerging to be a pivotal technique for the study of the cell cycle, resulting in a number of available tools and models for the analysis of the cell cycle. Most, however, heavily rely on expert annotations, prior knowledge of mechanisms, and imaging with several fluorescent markers to train their models. Many are also limited to processing only the spatial information in the cell images. In this work, we describe a different approach based on representation learning to construct a manifold of the cell life cycle. We trained our model such that the representations are learned without exhaustive annotations nor assumptions. Moreover, our model uses microscopy images derived from a single fluorescence channel and utilizes both the spatial and temporal information in these images. We show that even with fewer channels and self-supervision, information relevant to cell cycle analysis such as staging and estimation of cycle duration can still be extracted, which demonstrates the potential of our approach to aid future cell cycle studies and in discovery cell biology to probe and understand novel dynamic systems.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. The VAE-GAN architecture used to construct the manifold of cell morphology. The network is composed of three components: encoder, generator, and discriminator. The encoder maps input images to a 16-dimensional Gaussian with diagonal covariance. The generator produces a reconstruction from sampled points in the latent space. The discriminator forces the generator to output images as similar to the input as possible.

Figure 1

Figure 2. Comparison of constructed manifolds using: (a) VGG, (b) Resnet, (c) DeepCycle, and (d,e) our approach using VAE-GAN. Manifolds from the baseline models (a–c), unlike our VAE-GAN models (d,e), did not produce visually distinct clusters corresponding to cell cycle classes.

Figure 2

Figure 3. Manifolds across 5 training and evaluation runs. While different after each run, the manifolds constructed show visually distinct clusters for the cell cycle stages.

Figure 3

Figure 4. Clusters formed by GMM. Shown on the left are representative raw images from each cluster. Comparing with true labels, the formed clusters roughly correspond to known cell cycle stages: M (magenta), late S (blue), G2 (red), G1 (purple, cyan, light green), and early/mid S (yellow and green). Image size: $ 30\mu m\times 30\mu m $. Contrast and brightness of cell images are enhanced for visualization.

Figure 4

Table 1. Confusion matrix for cell cycle stage classification.

Figure 5

Figure 5. Plotting UMAP1 versus time reveals some key cell cycle stages. Blue line is the smoothed curve using B-splines. Images at timepoints of the crests and troughs of the curve (shown as triangles $ \varDelta \nabla $) are shown and correspond to the mitosis, G1, early S, late S stages in the cycle. Curve inflection points (shown as stars $ \star $) where transitions roughly occur are also annotated.

Figure 6

Figure 6. Plotting UMAP2 versus time reveals the oscillating nature of the cell cycle. Blue line is the 10-frame running average. Cycles are marked end to end by sudden drops in the running average curve.