Hostname: page-component-6766d58669-7fx5l Total loading time: 0 Render date: 2026-05-21T14:07:47.232Z Has data issue: false hasContentIssue false

Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue

Published online by Cambridge University Press:  23 December 2024

Julian Wanner
Affiliation:
Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany Hasso Plattner Institute, University of Potsdam, Germany Finnish Institute for Molecular Medicine (FIMM), University of Helsinki, Helsinki, Finland
Luis Kuhn Cuellar
Affiliation:
Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
Luiselotte Rausch
Affiliation:
Center for Plant Molecular Biology (ZMBP), University of Tübingen, Tübingen, Germany
Kenneth W. Berendzen
Affiliation:
Center for Plant Molecular Biology (ZMBP), University of Tübingen, Tübingen, Germany
Friederike Wanke
Affiliation:
Center for Plant Molecular Biology (ZMBP), University of Tübingen, Tübingen, Germany
Gisela Gabernet
Affiliation:
Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
Klaus Harter*
Affiliation:
Center for Plant Molecular Biology (ZMBP), University of Tübingen, Tübingen, Germany
Sven Nahnsen*
Affiliation:
Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
*
Corresponding authors: Klaus Harter and Sven Nahnsen; Emails: klaus.harter@zmbp.uni-tuebingen.de; sven.nahnsen@uni-tuebingen.de
Corresponding authors: Klaus Harter and Sven Nahnsen; Emails: klaus.harter@zmbp.uni-tuebingen.de; sven.nahnsen@uni-tuebingen.de

Abstract

Hormonal mechanisms associated with cell elongation play a vital role in the development and growth of plants. Here, we report Nextflow-root (nf-root), a novel best-practice pipeline for deep-learning-based analysis of fluorescence microscopy images of plant root tissue from A. thaliana. This bioinformatics pipeline performs automatic identification of developmental zones in root tissue images. This also includes apoplastic pH measurements, which is useful for modeling hormone signaling and cell physiological responses. We show that this nf-core standard-based pipeline successfully automates tissue zone segmentation and is both high-throughput and highly reproducible. In short, a deep-learning module deploys deterministically trained convolutional neural network models and augments the segmentation predictions with measures of prediction uncertainty and model interpretability, while aiming to facilitate result interpretation and verification by experienced plant biologists. We observed a high statistical similarity between the manually generated results and the output of the nf-root.

Information

Type
Original Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press in association with John Innes Centre
Figure 0

Figure 1. Ratiomeric pH analysis and the PHDFM dataset. (a) Diagram of the ratiomeric pH analysis for FM images from root tissue samples labeled with the HPTS marker. The input images typically have four channels (two brightfield and two fluorescence channels) to accommodate HPTS data. Manual annotation of ROIs is a time-consuming step, it creates the main bottleneck for large-scale data processing and precludes full automation of the complete analysis pipeline. (b) Diagram depicting the procedure used to create the dataset. An OMERO server was used as a remote collaboration hub between plant biologists and bioinformaticians to create a semantic segmentation dataset. (c) Frequency of annotated pixels per segmentation class in the dataset. (d) Distribution of the number of images containing pixel labels of each segmentation class. (e) Representative brightfield channel (bf-405nm) of FM images (top row) and corresponding labels of the PHDFM dataset (bottom row). Segmentation masks are depicted with color-coded tissue classes, showing five classes: background (blue), root tissue (yellow), LEZ (brown), EEZ (green), and MZ (purple). Scale bars = 53.14 μm.

Figure 1

Figure 2. Qualitative and quantitative performance results of the U-Net^2 model, with an assessment of deterministic training. (a) Model predictions and ground truth show a high similarity, missing predictions are sometimes obtained when there are multiple labels in the ground truth. (b) Letter-value plot (Hofmann et al., 2017) of standard deviation values (STD) of U-Net^2 model parameters (weights and biases) across training runs (10 training runs per setup, n=10), the standard deviation of all 44.04 million trainable parameters was calculated for the Random (without random seed or deterministic settings) and Deterministic training setups (specified all random seeds and forced deterministic algorithms) (Heumos et al., 2023). (c) Boxplot of IoU performance on the test dataset (mean IoU of all images per class), after the training reproducibility test (n=10), this metric shows a large variance for all classes besides the background while using a non-deterministic setup and zero variance in all classes while using the deterministic setup, demonstrating full deterministic output of the training process. Scale bars = 53.14 μm.

Figure 2

Figure 3. Representative samples of U-Net^2 segmentation predictions and their corresponding uncertainty maps. Predictions for images from a small unlabeled dataset. The uncertainty values are the pixel-wise, standard deviation values (STD) of the softmax output from the U-Net^2 model, as calculated using Monte Carlo Dropout. Uncertainty maps were calculated using Monte Carlo Dropout, with T=10 stochastic forward passes through the trained U-Net^2 model, and dropout applied before each convolutional layer in the model (dropout rate = 0.5). Pixels displayed in bright yellow relate to high uncertainty while pixels in dark blue represent low uncertainty. Scale bars = 53.14 μm.

Figure 3

Figure 4. Representative samples of U-Net^2 segmentation predictions with their corresponding interpretability maps. Predictions for images from a small unlabeled dataset. Interpretability maps were calculated using Guided Grad-CAM, targeting the most relevant segmentation classes (Background, Root tissue, EEZ, LEZ, and MZ) for each image. Pixels in bright orange are highly important for the prediction of the target class (high importance score), and pixels in dark gray are associated with low feature importance for prediction. Scale bars = 53.14 μm.

Figure 4

Figure 5. Quantitative evaluation of the nf-root pipeline. (a) Schematic of the nf-root workflow for the processing of A. thaliana image data. Inputs and outputs are denoted by dashed arrows, and surrounding rounded rectangles show docker containers for the respective task (N = 80). (b, c) Comparison of ratiomeric measurements between manual and automated analysis. Results from manual ROI annotation and data analysis by an independent plant biologist are shown in yellow (human), while automatic segmentation and analysis with our U-Net^2 model and the nf-root pipeline are shown in blue. Panel (b) shows ratio value statistics in the MZ, while (c) shows the corresponding values for EEZ, the most relevant zones for fast-response pathway analysis.

Supplementary material: File

Wanner et al. supplementary material

Wanner et al. supplementary material
Download Wanner et al. supplementary material(File)
File 17.6 MB

Author comment: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R0/PR1

Comments

Dear Professor Hamant, dear editorial office,

We are very pleased to send you our manuscript “nf-root: a best-practice pipeline for deep learning-based analysis of apoplastic pH in microscopy images of developmental zones in plant root tissue”. On behalf of my co-authors, Julian Wanner, Luis Kuhn Cuellar, Luiselotte Rausch, Kenneth W. Berendzen, Friederike Wanke, Gisela Gabernet and Klaus Harter, we kindly ask for the consideration of the manuscript in your journal “Quantitative Plant Biology”. In parallel the manuscript has been submitted to the preprint repository biorxiv.

This manuscript provides a best-practice pipeline to meet the growing demand for reproducibility and interpretability of results from deep learning-based image analysis. While we provide a specific solution for the analysis of developmental zones in A. thaliana root tissue, the pipeline is easily extendable to many other biomedical imaging problems in the life sciences.

The nf-root pipeline combines a deterministically trained convolutional neural network model and interpretable segmentation predictions into one easy-to-use nf-core Nextflow pipeline, which automates the identification of developmental zones in fluorescence microscopy images of A. thaliana root tissue and the analysis of apoplastic pH measurements of tissue zones. We demonstrate that the reproducible results of nf-root are similar to those obtained by an experienced plant biologist, while being magnitudes faster.

Due to other high impact publications in journals of Cambridge University Press concerning the topic, we are certain that our best-practice pipeline is of high interest to the readership of Quantitative Plant Biology. As recently highlighted in a Nature methods comment1, trustworthy standards for machine learning in the life sciences include the easy reproducibility of results and accelerated research progress. To the best of our knowledge, nf-root is the first solution to automatically segment and analyze A. thaliana root tissue, that is in addition fully reproducible and interpretable. As nf-root is easily extendable it allows readers of Quantitative Plant Biology to adapt our proposed best practices to receive more trustworthy results. We therefore consider this manuscript suitable for publication in Quantitative Plant Biology.

Thank you for your consideration! Sincerely,

Professor Dr. Sven Nahnsen

Recommendation: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R0/PR2

Comments

Thanks for your submission to QPB. It’s now been reviewed by two independent reviewer sets -- one reviewer and one pair of reviewers working together, and I’ve taken a look too. To me this feels like a careful and deep exploration of a potentially useful tool and with some changes to the manuscript and, most importantly, the software implementation, it could be a valuable tool.

However, one reviewer raises the important concern that the software as it stands doesn’t work. Given the admirable focus on computational good practise and reproducibility, this is a critical fix. I’m therefore recommending major revisions -- though hopefully fixing these bugs won’t require substantial effort.

The reviewers also raise some points at the manuscript level that should be addressed in a round of revision. I also had the below comments (which may overlap).

The abstract in the ms and in the online system are different, perhaps due to an imposed word limit? If you are satisfied with the version that obeys this limit please include that in the manuscript.

The first intro paragraph sets up the knowledge gap in a slightly roundabout way which perhaps sells short the general applicability of this approach. It will presumably be of use not just for further development of a particular mathematical model, but also for lots of associated research.

Several citations are oddly formatted e.g. Chen & Johansson, n.d.; Grande et al., 22--24 Jun 2014. If the references can be systematised it will make future steps smoother.

Why were the particular limits for scaling and shifting (51 and 26 pixels) chosen? Won’t this behaviour depend on the resolution / size of the images?

Fig 2b -- it’s hard to interpret these STD values without an idea of the typical parameter scale. Are parameters, for example, typically on the unit interval? Similarly with the scale of the colourmaps in Fig 3. What does an uncertainty of 0.1 actually correspond to? 10% realtive error perhaps?

What are the priors in the Bayesian analysis, and what influence do they have?

In Fig 5 would it be possible to have a side-by-side comparison of a “good” result (pipeline matches human) and a “bad” result (mismatch) so the reader has an idea of the scale of differences involved?

Decision: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R0/PR3

Comments

No accompanying comment.

Author comment: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R1/PR4

Comments

No accompanying comment.

Recommendation: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R1/PR5

Comments

Thanks for your revised submission, which I and the reviewers agree has fixed several bugs and cleared up most questions. The software now appears to be functional, and useful! I am happy to recommend acceptance contingent on a couple of remaining additions which shouldn’t take more than a few minutes to include. In addition, one reviewer has some further suggestions for the code that could be included in future updates.

A few questions from the previous round didn’t lead to edits in the manuscript. The response letter gave a good explanation for the particular scaling and shifting used -- please put this explanation in the manuscript (informed selection following empirical investigation is a perfectly valid answer).

In the last round I requested a comparison between an image where the pipeline performs well and one where it performs poorly. As the authors respond, the former case is in Fig 2a. But I (and a reviewer) would still like an example of less-good performance. Certainly the statistics are presented in Fig 5b-c (as in the response) but the point of this question is to help the reader understand what these values actually look like in a real image. Including an example image or two, like Fig 2a but for a poor-quality results, would immediately do this -- it could be a supplementary image if you don’t want to break the flow of the existing ms.

All the best,

Iain

Decision: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R1/PR6

Comments

No accompanying comment.

Author comment: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R2/PR7

Comments

No accompanying comment.

Recommendation: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R2/PR8

Comments

Great -- I think this important detail and context is now suitably included, reviewer comments have been addressed, and am happy to recommend acceptance. Thanks for working with QPB!

Decision: Nf-Root: A Best-Practice Pipeline for Deep-Learning-Based Analysis of Apoplastic pH in Microscopy Images of Developmental Zones in Plant Root Tissue — R2/PR9

Comments

No accompanying comment.