Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-08T01:52:39.493Z Has data issue: false hasContentIssue false

Which morphological traits can be used to reconstruct genome size in fossil plants? Assessing sporomorph size and stomatal guard cell length as paleo-genome size proxies

Published online by Cambridge University Press:  02 June 2025

Phillip E. Jardine*
Affiliation:
Institute of Geology and Palaeontology, University of Münster, 48149 Münster, Germany
Hannah Morck
Affiliation:
Institute of Geology and Palaeontology, University of Münster, 48149 Münster, Germany
Barry H. Lomax
Affiliation:
School of Biosciences, University of Nottingham, Sutton Bonington Campus, Leicestershire, LE12 5RD, U.K.
*
Corresponding author: Phillip E. Jardine; Email: jardine@uni-muenster.de

Abstract

Genome size (GS) is thought to be a key life-history trait and important for controlling plant distributions and evolutionary dynamics, but a full understanding of GS variation through evolutionary history requires proxy measurements from fossils. Here, we compare two potential GS proxies: guard cell length (GCL) and sporomorph size. We generated GCL and pollen size data from angiosperms growing in the University of Münster Botanical Garden, compiled sporomorph size data from the literature, and related these to GS using phylogenetic regression models. We also fit evolutionary models to the botanical garden data and used a published dataset to validate GCL as a GS proxy. The majority of the analyses conducted revealed a positive relationship between GS and sporomorph size, but in most cases, the explanatory power of the regressions was low. GCL showed a stronger and more consistent relationship with GS, and independent validation of the relationship showed a generally good match between predicted and observed GS. Sporomorph size is not suitable as a cross-taxon GS proxy, but some specific taxa (e.g., Pinus) may contain useful GS information. GCL has much more potential for measuring paleo-GS, but requires further research for us to better understand possible environmental controls on cell size variation.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Paleontological Society
Figure 0

Table 1. Analyses and datasets used in this study. Numbers of plants and measurements (stomatal guard cell length measurements for leaves, otherwise pollen size) are only given for specimens collected from the University of Münster Botanical Garden, with other data taken from the literature and representing multiple studies and sampling protocols

Figure 1

Figure 1. Phylogeny for the Analysis 1 botanical garden dataset, with maximum-likelihood estimates of log10 genome size (GS) across the phylogeny mapped. GS data for the tips of the phylogeny come from the Kew C-values database (https://cvalues.science.kew.org; Leitch et al. 2019).

Figure 2

Table 2. Phylogenetic signal for the key parameters from the botanical garden dataset (n = 61 in all cases)

Figure 3

Table 3. Regression results for the relationship between each parameter and genome size, for the phylogenetically dependent ordinary least squares (OLS) models, and the phylogenetically independent phylogenetic generalized least squares (P-GLS) models. The first two rows relate to the Analysis 1 botanical garden dataset, while the remainder of the results are for the Analysis 2 clade-level analyses. λ gives the strength of the phylogenetic signal in the regression residuals. AICc, corrected Akaike information criterion

Figure 4

Figure 2. Guard cell length (A) and pollen size (B) for the Analysis 1 botanical garden dataset plotted against genome size. Solid regression lines are from phylogenetic generalized least squares (P-GLS) models, dashed regression lines are from ordinary least squares (OLS) models, and point styles and colors represent main clades (see also Fig. 1).

Figure 5

Figure 3. Sporomorph size vs. genome size for the Analysis 2 clade-level datasets. (A) Fern spore size, with point styles and colors representing subclasses; (B) Pinaceae pollen corpus length, with point styles and colors representing genera (see also Supplementary Fig. 7); (C) Poaceae pollen length, with point styles and colors representing subfamilies; (D) Asteraceae pollen size, with point styles and colors representing subfamilies (see also Supplementary Fig. 8). Solid regression lines are from phylogenetic generalized least squares (P-GLS) models, dashed regression lines are from ordinary least squares (OLS) models.

Figure 6

Table 4. Results from evolutionary models fit to the botanical garden dataset (n = 61 in all cases), comparing Brownian motion (BM), Ornstein-Uhlenbeck (OU), and early burst (EB) models. σ2 = rate of trait evolution; α = strength of attraction to the optimum value in the OU model; σ20 = initial rate of trait evolution; and r = change in the rate of trait evolution in the EB model. AICc, corrected Akaike information criterion

Figure 7

Figure 4. Observed vs. predicted genome size (GS) for the Lomax et al. (2014) guard cell length data, with GS estimated using the (A) phylogenetic generalized least squares (P-GLS) and (B) ordinary least squares (OLS) models from the Analysis 1 botanical garden dataset (see also Fig. 2A). The dotted gray line shows the 1:1 relationship between observed and predicted values. The R2 values and model coefficients are from regressions of observed onto predicted values, with the models shown as solid black lines and associated 95% confidence intervals shown as dashed lines. RMSEP, root-mean-square error of prediction.