Hostname: page-component-6766d58669-zlvph Total loading time: 0 Render date: 2026-05-15T23:57:44.488Z Has data issue: false hasContentIssue false

Selecting and averaging relaxed clock models in Bayesian tip dating of Mesozoic birds

Published online by Cambridge University Press:  17 December 2021

Abstract

Relaxed clock models are fundamental in Bayesian clock dating, but a single distribution characterizing the clock variation is typically selected. Hence, I developed a new reversible-jump Markov chain Monte Carlo (rjMCMC) algorithm for drawing posterior samples between the independent lognormal (ILN) and independent gamma rates (IGR) clock models. The ability of the rjMCMC algorithm to infer the true model was verified through simulations. I then applied the algorithm to the Mesozoic bird data previously analyzed under the white noise (WN) clock model. In comparison, averaging over the ILN and IGR models provided more reliable estimates of the divergence times and evolutionary rates. The ILN model showed slightly better fit than the IGR model and much better fit than the autocorrelated lognormal (ALN) clock model. When the data were partitioned, different partitions showed heterogeneous model fit for ILN and IGR clocks. The implementation provides a general framework for selecting and averaging relaxed clock models in Bayesian dating analyses.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press on behalf of The Paleontological Society
Figure 0

Figure 1. Probability densities of gamma and lognormal distributions under mean 1.0 and variance 0.1 (A), 0.5 (B), 1.0 (C), and 2.0 (D), and probability density of N(1, 1) distribution (truncated at 0) (C). Note that the Gamma(1, 1) distribution (α = 1) is Exp(1) distribution (C).

Figure 1

Figure 2. Model probabilities estimated by the reversible-jump Markov chain Monte Carlo (rjMCMC) algorithm. Nine sets of rates were simulated under distributions in Fig. 1 given each tree, and for each tree with rates, data matrices with 100 characters (A), 300 characters (B), and 500 characters (C) were simulated for inference. The model probabilities were also inferred when the trees and rates were fixed to the simulated values (no data) (D). When the rate-generating distribution was gamma, the posterior probabilities of the independent gamma rates (IGR) model are shown; when the generating distributions were lognormal and normal, the posterior probabilities of the independent lognormal (ILN) model are shown. The circle is the median, and the error bar denotes the 5th and 95th percentiles summarized across the 100 replicates (trees).

Figure 2

Table 1. Median (5th, 95th percentiles) of the mean squared errors (MSE) of relative rates across the 100 replicates (trees). For each tree, there were nine sets of rates (σG for gamma variance, σL for lognormal variance, and σN for normal variance) and three character lengths (l = 100, 300, and 500) simulated (100 × 9 × 3 datasets). Each dataset was then analyzed using the mixed independent lognormal (ILN) and independent gamma rates (IGR) clock models and the white noise (WN) clock model.

Figure 3

Figure 3. Dated phylogeny of Mesozoic birds. The topology shown is the majority-rule consensus tree summarized from the posterior trees. The node ages in the tree are the posterior medians, and the error bars at the nodes denote the 95% highest posterior density (HPD) intervals. The shade of each node circle represents the posterior probability of the corresponding clade. The color of the branch represents the mean relative evolutionary rate at that branch. The fossil sampling rate, ψ, varies along time in a piecewise constant manner, with four intervals divided at 145, 100, and 66 Ma. The solid line is the posterior mean, and the shade denotes the 95% HPD interval.

Figure 4

Figure 4. A, Five focal node ages (mean and 95% highest posterior density [HPD] interval) of early stem birds estimated under the white noise (WN) model for the unpartitioned data (WN, 1P), WN model for the partitioned data (WN, 6P*; the star denotes disallowing fossil ancestors in the fossilized birth-death [FBD] model), mixed independent lognormal (ILN) and independent gamma rates (IGR) models for the unpartitioned data (ILN&IGR, 1P), and mixed ILN and IGR models for the partitioned data (ILN&IGR, 6P). B, Relative evolutionary rates (mean and 95% HPD interval) at the five focal branches under the WN model for the unpartitioned data (WN, 1P) and mixed ILN and IGR models for the unpartitioned data (ILN&IGR, 1P).

Figure 5

Table 2. Posterior probability of the independent lognormal (ILN) clock model and acceptance rate of the reversible-jump Markov chain Monte Carlo (rjMCMC) proposal under each of the four fixed values of w and when w is auto-tuned.

Figure 6

Figure 5. Relative evolutionary rates (mean and 95% highest posterior density [HPD] interval) for the six anatomical partitions at the five focal branches averaging over the independent lognormal (ILN) and independent gamma rates (IGR) models (ILN&IGR, 6P).

Figure 7

Table 3. Posterior probability of the independent lognormal (ILN) clock model for each of the six partitions and the variance parameters estimated (mean and 95% highest posterior density [HPD] interval).