Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-07T19:29:54.404Z Has data issue: false hasContentIssue false

On the probabilities of branch durations and stratigraphic gaps in phylogenies of fossil taxa when rates of diversification and sampling vary over time

Published online by Cambridge University Press:  19 February 2019

Peter J. Wagner*
Affiliation:
Department of Earth & Atmospheric Sciences and School of Biological Sciences. 316 Bessey Hall, University of Nebraska, Lincoln, Nebraska 68588, U.S.A. E-mail: pjwagner3@gmail.com. ORCID: 0000-0002-9083-9787

Abstract

The time separating the first appearances of species from their divergences from related taxa affects assessments of macroevolutionary hypotheses about rates of anatomical or ecological change. Branch durations necessarily posit stratigraphic gaps in sampling within a clade over which we have failed to sample predecessors (ancestors) and over which there are no divergences leading to sampled relatives (sister taxa). The former reflects only sampling rates, whereas the latter reflects sampling, origination, and extinction rates. Because all three rates vary over time, the probability of a branch duration of any particular length will differ depending on when in the Phanerozoic that branch duration spans. Here, I present a birth–death-sampling model allowing interval-to-interval variation in diversification and sampling rates. Increasing either origination or sampling rates increases the probability of finding sister taxa that diverge both during and before intervals of high sampling/origination. Conversely, elevated extinction reduces the probability of divergences from sampled sister taxa before and during intervals of elevated extinction. In the case of total extinction, a Signor-Lipps will reduce expected sister taxa leading up to the extinction, with the possible effect stretching back many millions of years when sampling is low. Simulations indicate that this approach provides reasonable estimates of branch duration probabilities under a variety of circumstances. Because current probability models for describing morphological evolution are less advanced than methods for inferring diversification and sampling rates, branch duration priors allowing for time-varying diversification could be a potent tool for phylogenetic inference with fossil data.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © 2019 The Paleontological Society. All rights reserved
Figure 0

Figure 1. Hypothetical phylogeny for sampled lineages (black/light blue), unsampled predecessors (dark gray/dark blue), and unsampled sister taxa (light gray/yellow). A, Two sampled species diverged from unsampled Species D at 310 Ma, generating branch durations (and chronostratigraphic gaps) of 6 Myr for Species A and 4 Myr for Species I. Increased sampling will provide earlier divergence times and shorter branch durations/chronostratigraphic gaps in one of two ways. We might sample predecessors of A or I (dark gray/dark blue). The probability of doing so is the probability of sampling an individual lineage at any one point in time. Alternatively, we might sample sister taxa of A or I (light gray/yellow). The probability of doing so reflects origination (affecting the number of possible sister taxa and the richness of sister clades), extinction (affecting the richness of sister clades and the probability of sampling individual lineages), and sampling (affecting the probability of sampling individual lineages). B, The phylogeny re-rendered as budding speciation. Each letter represents a distinct morphospecies that we would recognize before phylogenetic analysis. Note that the predecessors of A and I now are only part of the durations of ancestors such a Species B and Species H. However, the sum of predecessors (dark gray/dark blue) and the sum of possible “sisters” (light gray/yellow) remains unchanged. C, The phylogeny re-rendered as bifurcation speciation, with ancestral morphospecies being at least slightly altered at divergence so that Species B and Species B′ have at least one character distinguishing them. Now predecessors and ancestors are nearly synonymous, save for the earliest unsampled members of Species A and I. However, the sum of predecessors and the sum of possible sisters remains unchanged.

Figure 1

Table 1. Glossary of terms and variables of importance in this work.

Figure 2

Figure 2. The probability of there being K = 0…4 successors of a species extant at time t = 0 over an interval with duration T = 2.5 given different combinations of origination (λ) and extinction (μ) rates. Note that rates increase by a factor of 1.414 (20.5) between adjacent frames. Note that the successors might include the original species at any point in time.

Figure 3

Figure 3. The expected time with K = 0…4 successors (including the original species) given interval duration T = 2.5. These are the areas under each curve for the corresponding K, origination (λ) and extinction (μ) in the corresponding panels in Fig. 2. The sum of expected lineage durations, S, is the sum of K × E[TK]. Thus at λ = 0.4 and μ = 0.4 (A), E[S] = (1 × 1.25) + (2 × 0.313) + (3 × 0.104) + (4 × 0.039) … = 2.5 LMyr.

Figure 4

Figure 4. The probability of a species extant at the outset of an interval with particular origination (λ) and extinction (μ) having K = 0…4 successors extant 0.25 Myr later. The probability of the species or any successor generating a sampled species in the subsequent 0.25 Myr is conditioned on the probability of K = 0…∞ survivors going into the next interval-slice.

Figure 5

Figure 5. The effect of interval-slice length on overestimate of Φ′. Horizontal bars denote expected Φ′ given 5000 simulations at the appropriate origination (λ) and extinction (μ) rates. Intervals durations are set to T = 2.5, so at m = 250 slices, each ti = 0.001. See also Supplementary Fig. S3.

Figure 6

Figure 6. Breaking down estimates of branch duration probabilities given empirically estimated rates of origination (λ), extinction (μ), and sampling (ψ). Gray bars give rates for each interval (2nd y-axis). A, The components of Φi, the probability of sampling a species appearing in interval-slice i or any successors of that species (i.e., a clade of unknown size). νi (yellow dots) gives the probability of sampling that species or any successors in interval-slice i. ζi (light blue dots) gives the probability of sampling any successors of that species (including the original species) in interval-slice i + 1 or any time afterward before the end of the Katian. Φi reflects the probability of doing either and therefore is Φi = 1− ([1 − νi] × [1 − ζi]). B, The exact probability of a branch duration leading to a sampled species or to a node linking sampled species starting in interval-slice i. The probability of sampling a predecessor (purple) is the probability of sampling the one predecessor of a species or node extant in interval-slice i. The probability of sampling a sister taxon (dark orange) is the probability of a divergence giving rise to a sister taxon with 1+ sampled species in interval-slice i. The probability of one or the other happening (sienna) is the probability of a phylogenetic divergence in interval-slice i given that we have some sampled species with a first occurrence after interval-slice i (or a node linking sampled species that diverges after interval-slice i). C, The probabilities of branch durations (sienna) preceding Salpingostoma richmondensis, which is first known from early in the Aphelognathus divergens conodont Zone. This is broken down into the probability of duration d with no divergences to sampled sister taxa (orange) and the probability of no sampled precursors (purple). The expected branch duration (where P[d] = 0.50) is 3.2 Myr.

Figure 7

Figure 7. Effects of variable sampling over time with constant origination and extinction (λ = μ = 0.40). Five median sampling rates (ψ) are illustrated, with ψ relative to the median given on by the gray bars (2nd y-axis). The six intervals of 2.5 Myr are preceded by six others with median ψ. A, Φ, the probability of sampling a species or any successors given that the species is extant in an interval-slice. Yellow dots give expected Φ given 10,000 simulations using ψ (and λ and μ) of the intervals of a species lifetime. B, The probability of a sampled sister species diverging in each interval-slice. C, The probability of a phylogenetic divergence in each interval-slice from either sampling a predecessor or from the divergence of a subsequently sampled sister taxon. D, Expected durations (d) for a species first appearing in each interval-slice. This also applies to a node that diverged in the same interval-slice. E[d] is the branch duration where P[no ancestors or divergences] reaches 0.50 (see Fig. 6C).

Figure 8

Figure 8. Probabilities of sampling clades of unknown size (Φ) under exemplar models of diversification. Origination, extinction, and sampling rates are per millions of years, with each interval 2.5 Myr long. Each Φi is for a 100 kyr slice. Black lines indicate Φ based on text eq. (9) and the rates of origination (λ, in light gray) and extinction (μ, in dark gray) of that time-slice and subsequent time-slices. For each curve, one of five sampling rates (ψ) is assumed for the entire duration. For pulsed extinction, the gray bar shows the average stage rate, with μi = 0.4 for the first 24 slices and μi = 10.4 for the final interval-slice, giving average μ = 0.8. Yellow dots represent the results of 10,000 simulations and give the proportion of runs in which a lineage appearing at that time ultimately generates a sampled specimen given subsequent diversification.

Figure 9

Figure 9. Probabilities of a divergence that ultimately generates a sampled species given the subsequent origination, extinction, and sampling rates.

Figure 10

Figure 10. Probabilities of setting a phylogenetic divergence in a particular 100 kyr interval-slice, either by sampling a predecessor or by the divergence of a sister taxon that is sampled in that interval-slice or in some subsequent one. The complements of these probabilities are the probabilities of a stratigraphic gap within a phylogeny over the particular interval-slices. The product of these probabilities over any set of interval-slices gives the prior probability of a branch duration spanning that set of interval-slices.

Figure 11

Figure 11. Expected branch durations (E[d]) for a species first appearing in an interval-slice or for an unsampled ancestor diverging in that interval-slice. E[d] is the branch duration where P[d|λ,μ,ψ] = 0.5. Note that E[d] at low ψ allows for several preceding intervals with “normal” diversification and sampling rates.

Figure 12

Table 2. The probability of sampling a clade of unknown size given origination and extinction = 0.4 and different levels of sampling (ψ). ΦB gives the global estimate for a clade given Bapst (2013); ΦD gives the global estimate given Didier et al (2017); and med. Φi gives the median 100 kyr estimate that allows for variation in diversification and sampling.