Hostname: page-component-5db58dd55d-xnzfm Total loading time: 0 Render date: 2026-05-30T21:56:29.682Z Has data issue: false hasContentIssue false

Sampling biases in the rise and fall of fossil marine animal genera

Published online by Cambridge University Press:  02 February 2026

Sapon Chupongstimun*
Affiliation:
The University of Chicago Department of the Geophysical Sciences , United States
*
Corresponding author: Sapon Chupongstimun; Email: saponc@uchicago.edu

Abstract

An important question in evolutionary biology and macroecology is whether taxa show systematic trajectories in occupancy, the proportion of geographic area occupied, over macroevolutionary timescales. Past studies have used fossils to document these trajectories, showing a symmetric rise and fall. In this study, I focus on several biases in the analyses of fossil occupancy trajectories that have been unaccounted for. First, better sampling of boundary bins in a taxon’s stratigraphic range, paradoxically, results in lower mean occupancy of taxa in those bins. This is because better sampling allowed more taxa with low occupancies to be included in the mean occupancies of those bins compared with intermediate bins. Second, the possibility that taxa may have incomplete durations within boundary bins could also lower occupancies in those bins. Finally, a bias can also exist when the number of sampled sites is not constant throughout a taxon’s stratigraphic range. I use simulations to show that the first bias can be corrected by conditioning these boundary bins to be sampled in the same way as intermediate bins. To mitigate the second bias, I use higher-resolution time bins to constrain the intervals over which taxa’s occupancies are measured so that they are comparable between boundary and intermediate time bins. I also present an approach that can correct for the last bias by subsampling geographic sites, testing its impact in a simulation. Considering these factors, the occupancy trajectories of marine animal genera look to be a relatively gradual rise post-origination with a sudden decline before extinction.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of Paleontological Society
Figure 0

Figure 1. Simulation results of stable occupancy trajectories. A, Taxa with zero occupancies are excluded from the mean occupancy within each time bin, resulting in the bias of lower occupancy in boundary bins. B, Taxa that are not sampled are given an occupancy of zero, resulting in the bias of higher mean occupancies in boundary bins. C, Taxa with zero occupancies are excluded from the mean occupancies. Taxa in the first and last time bins are also excluded from the mean occupancies if they are not sampled in the second and penultimate bins, respectively. This treatment results in the expected stable mean occupancy trajectory.

Figure 1

Figure 2. Simulation results of stable mean occupancy trajectories with sequentially increasing number of sites. In all three scenarios, taxa are included in the mean occupancies of the first and last bins only if they are present in the second and penultimate bin, after subsampling (if performed), respectively. A, No subsampling is performed, resulting in sequentially decreasing occupancy. B, Each time bin is subsampled to 30 sites, and a taxon is only included in a time bin’s mean occupancy if it has nonzero occupancy in that bin after subsampling. This does not eliminate the bias from differences in the number of sampled sites. C, Each time bin is subsampled to 30 sites. A taxon is only included in the mean occupancy of a time bin if it has nonzero occupancy in that bin, as well as in both boundary bins, after subsampling. This results in the expected stable occupancy.

Figure 2

Figure 3. Mean occupancy trajectories of genera from stratigraphic ranges of 5 to 12 time bins. Genera that went extinct during mass extinctions or survive close to the Recent are excluded. If a genus is not sampled in a time bin, it is excluded from the mean occupancy for that time bin. Error bars represent 95% confidence intervals from bootstrap resampling. For the results shown in red, genera are only included in the mean occupancies of the first and last time bins if they are sampled in the second and penultimate time bins, respectively.

Figure 3

Figure 4. Histogram of Quantified Asymmetry (QuAsy) values calculated from the occupancy trajectories of individual genera that do not have gaps within their stratigraphic ranges. The vertical blue line shows the mean QuAsy value. QuAsy values are scaled to be from 0 to 1, with 0 representing complete symmetry.

Figure 4

Figure 5. Mean occupancy in the second to the last substages of genera that were sampled in the first substages of their first three stage-level time bins, and in the first to the penultimate substages of genera that were sampled in the last substages of their last three stage-level time bins. Genera that went extinct during mass extinctions or that survive close to the Recent are excluded. For the first and last stage-level time bins, only genera that were also sampled in their second and penultimate stage-level time bins, respectively, are included. The fall in occupancy seems gradual, but there is an abrupt drop at the final bin. Mean occupancies adjusted by Chao et al.’s (2015) method are shown in red. The Quantified Asymmetry (QuAsy) value, scaled from 0 to 1, calculated from this mean trajectory is also given.