Hostname: page-component-89b8bd64d-ksp62 Total loading time: 0 Render date: 2026-05-09T02:18:02.426Z Has data issue: false hasContentIssue false

Bedrock geological map predictions for Phanerozoic fossil occurrences

Published online by Cambridge University Press:  27 January 2023

Shan Ye*
Affiliation:
Department of Geoscience, University of Wisconsin–Madison, Madison, Wisconsin 53706, U.S.A. E-mail: shan.ye@wisc.edu, peters@geology.wisc.edu
Shanan E. Peters
Affiliation:
Department of Geoscience, University of Wisconsin–Madison, Madison, Wisconsin 53706, U.S.A. E-mail: shan.ye@wisc.edu, peters@geology.wisc.edu
*
*Corresponding author.

Abstract

Geographically explicit, taxonomically resolved fossil occurrences are necessary for reconstructing macroevolutionary patterns and for testing a wide range of hypotheses in the Earth and life sciences. Heterogeneity in the spatial and temporal distribution of fossil occurrences in the Paleobiology Database (PBDB) is attributable to several different factors, including turnover among biological communities, socioeconomic disparities in the intensity of paleontological research, and geological controls on the distribution and fossil yield of sedimentary deposits. Here we use the intersection of global geological map data from Macrostrat and fossil collections in the PBDB to assess the extent to which the potentially fossil-bearing, surface-expressed sedimentary record has yielded fossil occurrences. We find a significant and moderately strong positive correlation between geological map area and the number of fossil occurrences. This correlation is consistent regardless of map unit age and binning protocol, except at period level; the Neogene and Quaternary have non-marine map units covering large areas and yielding fewer occurrences than expected. The sedimentary record of North America and Europe yields significantly more fossil occurrences per sedimentary area than similarly aged deposits in most of the rest of the world. However, geographic differences in area and age of sedimentary deposits lead to regionally different expectations for fossil occurrences. Using the sampling of surface-expressed sedimentary units in North America and Europe as a predictor for what might be recoverable from the surface-expressed sedimentary deposits of other regions, we find that the rest of the globe is approximately 45% as well sampled in the PBDB. Using age and area of bedrock and sampling in North America and Europe as a basis for prediction, we estimate that more than 639,000 occurrences from outside these regions would need to be added to the PBDB to achieve global geological parity in sampling. In general, new terrestrial fossil occurrences are expected to have the greatest impact on our understanding of macroevolutionary patterns.

Information

Type
Featured Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press on behalf of The Paleontological Society
Figure 0

Figure 1. Geological map and fossil collections. A, “Small”-scale geological map from the Macrostrat database (Precambrian and igneous/metamorphic rocks are separately colored). B, Matched PBDB collections colored by the color scheme of their matched Macrostrat map polygons. C, Unmatched collections colored by their age in the PBDB and the standard colors for periods.

Figure 1

Figure 2. Map of depositional environments based on PBDB collection matches. A, The fraction of marine collections in each polygon with at least 1 matched collection. B, Classification of inferred environments of each polygon based on 40% and 60% cutoffs of the marine fraction.

Figure 2

Figure 3. Matched and unmatched geological map areas vs. age showing overall area–age patterns. A, Matched and unmatched granular polygons. B, As in A, but for distinct map units. Matched fraction in A and B shown by blue curves and blue axis labels. C, Inferred environments of deposition for geological map area based on PBDB collection matches to individual granular polygons. D, As in C, but with environments determined on the basis of distinct map units matched to PBDB collections. In both C and D, environments are determined by the fraction of fossil environments in each polygon or unit (see text for criteria). Polygons or map units with no matched fossil collection have an unknown environment. Cm, Cambrian; O, Ordovician; S, Silurian; D, Devonian; C, Carboniferous; P, Permian; Tr, Triassic; J, Jurassic; K, Cretaceous; Pg, Paleogene; N, Neogene.

Figure 3

Figure 4. Time series of matched and unmatched collections within marine and terrestrial environments using PBDB age estimates. A, Number of marine collections vs. age and the fraction of those collections matched to map units (blue). B, Number of terrestrial collections vs. age and the fraction of those collections matched to map units (blue). C, Stack plot of the number of marine (blue) and terrestrial (tan) fossil occurrences. D, As in C, but for genera. E, Time series of global occurrences normalized by global sedimentary map area. F, As in E, but for distinct genera normalized by global sedimentary map area. Cm, Cambrian; O, Ordovician; S, Silurian; D, Devonian; C, Carboniferous; P, Permian; Tr, Triassic; J, Jurassic; K, Cretaceous; Pg, Paleogene; N, Neogene.

Figure 4

Figure 5. Scatter plots and Pearson's r values of occurrences and genera vs. map area. A, Occurrences vs. granular polygon area. B, Occurrences vs. distinct map unit area. C, Occurrences vs. map area in 10-Myr time bins. D, Occurrences vs. map area in geological periods. E, Genera vs. granular polygon area. F, Genera vs. distinct map unit area. G, Genera vs. map area in 10-Myr time bins. H, Genera vs. map area in geological periods. Cm, Cambrian; O, Ordovician; S, Silurian; D, Devonian; C, Carboniferous; P, Permian; Tr, Triassic; J, Jurassic; K, Cretaceous; Pg, Paleogene; N, Neogene; Q, Quaternary.

Figure 5

Figure 6. Observed spatial distribution and density of fossil occurrences within sedimentary polygons. A, Raw number of occurrences in each map polygon. B, Occurrences per square kilometer in each map polygon. C, Raw number of occurrences in each distinct map unit. D, Occurrences per square kilometer in each distinct map unit. D shows the preferred estimate.

Figure 6

Figure 7. Regional subdivisions, sedimentary map area, and occurrences per map area using map age estimates. A, The regional division of study areas used here. B, Sedimentary rock area through time in each of these regions. C, Occurrence per square kilometer in these different regions tabulated by polygon ages. Cm, Cambrian; O, Ordovician; S, Silurian; D, Devonian; C, Carboniferous; P, Permian; Tr, Triassic; J, Jurassic; K, Cretaceous; Pg, Paleogene; N, Neogene.

Figure 7

Table 1. Number (n) of Phanerozoic distinct map units with at least 1 occurrence and the median area of distinct map units (in km2) in each region.

Figure 8

Table 2. Spearman's rho between the map area (km2) and occurrence counts and genus diversity at the granular polygon and map unit levels. All p-values < 0.05.

Figure 9

Figure 8. Regional time series of marine (blue) and terrestrial (black) mean occurrences after 1000 iterations of a Monte Carlo approach to redistributing fossil occurrences based on geological map units (solid lines) in North America (A), Europe (B), Africa (C), Oceania (D), Central and South America (E), Eastern Europe and Middle East (F), and Asia (G). Also shown are the matched occurrences through time in the original PBDB database, tabulated by map ages (dashed lines). The envelope denotes the range between the 1st and 3rd quantiles of the modeling results after 1000 iterations. Cm, Cambrian; O, Ordovician; S, Silurian; D, Devonian; C, Carboniferous; P, Permian; Tr, Triassic; J, Jurassic; K, Cretaceous; Pg, Paleogene; N, Neogene.

Figure 10

Table 3. Observed and predicted (median and 1st and 3rd quantiles) occurrences in each region (in thousands) and the completeness of each region (observed/median prediction).

Figure 11

Table 4. Mean absolute percentage error (MAPE) between observed and predicted occurrences in each region.

Figure 12

Figure 9. Observed map polygon-matched (dashed) and median predicted (solid) polygon-hosted global fossil occurrences based on a Monte Carlo occurrence redistribution algorithm (see text). The envelopes around the solid curves show the range between the 1st and 3rd quantiles of the modeling results after 1000 iterations. Cm, Cambrian; O, Ordovician; S, Silurian; D, Devonian; C, Carboniferous; P, Permian; Tr, Triassic; J, Jurassic; K, Cretaceous; Pg, Paleogene; N, Neogene.

Figure 13

Figure 10. Predicted density of global fossil occurrences per square kilometer after using the geological record of sampling in North America and Europe and other regional geological records as a basis for the prediction.