Hostname: page-component-77c78cf97d-hf2s2 Total loading time: 0 Render date: 2026-04-26T10:01:27.981Z Has data issue: false hasContentIssue false

Linking host plants to damage types in the fossil record of insect herbivory

Published online by Cambridge University Press:  09 January 2023

Sandra R. Schachat*
Affiliation:
Department of Geological Sciences, Stanford University, Stanford, California 94305, U.S.A. E-mail: schachat@stanford.edu, sschachat@schmidtsciencefellows.org, jlpayne@stanford.edu, chkenboy@stanford.edu
Jonathan L. Payne
Affiliation:
Department of Geological Sciences, Stanford University, Stanford, California 94305, U.S.A. E-mail: schachat@stanford.edu, sschachat@schmidtsciencefellows.org, jlpayne@stanford.edu, chkenboy@stanford.edu
C. Kevin Boyce
Affiliation:
Department of Geological Sciences, Stanford University, Stanford, California 94305, U.S.A. E-mail: schachat@stanford.edu, sschachat@schmidtsciencefellows.org, jlpayne@stanford.edu, chkenboy@stanford.edu
*
*Corresponding author.

Abstract

Studies of insect herbivory on fossilized leaves tend to focus on a few, relatively simple metrics that are agnostic to the distribution of insect damage types among host plants. More complex metrics that link particular damage types to particular host plants have the potential to address additional ecological questions, but such metrics can be biased by sampling incompleteness due to the difficulty of distinguishing the true absence of a particular interaction from the failure to detect it—a challenge that has been raised in the ecological literature. We evaluate a range of methods for characterizing the relationships between damage types and host plants by performing resampling and subsampling exercises on a variety of datasets. We found that the components of beta diversity provide a more valid, reliable, and interpretable method for comparing component communities than do bipartite network metrics and that the rarefaction of interactions represent a valid, reliable, and interpretable method for comparing compound communities. Both beta diversity and rarefaction of interactions avoid the potential pitfalls of multiple comparisons. Finally, we found that the host specificity of individual damage types is challenging to assess. Whereas bipartite network metrics are sufficiently biased by sampling incompleteness to be inappropriate for fossil herbivory data, alternatives exist that are perfectly suitable for fossil datasets with sufficient sample coverage.

Information

Type
Articles
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press on behalf of The Paleontological Society
Figure 0

Figure 1. A comparison of the sampling completeness that can be expected for studies of fossil herbivory (A) with the sampling completeness needed for methods that link host plants to damage types to be unbiased by sampling completeness (B). A, Rarefaction of damage types on the two dominant host plants at the Colwell Creek Pond assemblage. The solid lines and corresponding 84% confidence intervals represent interpolated damage type diversity, and the dashed lines with question marks represent extrapolated diversity. B, An illustration of the sampling completeness that is needed for bipartite network analysis not to be biased by sampling: the rarefaction curve for each host plant should have sample coverage sensu Chao and Jost (2012) above 0.99. All rarefaction curves shown in this panel have coverage between 0.995 and 0.997.

Figure 1

Figure 2. The sampling evenness for host plants in neontological (A, B) and paleontological (C, D) datasets that can be used to link host plants to herbivores or damage types. A, Basset et al. (1996); this maximally even sampling is representative of various other neontological studies of plant–insect networks (Novotny et al. 2002, 2004, 2012; Lundgren and Olesen 2005; Olesen et al. 2008; Pinheiro et al. 2008; Gibson et al. 2011; Grass et al. 2013; Trøjelsgaard et al. 2015; Oleques et al. 2019; Zemenick et al. 2021). B, Lewis et al. (2002). C, Currano et al. (2008). D, Xu et al. (2018).

Figure 2

Table 1. The percentiles of leaves on which damage types were observed at the Willershausen assemblage.

Figure 3

Figure 3. Mean values and 95% confidence intervals for bipartite network metrics generated by resampling and subsampling the cleaned Willershausen dataset in its entirety. DT, damage type; PH, plant host.

Figure 4

Figure 4. An example of a nearly unbiased estimator. Mean values and 95% confidence intervals for coverage-based rarefaction generated by resampling and subsampling the Willershausen dataset. Moreover, coverage-based rarefaction performs as a consistent estimator, in that estimates converge on the true value as sample size increases. No results are presented for 300 subsampled leaves from the complete dataset at sample coverage of 0.9, because some iterations of this sampling routine yielded an observed sample coverage below 0.9.

Figure 5

Figure 5. Mean values and 95% confidence intervals for bipartite network metrics generated by resampling and subsampling data for the 10 host plants at Willershausen represented by the highest numbers of leaves. DT, damage type; PH, plant host.

Figure 6

Figure 6. Mean values and 95% confidence intervals for beta-diversity metrics generated by resampling and subsampling data for the two most abundant host plants from (A) Willershausen, (B) Colwell Creek Pond, and (C) Williamson Drive. At the highest sample sizes, represented in dark blue, the data were resampled rather than subsampled. Macronrptrs.scheuchzeri = Macroneuropteris scheuchzeri; Sgllrphyllm. = Sigillariophyllum.

Figure 7

Figure 7. Mean values and 95% confidence intervals for the number of plant taxa on which various damage types appear, calculated with the Willershausen dataset.

Figure 8

Figure 8. False positive results of “specialized” damage generated by iteratively resampling data from Colwell Creek Pond. We treated each iteration in which DT032 or DT120 was observed on only one host plant taxon as a false positive. The heat maps show the percentage of iterations for each amount of subsampled surface area in which a false positive result was recovered, arranged by the number of specimens on which the damage type was observed. The histograms show the summed percentages, by number of specimens.

Figure 9

Figure 9. Mean values and 95% confidence intervals for coverage-based rarefaction of interactions. The datasets presented here are Williamson Drive and Colwell Creek Pond, both from the Permian of Texas (rarefied to a sample coverage of 0.771) and a simulated dataset that mimics the patterns seen among angiosperms at Willershausen (rarefied to a sample coverage of 0.726).

Figure 10

Figure 10. Comparison of the raw and rarefied interaction data from Colwell Creek Pond and Williamson Drive. Each column of each graph represents a damage type. The heat maps show the prevalence of each interaction, and the asterisks denote interactions that remain after rarefying data from each assemblage to a sample coverage of 0.771.

Figure 11

Figure 11. Mean values and 95% confidence intervals for bipartite network metrics, generated by subsampling each dataset to 300 leaves. DT, damage type; PH, plant host.

Figure 12

Table 2. The variety of narratives about the PETM supported by different combinations of bipartite network metrics.

Figure 13

Table A1. A toy example of the input used for bipartite network analysis. For rarefaction of interactions (Dyer et al. 2010), the input would be a vectorized version of this matrix, which could take any of the following forms: [ 1 5 0 2 2 0 0 6 0 0 1 0 0 1 0 1 0 3 0 1 ], or [ 1 5 2 2 6 1 1 1 3 1 ], or [ 6 5 3 2 2 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 ], or [ 6 5 3 2 2 1 1 1 1 1 ].