Skip to main content
    • Aa
    • Aa

A simple way to improve multivariate analyses of paleoecological data sets

  • John Alroy (a1)

Multivariate methods such as cluster analysis and ordination are basic to paleoecology, but the messy nature of fossil occurrence data often makes it difficult to recover clear patterns. A recently described faunal similarity index based on the Forbes coefficient improves results when its complement is employed as a distance metric. This index involves adding terms to the Forbes equation and ignoring one of the counts it employs (that of species found in neither of the samples under consideration). Analyses of simulated data matrices demonstrate its advantages. These matrices include large and small samples from two partially overlapping species pools. In a cluster analysis, the widely used Dice coefficient and the Euclidean distance metric both create groupings that reflect sample size, the Simpson index suggests large differences that do not exist, and the corrected Forbes index creates groupings based strictly on true faunal overlap. In a principal coordinates analysis (PCoA) the Forbes index almost removes the sample-size signal but other approaches create a second axis strongly dominated by sample size. Meanwhile, species lists of late Pleistocene mammals from the United States capture biogeographic signals that standard ordination methods do recover, but the adjusted Forbes coefficient spaces the points out more sensibly. Finally, when biome-scale lists for living mammals are added to the data set and extinct species are removed, correspondence analysis misleadingly separates out the biome lists, and PCoA based on the Dice coefficient places them to the edge of the cloud of fossil assemblage data points. PCoA based on the Forbes index places them in more reasonable positions. Thus, only the adjusted Forbes index is able to recover true biological patterns. These results suggest that the index may be useful in analyzing not only paleontological data sets but any data set that includes species lists having highly variable lengths.

Linked references
Hide All

This list contains references from the content that can be linked to their source. For a full set of references and notes please see the PDF or HTML where available.

J. R. Bonelli Jr., C. E. Brett , A. I. Miller , and J. B. Bennington . 2006. Testing for faunal stability across a regional biotic transition: quantifying stasis and variation among recurring coral-rich biofacies in the Middle Devonian Appalachian Basin. Paleobiology 32:2037.

J. H. Brown , and P. F. Nicoletto . 1991. Spatial scaling of species composition: body masses of North American land mammals. American Naturalist 138:14781512.

A. M. Bush , and R. I. Brame . 2010. Multiple paleoecological controls on the composition of marine fossil assemblages from the Frasnian (Late Devonian) of Virginia, with a comparison of ordination methods. Paleobiology 36:573591.

A. Chao , R. L. Chazdon , R. K. Colwell , and T.-J. Shen . 2005. A new statistical approach for assessing similarity of species composition with incidence and abundance data. Ecology Letters 8:148159.

P. G. N. Digby , and R. A. Kempton . 1987. Multivariate analysis of ecological communities. Chapman and Hall, London.

H. G. Gauch 1982. Multivariate analysis in community ecology. Cambridge University Press, Cambridge.

J. C. Gower 1966. Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika 53:325338.

E. M. Hagmeier , and C. D. Stults . 1964. A numerical analysis of the distributional patterns of North American mammals. Systematic Zoology 13:125155.

M. O. Hill 1973. Reciprocal averaging: an eigenvector method of ordination. Journal of Ecology 61:237249.

M. O. Hill , and H. G. Gauch . 1980. Detrended correspondence analysis, an improved ordination technique. Vegetatio 42:4758.

S. M. Holland , A. I. Miller , D. L. Meyer , and B. F. Dattilo . 2001. The detection and importance of subtle biofacies within a single lithofacies: the Upper Ordovician Kope Formation of the Cincinnati, Ohio region. Palaios 16:205217.

Z. Hubálek 1982. Coefficients of association and similarity, based on binary (presence-absence) data: an evaluation. Biological Reviews 57:669689.

P. Legendre , and E. D. Gallagher . 2001. Ecologically meaningful transformations for ordination of species data. Oecologia 129:271280.

R. A. Reyment 1963. Multivariate analytical treatment of quantitative species associations: an example from palaeoecology. Journal of Animal Ecology 32:535547.

R. N. Shepard 1962. The analysis of proximities: multidimensional scaling with an unknown distance function. II. Psychometrika 27:219246.

G. G. Simpson 1943. Mammals and the nature of continents. American Journal of Science 241:131.

G. G. Simpson 1964. Species density of North American Recent mammals. Systematic Zoology 13:5773.

T. Tsubamoto , M. Takai , and N. Egi . 2004. Quantitative analyses of biogeography and faunal evolution of middle to late Eocene mammals in East Asia. Journal of Vertebrate Paleontology 24:657667.

M. H. Williamson 1978. The ordination of incidence data. Journal of Ecology 66:911920.

Recommend this journal

Email your librarian or administrator to recommend adding this journal to your organisation's collection.

  • ISSN: 0094-8373
  • EISSN: 1938-5331
  • URL: /core/journals/paleobiology
Please enter your name
Please enter a valid email address
Who would you like to send this to? *


Altmetric attention score

Full text views

Total number of HTML views: 0
Total number of PDF views: 9 *
Loading metrics...

Abstract views

Total abstract views: 201 *
Loading metrics...

* Views captured on Cambridge Core between September 2016 - 20th July 2017. This data will be updated every 24 hours.