Hostname: page-component-5db58dd55d-jhf8m Total loading time: 0 Render date: 2026-05-31T07:24:35.378Z Has data issue: false hasContentIssue false

Bayesian inference of phylogenetic trees is not misled by correlated discrete morphological characters

Published online by Cambridge University Press:  10 October 2025

Xueer Liu
Affiliation:
Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China University of Chinese Academy of Sciences , Beijing 101408, China
Chi Zhang*
Affiliation:
Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China University of Chinese Academy of Sciences , Beijing 101408, China
*
Corresponding author: Chi Zhang; Email: zhangchi@ivpp.ac.cn

Abstract

Morphological characters are central to phylogenetic inference, especially for fossil taxa for which genomic data are unavailable. While Bayesian methods have gained popularity in recent years, they typically assume characters evolve independently, despite known correlations among characters. Here, we assess the impact of character correlation and evolutionary rate heterogeneity on Bayesian phylogenetic inference using extensive simulations of binary characters evolving under independent and correlated models. We find that Bayesian inference assuming character independence accurately recovers tree topologies even when characters are strongly correlated or evolve under heterogeneous rates. However, branch lengths or clock rates tend to be underestimated, particularly under extreme rate heterogeneity. These biases are partially corrected using models that integrate over character-state heterogeneity. Our results demonstrate that Bayesian methods are robust to violations of character independence in topological inference, supporting their continued use in morphological phylogenetics.

Information

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Paleontological Society
Figure 0

Figure 1. The distribution of tree length (A) and the numbers of extant and extinct tips (B) of the simulated trees.

Figure 1

Table 1. Models and settings used in the simulations and inferences. See “Methods” for the explanations of the symbols.

Figure 2

Figure 2. Tree distance metrics (Quartet and Mutual Clustering Information [MCI]) comparing the inferred tree with the true tree generating the data. Each violin plot contains 100 replicates. The left four panels show the results of non-clock analyses (A, C, E, G), while the right four panels show the results of tip-dating analyses (B, D, F, H). Panels labeled “w/ missing” (E–H) indicate scenarios with missing data. The numbers on the x-axis correspond to the following experiments (simulation model vs. inference model): 1, M2v–vs–M2v; 2, M2v–vs–F2v; 3, G4v(α = 10)–vs–M2v; 4, G4v(α = 10)–vs–F2v; 5, G8v(α = 10)–vs–M2v; 6, G8v(α = 10)–vs–F2v; 7, F2v(α = 1)–vs–M2v; 8, F2v(α = 1)–vs–F2v; 9, G4v(α = 1)–vs–M2v; 10, G4v(α = 1)–vs–F2v; 11, G8v(α = 1)–vs–M2v; 12, G8v(α = 1)–vs–F2v; 13, F2v(α = 1, v = 4)–vs–M2v; 14, F2v(α = 1, v = 4)–vs–F2v; 15, G4v(α = 1, v = 4)–vs–M2v; 16, G4v(α = 1, v = 4)–vs–F2v; 17, G8v(α = 1, v = 4)–vs–M2v; 18, G8v(α = 1, v = 4)–vs–F2v.

Figure 3

Figure 3. Relative bias (posterior mean minus the true value, then divided by the true value) and relative width of credibility interval (CI) (95% CI width divided by the true value) for each of the following experiments (simulation model vs. inference model): 1, M2v–vs–M2v; 2, M2v–vs–F2v; 3, G4v(α = 10)–vs–M2v; 4, G4v(α = 10)–vs–F2v; 5, G8v(α = 10)–vs–M2v; 6, G8v(α = 10)–vs–F2v; 7, F2v(α = 1)–vs–M2v; 8, F2v(α = 1)–vs–F2v; 9, G4v(α = 1)–vs–M2v; 10, G4v(α = 1)–vs–F2v; 11, G8v(α = 1)–vs–M2v; 12, G8v(α = 1)–vs–F2v; 13, F2v(α = 1, v = 4)–vs–M2v; 14, F2v(α = 1, v = 4)–vs–F2v; 15, G4v(α = 1, v = 4)–vs–M2v; 16, G4v(α = 1, v = 4)–vs–F2v; 17, G8v(α = 1, v = 4)–vs–M2v; 18, G8v(α = 1, v = 4)–vs–F2v. Each violin plot contains 100 replicates. The left four panels show the tree lengths from non-clock analyses (A, C, E, G), while the right four panels show the tree heights from tip-dating analyses (B, D, F, H). Panels labeled “w/ missing” (E–H) indicate scenarios with missing data.

Figure 4

Figure 4. Relative bias (posterior mean minus the true value, then divided by the true value) and relative width of credibility interval (CI) (95% CI width divided by the true value) of the base clock rate for each of the following experiments (simulation model vs. inference model): 1, M2v–vs–M2v; 2, M2v–vs–F2v; 3, G4v(α = 10)–vs–M2v; 4, G4v(α = 10)–vs–F2v; 5, G8v(α = 10)–vs–M2v; 6, G8v(α = 10)–vs–F2v; 7, F2v(α = 1)–vs–M2v; 8, F2v(α = 1)–vs–F2v; 9, G4v(α = 1)–vs–M2v; 10, G4v(α = 1)–vs–F2v; 11, G8v(α = 1)–vs–M2v; 12, G8v(α = 1)–vs–F2v; 13, F2v(α = 1, v = 4)–vs–M2v; 14, F2v(α = 1, v = 4)–vs–F2v; 15, G4v(α = 1, v = 4)–vs–M2v; 16, G4v(α = 1, v = 4)–vs–F2v; 17, G8v(α = 1, v = 4)–vs–M2v; 18, G8v(α = 1, v = 4)–vs–F2v. Each violin plot contains 100 replicates. The left two panels are scenarios without missing data (A and C), while the right two panels labeled with “w/ missing” (B and D) indicate scenarios with missing data.