Phylogenetic Comparative Methods

Current understanding

Phylogenetic comparative methods are used throughout macroevolution to test whether a trait influences diversification rates, but at least one class of these methods — binary state speciation and extinction (BiSSE) — produces false positives at rates that should give pause to anyone interpreting a significant result. The core problem is diversification-rate heterogeneity across the tree: when background speciation rates vary among lineages for reasons unrelated to the focal trait, BiSSE detects that variation and attributes it to the trait anyway.

Two concrete benchmarks illustrate the scale of the problem. On the empirical cetacean phylogeny, which contains well-documented rate heterogeneity driven by the dolphin radiation, more than 77% of neutral character datasets simulated with no state-dependent diversification returned a statistically significant association between character state and speciation rate at p < 0.05 — and 58% cleared the p < 0.001 threshold 10.1093/sysbio/syu131, Finding 1. The second benchmark is harder to explain away: taxon name length, which cannot conceivably drive speciation, showed a significant correlation with speciation rate in more than 69% of vertebrate subtrees, with the false-positive rate approaching 100% (60 of 61 subtrees) in ray-finned fishes 10.1093/sysbio/syu131, Finding 2. Whether model adequacy testing and hidden-state corrections (e.g., HiSSE) reliably fix these problems — or merely redistribute them — remains an active area of debate in the field.

Supporting evidence

Contradictions / open disagreements

The cetacean result is specific to a phylogeny with unusually pronounced diversification heterogeneity. Phylogenies with more homogeneous background diversification dynamics would likely show lower false-positive rates, so the 77% figure is probably an upper bound rather than a universal baseline. Additionally, taxon name length carries weak phylogenetic signal because congeners share name prefixes; the false-positive rates for that character may partly reflect residual phylogenetic structure in the names themselves, not purely tree-wide diversification heterogeneity. Whether state-dependent diversification methods can be trusted without hidden-state correction is still contested, and the degree to which newer approaches (HiSSE, BAMM-trait combinations) solve versus reframe the model adequacy problem is only starting to be tested systematically.

Tealc’s citation-neighborhood suggestions

Question copied. Paste it into the NotebookLM tab.