I am currently working on a project where I study the change of a continuous morphological variable with body size (measured as body weight) across diverse taxa (insects, spiders, lizards, frogs). In order to account for the non-independence of the data, I built an approximate tree based on published phylogenies. I have a reasonable branching pattern down to family or genus level for most taxa, but lack statistically supported branch length.
I find a strong change of the scaling coefficient when using phylogenetic least squares (with a covariance matrix based on my tree) vs a non-corrected least-squares approach, suggesting that most of the change observed in my continuous variable is explained by evolutionary history, rather than by body mass as such. I would now like to investigate how early on in the tree these shifts occur.
Intuitively, I could start collapsing my tree into polytomies down to arbitrary taxonomic levels, eg genus, family, order etc, and re-do my phylogenetic least squares. I would expect a stepwise approximation of the pgls to the uncorrected result (in the extreme case, I would end up with a star phylogeny, which should give an identical result to my uncorrected regression). Such an approach may allow to say something about whether most of the change occurs between genera, families etc, but it requires the use of somewhat arbitrary (and controversial) taxonomic levels. I would be grateful if someone would have an alternative suggestions on how to perform such an analysis, with arbitrary branch length and non-ultrametric trees.
Thanks Thriceguy