Among-species comparisons can include phylogenetic information to account for non-independence arising from shared evolutionary history. Often, phylogenetic topologies and branch lengths are not known exactly, but are estimated with uncertainty. This uncertainty can be accounted for using methods recently described in a neat paper called Bayesian models for comparative analysis integrating phylogenetic uncertainty by Villemereuil et al. Here, I’ll demonstrate the method by estimating the body mass of the Siberut macaque (Macaca siberu).
(I don’t study macaques, but they’re cute enough to warrant a toy example)
Building the phylogeny
First, I downloaded a nexus file with DNA sequences from A molecular phylogeny of living primates by Polina Perelman et al., available here on TreeBASE. I culled the nexus file to include only the 14 species in the genus Macaca, and saved it as macaques.nex.
I used MrBayes to estimate the macaque phylogeny with the following file (analysis.nex):
1 2 3 4 5 6 7 8
Calling MrBayes from within R:
Here is the unrooted consensus tree:
Body mass data
Macaque weight data are available as part of a (much) larger dataset on the body mass of late quaternary mammals, by Smith and colleagues. I extracted the log body mass data for the 13 available macaque species. No body mass data were available for the Siberut macaque (M. siberu).
Body mass model
We will account for phylogenetic non-independence by considering average species weights to be multivariate normally distributed around a within-genus mean, with a covariance matrix $\Sigma$ that reflects phylogenetic distance (see Villemereuil et al.). The off-diagonal elements of this matrix are scaled by Pagel’s $\lambda$, which reflects the degree of phylogenetic signal in the data.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
Then we can fit the model with OpenBUGS, estimating the missing body mass of M. siberu.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47
Here is our phylogenetically informed estimate of the body size for M. siberu. Pagel’s $\lambda$ indicates a weak but non-zero phylogenetic signal with mean = 0.417 and 95% BCI = (0.0156, 0.949). It should go without saying this is a toy example, and it may be better to go out and weigh some actual Siberut macaques (at a minimum, this would be a good excuse for a vacation).
References & further reading
Blomberg et al.): Independent contrasts and PGLS regression estimators are equivalent. Systematic Biology 2012.
Pagel: Inferring the historical patterns of biological evolution. Nature 1999.
Perelman et al.: A molecular phylogeny of living primates. PLoS Genetics 2011.
Smith et al.: Body mass of late quaternary mammals. Ecology 2003.
de Villemereuil et al.: Bayesian models for comparative analysis integrating phylogenetic uncertainty. BMC Evolutionary Biology 2012