Haplotype names in R
Emmanuel Paradis, the mastermind behind 'ape
' has struck again. This time he brings us the 'pegas
' package, the Population and Evolutionary Genetic Analysis system. This package has a function that collapses the haplotypes (unique DNA sequences) in a DNA alignment, something which is extremely useful in various analyses and in the calculation of genetic diversity.
x<-woodmouse[sample(15, size=110, replace=TRUE), ]
Unfortunately, the haplotypes are rather opaquely numbered by Roman numerals and makes it difficult to figure out where these samples came from. The attribute function above tells you which sequences in x make up which haplotypes in h but it's a bit tedious, particularly when dealing with large data sets. To combat this, I've written a function to label each of the haplotypes with the name given in the original DNAbin object:
for(i in 1:dim(hap)) attr(hap, "dimnames")[][i]<-nam[attr(hap, "index")[[i]]]
'hap' is the haplotype/DNAbin object obtained from running haplotype, while 'dat' is the original DNAbin object.
Let me know how it goes...
To leave a comment
for the author, please follow the link and comment on his blog: The Praise of Insects
offers daily e-mail updates
news and tutorials
on topics such as: visualization (ggplot2
), programming (RStudio
, Web Scraping
) statistics (regression
, time series
) and more...
If you got this far, why not subscribe for updates
from the site? Choose your flavor: e-mail
, or facebook