# Blog Archives

## Reading large data tables in R

May 2, 2014
By

Ok, I confess. Until yesterday I was one of those still trying to read large data tables using read.table. Then, I came across this thread in stackoverflow and I saw the light. Since I noticed that a lot of people still struggle with read.table I decided to write this very brief post. Imagine that you

## Summarize content of a vector or data.frame every n entries

August 8, 2013
By

I imagine that the same result can be achieved by a proper use of quantile, but I like to have an easy way to obtain summary statistics every n entries of my dataset be it a vector or data.frame. The function takes three parameters: the R object on which we need to obtain statistics (x),

## R and theater

June 3, 2012
By

You might ask what R has to do with theater. I assure you it has. I act in the theater group ‘ndescenze. We will soon present (actually, we just performed) a show based on the Marx Brothers Radio Shows. We shuffle actors and characters during the show (we like it complicated!) and we needed to

## Introduction to R

May 25, 2012
By

I am happy to repost the information I got about the course “Introduction to R” that will be organized by Milano R net in collaboration with Quantide. The course will be held in Milano, Italy, June 7-8, 2012, and is intended to introduce the unexperienced user to R. For furhter info visit milano R net

## Milano R net meeting

April 24, 2012
By

I received the announce of the first Milano R net meeting, and I am glad to post it below. If you are based in Milano area you might think about going. If you are not based in Milano area, you might think about going as well! Milano R net Milano R net is a users

## Extract different characters between two strings of equal length

January 3, 2012
By

In the desperate effort of understanding the secret of life it may be too simplistic to just count the differences between two strings of equal length. You might as well want to know where they differ. You can do that recycling most of the function published in a previous post. You can use it to

## Count different positions between two strings of equal length

November 26, 2011
By

This is another pretty simple function, written to help me solve the simplest representation of a trivial but tedious task. Most biologist are probably familiar with this task. How many nucleotide differences exist between two given sequences? I only faced the easiest part of the problem, i.e. I do not perform alignment, I just assume that

## Estimate decay of linkage disequilibrium with distance

August 9, 2011
By

It is well known that linkage disequilibrium (LD) decays with distance. Several functions have been proposed to estimate such decay. Among the most widely used are the Hill and Weir (1) formula for describing the decay of r2 and a formula proposed by Abecasis (2) for describing the decay of D’. I wrote R functions

## For happy-R blogging

May 3, 2011
By

You may notice that I don’t have that many posts on my blog, and they are all about R. The paucity of my posts makes me a bit sad, but not much, really… What makes (or better, used to make me) sad is that posts of R code (used to) look awful. However, your code

## Estimate Gene Diversity

May 3, 2011
By

I provide here an R function to estimate gene diversity of diallelic sites (e.g. SNPs), given allele frequencies at each segregating site. The function takes three input parameters: maf: a numeric value (or vector) representing minor allele frequency at each site. Default is 0.5 nreads: size of each resampling experiment. Default is 10000. nreplicates: Number