# Monthly Archives: August 2013

## Finding Correlations in Data with Uncertainty

August 11, 2013

A week or so ago a colleague of mine asked if I knew how to calculate correlations for data with uncertainties. Now, if we are going to be honest, then all data should have some level of experimental or measurement error. However, I suspect that in the majority of cases these uncertainties are ignored when

## Enhanced meboot package, simulating regression standard errors

August 11, 2013

In my June 25 post I described R code to (i) change scale without changing the mean, and (ii) make a probability distribution symmetric by modifying order statistics. Both are problems commonly encountered by R programmers. My coauthor Javier Lopez-de-Lacalle of Spain has incorporated an efficient version of my code into the maximum entropy bootstrap (meboot) package in R. See the package...

## XML in R – A (German) tutorial / XML in R – ein Tutorial auf Deutsch

August 10, 2013

I used knitr to hack together a very short tutorial about XML in R. It's in German. And it's not very long. But, hey, it's free :) I hope it can be of help to someone who wants to get started with XML processing in R. Please feel free to post or send any ...

## Pappu Vs. Feku – Twitter Wars

August 10, 2013

In my quest to practice R and learn text mining, I am looking at one of the popular Twitter Wars between two political personalities of India who are fondly known in the TwitterVerse as ‘Pappu’ and ‘Feku’, which are basically their ‘ghar ka naam’ (household name) or ‘pyar wala naam’ (pet name). Anyway, the discussion about the origin of the...

## In case you missed it: July 2013 Roundup

August 9, 2013

In case you missed them, here are some articles from July of particular interest to R users: A new 90-second, Creative Commons video helps R enthusiasts share the history, community and applications of R. Analyst group Butler Analytics reviews 10 predictive analytics platforms, and says that "real analysts use R". An excellent example of Simpson's Paradox: US median wages...

## PIMCO Rolling Correlation, d3, R, gridSVG, lattice | Gets An Axis

August 9, 2013

Where else will you hear Pimco, rolling correlation, R, gridSVG, lattice, and d3 all in one post?  Let’s mix them all together to see what might happen.  For those here for the geekery, we will add a d3 axis for our y and it will follow the mouse.  For those who care nothing about d3 and R, you might...

## Approximate string matching in R

August 9, 2013

I have released a new version of the stringdist package. Besides some new string distance algorithms, it now contains two convenient matching functions: `amatch`, equivalent to R's `match` function but allowing for approximate matching, and `ain`, similar to R's `%in%` …

## R-Squared for a VBGM

August 9, 2013

Recently, a fishR user asked me the following question: After fitting the age-length data into VBGM, I overviewed the results. But I can’t find the coefficient of determination ($R^2$) for the VBGM fitting. Because some reviewer want the coefficient …

## inline 0.3.13

August 9, 2013

A minor maintenance release of inline is now on CRAN, and has already been included in Debian. This release contains a patch kindly contributed by Mikhail Umorin which fixes `cfunction` with lists of signatures and function bodies. ...

## Data Scientists and Statisticians: Can’t We All Just Get Along

August 9, 2013

It seems that the title “data science” has taken the world by storm.  It’s a title that conjures up almost mystical abilities of a person garnering information from oceans of data with ease.  It’s where a data scientist can wave his or her hand like a Jedi Knight and simply tell the data what it