Merging Data Sets Based on Partially Matched Data Elements

September 26, 2012 | 0 Comments

A tweet from @coneee yesterday about merging two datasets using columns of data that don’t quite match got me wondering about a possible R recipe for handling partial matching. The data in question related to country names in a datafile that needed fusing with country names in a listing ... [Read more...]

Doodling With a Conversation, or Retweet, Data Sketch Around LAK12

May 2, 2012 | 0 Comments

How can we represent conversations between a small sample of users, such as the email or SMS conversations between James Murdoch’s political lobbyist and a Government minister’s special adviser (Leveson inquiry evidence), or the pattern of retweet activity around a couple of heavily retweeted individuals using a particular ... [Read more...]

Rescuing Twapperkeeper Archives Before They Vanish, Redux

December 11, 2011 | 0 Comments

In Rescuing Twapperkeeper Archives Before They Vanish, I described a routine for grabbing Twapperkeeper archives, parsing them, and saving them to a local desktop file using the R programming language (downloading RStudio is the easiest way I know of getting R…). Following a post from @briankelly (Responding to the Forthcoming ... [Read more...]

Getting Started With Twitter Analysis in R

November 9, 2011 | 0 Comments

Earlier today, I saw a post, via the aggregating R-Bloggers service, on Using Text Mining to Find Out What @RDataMining Tweets are About. The post provides a walkthrough of how to grab tweets into an R session using the twitteR library, and then do some text mining on ... [Read more...]

The Visual Difference – R and Anscombe’s Quartet

August 30, 2011 | 0 Comments

I spent a chunk of today trying to get my thoughts in order for a keynote presentation at next week’s The Difference that Makes a Difference conference. The theme of my talk will be how visualisations can be used to discover structure and pattern in data, and as ... [Read more...]

