Posts Tagged ‘Tinkering’

Merging Data Sets Based on Partially Matched Data Elements

September 26, 2012

A tweet from @coneee yesterday about merging two datasets using columns of data that don’t quite match got me wondering about a possible R recipe for handling partial matching. The data in question related to country names in a datafile that needed fusing with country names in a listing of ISO country codes. The original

Read more »

Interest Differencing: Folk Commonly Followed by Tweeting MPs of Different Parties

July 6, 2012

Earlier this year I doodled a recipe for comparing the folk commonly followed by users of a couple of BBC programme hashtags (Social Media Interest Maps of Newsnight and BBCQT Twitterers). Prompted in part by a tweet from Michael Smethurst/@fantasticlife about generating an ESP map for UK politicians (something I’ve also doodled before – Sketching

Read more »

Doodling With a Conversation, or Retweet, Data Sketch Around LAK12

May 2, 2012

How can we represent conversations between a small sample of users, such as the email or SMS conversations between James Murdoch’s political lobbyist and a Government minister’s special adviser (Leveson inquiry evidence), or the pattern of retweet activity around a couple of heavily retweeted individuals using a particular hashtag? I spent a bit of time

Read more »

Experimenting With iGraph – and a Hint Towards Ways of Measuring Engagement?

January 27, 2012

For fear of being left way behind as Martin Hawksey starts to get to grips with R (see, for example, how he’s using R to automate the annotation of Google Spreadsheets with calculations that don’t come readily or efficiently to hand in Google Spreadsheets itself), I thought I’d better try to get to grips with

Read more »

Rescuing Twapperkeeper Archives Before They Vanish, Redux

December 11, 2011

In Rescuing Twapperkeeper Archives Before They Vanish, I described a routine for grabbing Twapperkeeper archives, parsing them, and saving them to a local desktop file using the R programming language (downloading RStudio is the easiest way I know of getting R…). Following a post from @briankelly (Responding to the Forthcoming Demise of TwapperKeeper), where Brian

Read more »

Getting Started With Twitter Analysis in R

November 9, 2011

Earlier today, I saw, via the aggregating R-Bloggers service, a post on Using Text Mining to Find Out What @RDataMining Tweets are About. The post provides a walkthrough of how to grab tweets into an R session using the twitteR library, and then do some text mining on them. I’ve been meaning to

Read more »

Google Spreadsheets API: Listing Individual Spreadsheet Sheets in R

September 7, 2011

In Using Google Spreadsheets as a Database Source for R, I described a simple Google function for pulling data into R from a Google Visualization/Chart tools API query language query applied to a Google spreadsheet, given the spreadsheet key and worksheet ID. But how do you get a list of sheets in a spreadsheet, without opening

Read more »

The Visual Difference – R and Anscombe’s Quartet

August 30, 2011

I spent a chunk of today trying to get my thoughts in order for a keynote presentation at next week’s The Difference that Makes a Difference conference. The theme of my talk will be how visualisations can be used to discover structure and pattern in data, and as in many of my other recent

Read more »