Posts Tagged ‘Data’

More Dabblings With Local Sentencing Data

December 1, 2011

In Accessing and Visualising Sentencing Data for Local Courts, I posted a couple of quick ways into playing with Ministry of Justice sentencing data for the period July 2010-June 2011 at the local court level. At the end of the post, I wondered about how to wrangle the data in R so that I

“Home Runs by Park – 2011 Season” or “Man the Astros Sucked This Year”

November 24, 2011

I hate the Giants. Let this be known. What I was hoping to find was another reason to support my claim that their WS win in 2010 was a complete fluke. So when digging through the game logs for the …

This One’s Personal: Sanford Koufax vs. Randy Johnson…pffft

November 15, 2011

I couldn’t let this one go. The conclusion drawn here by this author that Randy Johnson was “the best pitcher of all time” was not something I could allow to slip through the cracks. Johnson was awesome. Incredible to watch. …

R 101: The Subset Function

November 9, 2011

The subset function is available in base R and can be used to return subsets of a vector, matrix, or data frame that meet a particular condition. In my three years of using R, I have repeatedly used the subset() function and believe that it is the most useful tool for selecting elements of a

How Might Data Journalists Show Their Working? Sweave

November 1, 2011

If part of the role of data journalism is to make transparent the justification behind claims that are, or aren’t, backed up by data, there’s good reason to suppose that the journalists should be able to back up their own data-based claims with evidence about how they made use of the data. Posting links to

Power Tools for Aspiring Data Journalists: R

October 31, 2011

Picking up on Paul Bradshaw’s post A quick exercise for aspiring data journalists which hints at how you can use Google Spreadsheets to grab – and explore – a mortality dataset highlighted by Ben Goldacre in DIY statistical analysis: experience the thrill of touching real data, I thought I’d describe a quick way of analysing

Show me your WAR face!

October 24, 2011

Below is a chart of the top 20 offensive players based on FanGraphs WAR for the 2011 season. The various features and their corresponding metrics are clear in the image. I’ve also included the leader and last place for each …

Shipping Mix

October 20, 2011

With a fresh pile of historical global shipping data, we came back to the flow visualizations that illustrated tangible supply lines that facilitate global trade.  This time we've isolated two types of shipping vessels, cargo and tanker, in order ...

How does Matt Kemp become Andre Dawson?

October 18, 2011

While reading this article over at FanGraphs I was inspired to ask myself “what would Matt Kemp have to do between now and the end of his career to be seriously considered for the Hall of Fame?” This question comes …

R Tools for FEC Campaign Finance Disclosure Data

October 17, 2011

For my first contribution to the blog, I wanted to make some kind of enlightening visualization of campaign finance disclosure data from the Federal Election Commission’s website. It looks like they’re working on some new, easy-to-use data dumps here, but …
