Posts Tagged ‘ Data ’

Amateur Mapmaking: Getting Started With Shapefiles

January 13, 2012
By
Amateur Mapmaking: Getting Started With Shapefiles

One of the great things about (software) code is that people build on it and out from it… Which means that as well as producing ever more complex bits of software, tools also get produced over time that make it easier to do things that were once hard to do, or required expensive commercial...

Read more »

Over on F1DataJunkie, 2011 Season Review Doodles…

December 30, 2011
By
Over on F1DataJunkie, 2011 Season Review Doodles…

Things have been a little quiet, post wise here, of late, in part because of the holiday season… but I have been posting notes on a couple of charts in progress over on the F1DataJunkie blog. Here are links to the posts in chronological order – they capture the evolution of the chart design(s)...

Read more »

New Powerball (lottery) Rules Will Cost You More

December 16, 2011
By

The popular news are reporting that the Multi-State Lottery Commission (MUSL) will change the rules for their lottery game Powerball, effective Jan. 15, 2012. I sent an email to the MUSL (at 8:00am Dec, 14th) asking for the new official rules, but haven't received a response yet (as of 10:30am Dec, 16th). Hence,...

Read more »

Visualization of Prosper.com’s Loan Data Part I of II – Compare and Contrast with Lending Club

December 6, 2011
By
Visualization of Prosper.com’s Loan Data Part I of II – Compare and Contrast with Lending Club

Due to the positive feedback received on this post I thought I would re-create the analysis on another peer-to-peer lending dataset, courtesy of Prosper.com. You can access the Prosper Marketplace data via an API or by simply downloading XML files that are updated nightly http://www.prosper.com/tools/. If you are going to follow the route I...

Read more »

More Dabblings With Local Sentencing Data

December 1, 2011
By
More Dabblings With Local Sentencing Data

In Accessing and Visualising Sentencing Data for Local Courts I posted a couple of quick ways in to playing with Ministry of Justice sentencing data for the period July 2010-June 2011 at the local court level. At the end of the post, I wondered about how to wrangle the data in R so that...

Read more »

“Home Runs by Park – 2011 Season” or “Man the Astros Sucked This Year”

November 24, 2011
By
“Home Runs by Park – 2011 Season” or “Man the Astros Sucked This Year”

I hate the Giants. Let this be known. What i was hoping to find was another reason to support my claim that their WS win in 2010 was a complete fluke.  So when digging through the game logs for the … Continue reading

Read more »

This One’s Personal: Sanford Koufax vs. Randy Johnson…pffft

November 15, 2011
By
This One’s Personal: Sanford Koufax vs. Randy Johnson…pffft

I couldn’t let this one go. The conclusion draw here by this author that Randy Johnson was “the best pitcher of all time” was not something I could allow to slip through the cracks. Johnson was awesome. Incredible to watch. … Continue reading

Read more »

R 101: The Subset Function

November 9, 2011
By

The subset function is available in base R and can be used to return subsets of a vector, martix, or data frame which meet a particular condition. In my three years of using R, I have repeatedly used the subset() function and believe that it is the most useful tool for selecting elements of...

Read more »

How Might Data Journalists Show Their Working? Sweave

November 1, 2011
By
How Might Data Journalists Show Their Working? Sweave

If part of the role of data journalism is to make transparent the justification behind claims that are, or aren’t, backed up by data, there’s good reason to suppose that the journalists should be able to back up their own data-based claims with evidence about how they made use of the data. Posting links...

Read more »

Power Tools for Aspiring Data Journalists: R

October 31, 2011
By
Power Tools for Aspiring Data Journalists: R

Picking up on Paul Bradshaw’s post A quick exercise for aspiring data journalists which hints at how you can use Google Spreadsheets to grab – and explore – a mortality dataset highlighted by Ben Goldacre in DIY statistical analysis: experience the thrill of touching real data, I thought I’d describe a quick way of...

Read more »

Sponsors