Iraq-Wikileaks Analysis with R

[This article was first published on Fear and Loathing in Data Science, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

In a place of extreme violence and devoid of order, the practical subsumes the principle. I drifted down the path of bribery and corruption endemic to the streets of Baghdad”.

Jason Whiteley, Father of Money: Buying Peace in Baghdad

As I mentioned in a previous post, I wanted to explore the Wikileaks data of the US Military’s reported Significant Activities (SIGACTS).  It will be a subset of the famous Wikileaks classified US military documents.   Private Bradley Manning provided this material to Wikileaks.  He is now behind bars, receiving a 35-year sentence in 2013.  The subset of these documents I will use is available on The Guardian’s datablog website at this link:

The Guardian created this subset by selecting only those SIGACT reports that were associated with deaths of personnel and also that they felt did not compromise confidential sources.  It is stored in a Google Fusion Table.

The code provided merely scratches the surface of analysis that one can do with the data set of roughly 52,000 SIGACTs.  What I show is how to pull the data into R, conduct some basic data wrangling, create a subset, perform a cluster analysis and finally, build maps.  In creating the maps, I show how to create a static map with ggplotpackage as well as an interactive map with the leaflet package.

The subset of the data will focus on 2009 and the area assigned to Multi-National Division Baghdad since I spent 10 months of that year there and roughly 99% of the time in that Division’s Area of Responsibility.

The analysis with code and commentary is on at the following link:

To leave a comment for the author, please follow the link and comment on their blog: Fear and Loathing in Data Science. offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)