2256 search results for "map"

Please stop using Excel-like formats to exchange data

December 7, 2012
By
Please stop using Excel-like formats to exchange data

I know “officially” data scientists all always work in “big data” environments with data in a remote database, streaming store or key-value system. But in day to day work Excel files and Excel export files get used a lot and cause a disproportionate amount of pain. I would like to make a plea to my Related posts:

Read more »

UEFA Champions League Knockout Phase Draws: Monte Carlo Simulation with R

December 7, 2012
By

Draws for the knockout phase of the 2012–13 UEFA Champions League will be held in Nyon on the 20th December 2012. The rules of the draw are simple and are as follows:8 Group winner teams will be seeded.8 Group runner-up teams will be unseeded.Teams coming from the same group and from same association...

Read more »

Visualizing Baltimore with R and ggplot2: Crime Data

December 7, 2012
By
Visualizing Baltimore with R and ggplot2: Crime Data

The advent of municipal open data initiatives has been both a blessing and curse for my particular brand of data nerd. On one hand, it has opened up the possibility of developing deep and useful knowledge about the places we...

Read more »

How to spend an inordinate amount of time becoming efficient

December 6, 2012
By
How to spend an inordinate amount of time becoming efficient

I’ve spent a good deal of 2012 constructing a data warehouse to manage all the various data elements that my company has. Although we’re a small enterprise, the richness and complexity of the information is rather high. Moreover, as a data-driven organization, there’s a strong impetus to construct meaningful analysis with every bit of input

Read more »

R in the Cloud

December 6, 2012
By
R in the Cloud

I've been having some great fun parallelizing R code on Amazon's cloud. Now that things are chugging away nicely, it's time to document my foibles so I can remember not to fall into the same pits of despair again. The goal was to perform lots of trails of a randomized statistical simulation. The jobs were independent and fairly chunky, taking...

Read more »

Importing Data Into R from Different Sources

December 6, 2012
By

I have found that I get data from many different sources.  These sources range from simple .csv files to more complex relational databases, to structure XML or JSON files.  I have compiled the different approaches that one can use to easily access these datasets. Local Column Delimited Files This is probably the most common and

Read more »

ggplot2 0.9.3 and plyr 1.8 have been released!

December 6, 2012
By
ggplot2 0.9.3 and plyr 1.8 have been released!

We’re pleased to announce new versions of ggplot2 (0.9.3) and plyr (1.8).  To get up and running with the new versions, start a clean R session without ggplot2 or plyr loaded, and run install.packages(c("ggplot2", "gtable", "scales", "plyr")). Read on to find out what’s new. ggplot2 0.9.3 Most of the changes version 0.9.3 are bug fixes. Perhaps

Read more »

How a Mexican state ended up with more drug war homicides than total homicides

December 5, 2012
By
How a Mexican state ended up with more drug war homicides than total homicides

During 2007 and 2008 the Mexican state of Sinaloa had more drug war-related homicides than total homicides. This should in theory be impossible since drug war homicides are a subset of total homicides. How did this happen? Here is a chart from my old post highlighting the monthly difference between the...

Read more »

Modis QC Bits

December 5, 2012
By
Modis QC Bits

In the course of working through my MODIS  LST project and reviewing the steps that Imhoff and Zhang took as well has the data preparations other researchers have taken ( Neteler ) the issue of MODIS Quality control bits came up.  Every MODIS  HDF file comes with multiple SDS or multiple layers of data. For

Read more »

Shiny Server – Earthshattering News

December 4, 2012
By
Shiny Server – Earthshattering News

As you probably know, I am one of the strongest proponents of the Shiny package for developing interactive web applications Amongst the latest news from RStudio is that what was planned to be commercial software will now be free and Open Source (AGPLv3 license) To celebrate this momentous announcement, I have produced an Earthquake app.

Read more »