Posts Tagged ‘Data’

Function to Generate a Random Data Set

May 2, 2012

Often I find myself needing data sets to try functions and code out on, or for teaching purposes. I have a few stand-bys such as the mtcars and CO2 data sets in the base packages of R, but sometimes I …

Visualization of Reading Level Frequency by Congressional Bill Stage

April 15, 2012

Here’s a fun example of how you might use my data on Congressional bill length and complexity. Imagine you want to understand the empirical distribution of Flesch-Kincaid reading level for Congressional bills and how this distribution is related to …

An unabashedly narcissistic data analysis of my own tweets. The…

April 2, 2012

pie( table( whence.i.tweet )); qplot( whence ) + coord_polar(); pie( log( table( whence ))) + RColorBrewer; ggplot (see below); plot( density( tweets.len )); qplot(... stat="density") + geom_density; qplot(... stat="bin") + geom_text(...); tweeple tweep...

Disproportionality Data

March 25, 2012

So I was hunting around for some data on disproportional electoral outcomes (when the proportion of votes cast for political parties is not close to the proportion of legislative seats that they win). Michael Gallagher keeps an updated version of his L...

A plot of my citations in Google Scholar vs. Web of Science

March 8, 2012

There has been some discussion about whether Google Scholar's numbers or those of one of the proprietary software companies are better for citation counts. I personally think Google Scholar is better for a number of reasons: Higher numbers, but consistently/a...

Download and Parse NAREIT Data

March 1, 2012

This is the first post of a series that describes how to download and parse specific data sets into R. These kinds of scripts can be functionalized further, but I doubt that these will ever find their way into a formal package. They are intended to be helpful to those facing similar tasks, but as

Statistics on the length and linguistic complexity of bills

February 13, 2012

Where would you go to find out what the longest bill of the 112th Congress was by number of sections (H.R. 1473)? How about by number of unique words (H.R. 3671)? What about by Flesch-Kincaid reading level (S. …

Amateur Mapmaking: Getting Started With Shapefiles

January 13, 2012

One of the great things about (software) code is that people build on it and out from it… Which means that as well as producing ever more complex bits of software, tools also get produced over time that make it easier to do things that were once hard to do, or required expensive commercial software

Over on F1DataJunkie, 2011 Season Review Doodles…

December 30, 2011

Things have been a little quiet here of late, post-wise, in part because of the holiday season… but I have been posting notes on a couple of charts in progress over on the F1DataJunkie blog. Here are links to the posts in chronological order – they capture the evolution of the chart design(s) to

New Powerball (lottery) Rules Will Cost You More

December 16, 2011

The popular news is reporting that the Multi-State Lottery Association (MUSL) will change the rules for their lottery game Powerball, effective Jan. 15, 2012. I sent an email to the MUSL (at 8:00am, Dec. 14th) asking for the new official rules, but haven't received a response yet (as of 10:30am, Dec. 16th). Hence, these
