Posts Tagged ‘Data’

Building a fact-based world view

January 7, 2011

Gapminder is an independent foundation based in Stockholm, Sweden. Its mission is “to debunk devastating myths about the world by offering free access to a fact-based world view”. They provide free online tools, data (more than 400 datasets freely available!) and videos “to better understand the changing world”. The initial development of Gapminder was the

Read more »

A Very Data Christmas

December 21, 2010

This week Google announced its Ngram Viewer, which allows you to explore the use of words in thousands of texts over time, going back two hundred years. Given the relatively long time period covered by this massive data set, it is fun to explore how language has changed over time. Some texts, however, seem to transcend time.

Read more »

Programming with R – Processing Football League Data Part II

December 3, 2010

Following on from the previous post about creating a football-results processing function for data from the football-data.co.uk website, we will add code to the function to generate a league table based on the results to date. To create the league table we need to count various things such as the number of games played, number

Read more »

Programming with R – Processing Football League Data Part I

November 23, 2010

In this post we will make use of football results data from the football-data.co.uk website to demonstrate creating functions in R to automate a series of standard operations that would be required for results data from various leagues and divisions. The first step is to consider what control options should be available as part of the

Read more »

My First R Package: infochimps

November 20, 2010

I have finally taken the plunge and created my first R package! As frequent readers will know, I often sing the praises of infochimps, a startup out of Austin, TX, attempting to be the world’s data clearinghouse. While infochimps is an excellent resource for data sets, they also provide their own set of excellent data

Read more »

Co-authorship Network of SSRN Conflict Studies eJournal

November 10, 2010

As part of my ongoing research simulating network structure using graph motifs, I have been collecting novel data sets to test and benchmark the method. Since I am a political scientist studying conflict, it was suggested that I collect a co-authorship network within this sub-discipline. Such a network is useful for several reasons;

Read more »

Where People Share Links About NYC

October 27, 2010

Last week I participated in bit.ly’s fourth hackabit hack-a-thon, which is a wonderful opportunity for NYC-area hackers to get together, eat pizza, drink energy drinks, and stay up late hacking with some of the best data geeks around. I was lucky enough to sidle up next to Hilary Mason, bit.ly’s lead scientist, recently named

Read more »

Benford’s Law Tests for Wikileaks Data

August 1, 2010

In my first post on the WL Afghanistan data I provided a very high-level view of the data, and found that it generally met expectations for frequency given its context and presumed data generating process. Next, I will look a bit deeper at this process and test if the observed frequencies of reports have properties

Read more »

Anatomy of a Life-Milestone Announcement on Facebook

July 15, 2010

As I have mentioned, I recently returned from a lovely trip to Europe. While on vacation my brilliant, beautiful, funny, and all-around perfect girlfriend accepted my invitation to be my wife. Pause for shared overwhelming feeling of joy… While I am still basking in the glow of being the luckiest man on Earth, as

Read more »

Europe Data set: > eurodist Athens Barcelona…

July 3, 2010

Europe
Data set:
> eurodist
                 Athens Barcelona Brussels Calais Cherbourg Cologne Copenhagen
Barcelona         3313

Read more »
