5117 search results for "git"

CSI Stats: looking for traces of data fraud in R

December 6, 2013
By
CSI Stats: looking for traces of data fraud in R

Recently, I was looking at some published research, and I became concerned that it looked strange. Firstly, it had results wildly different to other similar studies, and more exciting / publishable ones. So, I was looking through the original paper … Continue reading →

Read more »

Le Monde puzzle [#843]

December 6, 2013
By
Le Monde puzzle [#843]

A Le Monde mathematical puzzle of moderate difficulty: How many binary quintuplets (a,b,c,d,e) can be found such that any pair of quintuplets differs by at least two digits? I solved it by the following R code that iteratively eliminates quintuplets that are not different enough from the first ones, for a random order of the

Read more »

On the growth of R and Python for data science

December 6, 2013
By
On the growth of R and Python for data science

A recent article by Matt Asay claims that "Python is displacing R as the language for data science". Python has certainly made some great strides in recent years, evolving beyond a data processing tool (an area where Python excels) to a data analysis tool. The Pandas project, in particular, has greatly expanded Python's ability to handle statistical data sets...

Read more »

Three Quick and Simple Data Cleaning Helper Functions (December 2013)

December 6, 2013
By

As I go about cleaning and merging data sets with R I often end up creating and using simple functions over and over. When this happens, I stick them in the DataCombine package. This makes it easier for me to remember how to do an operation and others can possibly benefit from simplified and (hopefully) more intuitive code....

Read more »

Using R to Analyze Yahoo Fantasy Football Data

December 6, 2013
By
Using R to Analyze Yahoo Fantasy Football Data

I recently created a gist that demonstrated how to authenticate with the Yahoo API, using the httr package. In this post, I will expand on this a little to downloading personal Yahoo fantasy football data and creating a graph showing my regular season ...

Read more »

New package: jsonlite. A smart(er) JSON encoder/decoder.

December 6, 2013
By
New package: jsonlite. A smart(er) JSON encoder/decoder.

This week we released a new package on CRAN: jsonlite. This package is a fork of RJSONIO by Duncan Temple Lang and builds on the same parser, but uses a different mapping between R objects and JSON data. The package vignette goes in great detail and has many examples on...

Read more »

New package: jsonlite. A smart(er) JSON encoder/decoder.

December 6, 2013
By
New package: jsonlite. A smart(er) JSON encoder/decoder.

This week we released a new package on CRAN: jsonlite. This package is a fork of RJSONIO by Duncan Temple Lang and builds on the same parser, but uses a different mapping between R objects and JSON data. The package vignette goes in great detail and has many examples on...

Read more »

Introduction to R and ggplot2 course

December 6, 2013
By

This course is about getting to grips with the basics of the R command line for basic data analysis and visualisation. There's a focus on maps and ggplot2. The first part of the course uses Rich Harris's "Short Introduction to R" (here). The second part works through an "Introduction to Spatial Data and ggplot2" (Cheshire and Lovelace, 2013), which can be found...

Read more »

Introduction to R and ggplot2 course

December 6, 2013
By

This course is about getting to grips with the basics of the R command line for basic data analysis and visualisation. There's a focus on maps and ggplot2. The first part of the course uses Rich Harris's "Short Introduction to R" (here). The second part works through an "Introduction to Spatial Data and ggplot2" (Cheshire and Lovelace, 2013), which can be found...

Read more »

Incidental Parameters Problem with Binary Response Data and Unobserved Individual Effects

December 5, 2013
By
Incidental Parameters Problem with Binary Response Data and Unobserved Individual Effects

It is a well known problem that in some models as the number of observations becomes large, econometric estimators fail to converge on consistent estimators.  The leading case of this is when estimating a binary response model with panel data with...

Read more »