I used this idea posted on Stack Overflow to plug together a function for reading data from Google Docs spreadsheets into R. google_ss <- function(gid = NA, key = NA) { if (is.na(gid)) {stop("\nWorksheetnumber (gid) is missing\n")} if (is....

Oracle provides the Oracle R Distribution, an Oracle-supported distribution of open source R. Support for Oracle R Distribution is provided to customers of the Oracle Advanced Analytics option and the Oracle Big Data Appliance. The Oracle R Distribu...

We continue working our way through the examples, case studies, and exercises of what is affectionately known here as “the two bears book” (Swedish björn = bear) and more formally as Non-Life Insurance Pricing with Generalized Linear Models by Esbjörn Ohlsson and Björn Johansson (Amazon UK | US). At...

In case you missed them, here are some articles from February of particular interest to R users. February 29 marked the 12th anniversary of the release of R 1.0.0, and the release of R 2.14.2. A list of commercial vendors who have integrated R with their products for data, analysis, and presentation. The rmr package (part of the RHadoop...

We're starting to see a deluge of companies whose businesses are all about making data analysis/science/insight "easy for the non-expert". We've been here before, quite a few times, sadly. When I started writing software 12 years ago, there was...

With BerkeleyEarth 1.6 posted to CRAN, I figured it was time to write some sample programs to explain how the package works and integrates with other packages. I also have some issues to check out with the metadata, and in the long run I want to reformulate my metadata package to include some new resources.

(This article was first published on twotorials by anthony damico, and kindly contributed to R-bloggers)

Italy now has three local R user groups, thanks to the recent formation of MilanoR in the city of Milan. Founded by R consulting company Quantide's Andrea Spanò, R-core member and University of Milan professor Stefano Iacus, and quantitative consultant Daniele Amberti, MilanoR will be a forum to "exchange knowledge, learn and share tricks and techniques, and provide R...

If the goal of the Invisible Children campaign, which has received millions of dollars of contributions since the Kony 2012 video went viral, is to convince us that the money is being put to humanitarian efforts, they could do a lot better than this chart: Putting 37% of expenses into programs in Africa is a decent result -- many...

Breaking my dissertation- and administrata-induced silence for a small rant combining two of my favorite things – Apple Computer Inc. and simulation. Recently, the New York Times featured the article ‘Apple Confronts the Law of Large Numbers’. The fundamental assertion? That the earnings growth and stock price of Apple cannot continue their rapid rise.

In the previous post on RcppEigen I described an example of sampling from a collection of multinomial distributions represented by a matrix of probabilities. In the timing example the matrix was 100000 by 5 with each of the 100000 rows summing...

This is a very basic introduction to the ggplot2 package. A much more detailed description of the package can be found in the book ggplot2: Elegant Graphics for Data Analysis. On his website (http://had.co.nz/ggplot2/) package author Hadley Wickham describes ggplot2 as a plotting system for R, based on the grammar of graphics, which tries to take...

Here is the inaugural R-Podcast Screencast: Basic Interaction with R. This screencast contains audio from episode 3 of the R-Podcast. In this screencast I demonstrate how to create a vector of numerical data, calculate means, install and load packages, and get help for a function. You can find the R code demonstrated in this episode

This will probably be a final posting on interpolation of xyz data, as I believe I have come to some conclusions on my original issues. I show three methods of xyz interpolation:
1. The quick and dirty method of interpolating projected xyz points (bi-linear)
2. Interpolation using Cartesian coordinates (bi-linear)
3. Interpolation using spherical coordinates and...

A bit of testing of the estimation of the variance matrix for S&P 500 stocks in 2011. Previously: there was a plot in “Realized efficient frontiers” showing the realized volatility in 2011 versus a prediction of volatility at the beginning of the year for a set of random portfolios. A reader commented to me privately …

The Manifesto Project (formerly Manifesto Research Group, Comparative Manifestos Project) has assembled a database of ‘quantitative content analyses of parties’ election programs from more than 50 countries covering all free, democratic elections since 1945’, which is freely accessible online. The …

In this episode: New versions of R and ggplot2 available, listener feedback, and an interactive session with R. The R code discussed in this episode will be available in our GitHub repository, see the show notes for details. There will be a companion screencast to accompany this episode which will be posted shortly. As always,

I recently had to prepare a beamer presentation in Hindi/Devanagari. I usually use Emacs org-mode with a lot of R source code embedded in it to prepare my beamer presentations. To adapt the entire setup to work with Devanagari, this is what I needed to do. Make org-mode export to LaTeX using XeTeX rather than

I was grading my Master projects this morning and came upon this graph: which compares the variability of an importance-sampling estimator versus its self-normalised alternative… This is an interesting case in that self-normalisation does considerably degrade the quality of the approximation in that setting. In other cases, self-normalisation may bring a clear improvement. (This reminded

Recently, I needed to figure out how many extension cords I was going to need to buy in order to reach parts of my field site. Wandering around in the field with a surveyor's tape was an option, but so was plotting distances on an aerial image I had of...

The following R function returns the distance between two points located on a plot. The distance returned is in the same units as the plot's axes.
interDist aa dx dy sqrt(sum(c(dx^2, dy^2)))}
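
The function body above lost its assignments in extraction, so here is a minimal sketch of the same idea, not the author's original code. It assumes the two points come from `graphics::locator(2)`, which returns a list with `x` and `y` components. The names `point_dist` and `inter_dist` are hypothetical, chosen for illustration. The distance calculation is kept separate so it can be checked without clicking on a plot.

```r
# Euclidean distance between two points supplied as
# list(x = c(x1, x2), y = c(y1, y2)), the shape locator(2) returns.
point_dist <- function(pts) {
  dx <- diff(pts$x)  # x2 - x1
  dy <- diff(pts$y)  # y2 - y1
  sqrt(dx^2 + dy^2)
}

# Interactive wrapper (assumption: the original used locator()):
# click two points on the active plot to get their distance.
inter_dist <- function() {
  point_dist(locator(2))
}

# Non-interactive check with a 3-4-5 right triangle: the distance is 5.
point_dist(list(x = c(0, 3), y = c(0, 4)))
```

To measure on a real figure, draw the plot first and then call `inter_dist()`. The result is in axis units, so it is only a physical distance if both axes share the same scale.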