## Scraping Flora of North America

January 27, 2012
By

So Flora of North America is an awesome collection of taxonomic information for plants across the continent. However, the information within is not easily machine readable. So, a little web scraping is called for. rfna is an R package to collect inf...

## Parallel R Model Prediction Building and Analytics

January 26, 2012
By

Modifying R code to run in parallel can lead to huge performance gains. Although a significant amount of code can easily be run in parallel, there are some learning techniques, such as the Support Vector Machine, that cannot be easily parallelized. However, there is an often overlooked way to speed up these and other models. It...

## Text Mining and The Danish Immigration Debate

January 26, 2012
By

I have for a while now wanted to learn how to do text mining in R. I have played a bit around with the tm package before, but nothing really serious. Being a Danish expat that might soon move back to Denmark with a Swiss girlfriend and child, the Dani...

## Filtering a list with the Filter higher-order function

January 26, 2012
By

Last week markbulling over at Drunks & Lampposts posted a method of using sapply to filter a list by a predicate. Today the @RLangTip tip of the day was to use sapply similarly. This made makes me wonder if R‘s … Continue reading →

## Social Media Interest Maps of Newsnight and BBCQT Twitterers

January 26, 2012
By

I grabbed independent samples of 1500 recent users of the #newsnight and #bbcqt hashtags within a minute or two of each other about half an hour ago. Here’s who’s followed by 25 or more of the recent hashtaggers in each case. Can you distinguish the programmes each audience interest projection map relates to? Here’s the

## All I Don’t Know About Surveys

January 26, 2012
By

I believe my survey experiment Survey Time illustrated later in this post and in Google’s Visualization of Survey Responses mainly pointed out how ignorant I am about the very complicated science of surveying from building the survey to the visualiza...

## SAS Macro Simplifies SAS and R integration

January 26, 2012
By

Many of us feel very enthusiastic about R. It's free, it features cutting edge applications, it has a large community of users contributing for mutual benefit, and on and on. There are also many things to like about SAS, including stability, backwards...

## Penny pinching yields pennies

January 26, 2012
By

A Greek Professor of Economics has written up a thought-provoking series of articles on complexity fetishism. As part of his …Continue reading »

## Parallel R Model Prediction Building and Analytics

January 26, 2012
By

Modifying R code to run in parallel can lead to huge performance gains. Although a significant amount of code can easily be run in parallel, there are some learning techniques, such as the Support Vector Machine, that cannot be easily parallelized. However, there is an often overlooked way to speed up these and other models. It involves executing the...

## A Short Example with R-Package osmar..

January 26, 2012
By

Following up my last post in which I praised the capabilities of the osmar-package I give a short example...ps: You can also find this example at GitHub HERE.library(osmar) # this pulls the data from the OSM-Api:mydistrict <- get_osm(relation(85647)...

January 26, 2012
By

Copyright notice: I hereby release all of the writing  content that is tagged by R, under the cc-by-sa copyrights (date: Jan 26 2012), as long as the copied content comes with proper attribution which also  includes a link to the source of the article .

## Exploring the functions in a package

January 26, 2012
By

Sometimes it can be useful to list all the functions inside a package. This is done in the same way that you would list variables in your workspace. That is, using ls. The syntax is ls(pos = "package:packagename"), which is easy enough if you can remember it. Unfortunately, I never can, and have to type

## How to create the best Interactive R Language Online Learning Platform from the views of R community?

January 26, 2012
By

How to create the best Interactive R Language Online Learning Platform from the views of R community?: R offers a breadth and depth in statistical computing beyond what is available in commercial closed source products. Yet R remains, primarily, a ...

## Hottest 100 for 2011

January 26, 2012
By

Another year, another Australia Day. Another Australia Day, another Triple J Hottest 100. And that, of course, means an excellent excuse to  set R to work on the chart data. For those outside Australia, the Hottest 100 is a chart of the most popular songs of the previous year, as voted by the listeners of

## osmar – Don’t Miss this New R-Geo-Package!

January 26, 2012
By

The osmar-package enables you to retrieve all geographic elements of OpenStreetMap via its API.I.e., you can retrieve a street, river, state-boundary or whatever and use this as a spatial object in R.It's overwhelming thinking of the endless playg...

## Paper on org-mode and reproducible research

January 26, 2012
By

As I was talking recently about reproducible research, I have to post this. A new paper by Eric Schulte, Dan Davison, Thomas Dye, Carsten Dominik. If you haven’t heard about them, you haven’t been on the org-mode mailing list. They could be called the main contributors to org-mode and the part of org-mode called babel,

## Announcing the winners of the Applications of R in Business contest

January 25, 2012
By

It's with great pleasure that the team at Revolution Analytics announces the winners of the inaugural "Applications of R in Business" contest: Grand Prize ($10,000): Shannon Terry and Ben Ogorek, Nationwide Insurance for 'A Direct Marketing In-flight Forecasting System' Runner Up Prize ($5,000): Jeffrey Breen, Atmosphere Research Group for 'Mining Twitter for Airline Consumer Sentiment' Congratulations to both winners,...

## Random matrix theory and APT’s daily global model

January 25, 2012
By

Someone emailed me recently and asked about how APT uses random matrix theory in their factor model. Another question I …Continue reading »

## ABC [PhD] course

January 25, 2012
By

As mentioned in the latest post on ABC, I am giving a short doctoral course on ABC methods and convergence at CREST next week. I have now made a preliminary collection of my slides (plus a few from Jean-Michel Marin’s), available on slideshare (as ABC in Roma, because I am also giving the course in

## ESPN Prediction Performance for the NFL

January 25, 2012
By

Description:ESPN 'experts' predict the National Football League wins/losses each week.  The above chart shows the percentage of their correct guesses and an overall trend, week by week.Data:http://espn.go.com/nfl/picksAnalysis:The graph shows an i...

## RStudio v0.95 Released

January 25, 2012
By

The final version of RStudio v0.95 is now available for download from our website (thanks to everyone who put the preview release through its paces over the last couple of weeks!). Highlights of the new release include: Projects — A new system for managing R projects that enables easy switching between working directories and per-project contexts

## RStudio does not start when using a R version built from source

January 25, 2012
By

Because I do not have root privileges on some of the machines I use, I build part of my software from source and install it into my own homedirectory. Doing that requires one to download R from CRAN, e.g. R… See more ›

## A wordcloud comparison of the 2011 and 2012 #SOTU

January 24, 2012
By

I wrote a quick (and very dirty) R script for creating a comparison cloud and a commonality cloud for President Obama’s 2011 and 2012 State of the Union speeches*. The cloud on the left shows words that have different frequencies between the two spe...

## Project Euler in R: Problem 25

January 24, 2012
By

Solutions in R to Project Euler problems are generally hard to find, so I've recently started posting R solutions on this site.  Here are the previous problems: problem 22, problem 23 and problem 24Now let's...

## R Appears Among Top 20 Programming Languages

January 24, 2012
By

Speaking of R… On the 16th, TIOBE Software released the Tiobe Index of the most popular programming languages. For the first time ever, R is in the Top 20. The top spots are, no surprise, occupied by Java and C respectively. More after the jump. The way this index is assembled is…interesting. The full way

## Geocode your data using, R, JSON and Google Maps’ Geocoding APIs

January 24, 2012
By

Over the last year and half, I have faced numerous challenges with geocoding the data that I have used to showcase my passion for location analytics.  In 2012, I decided to take thing in my control and turned to R.  Here, I am sharing a simpl...

## BBBT Podcast with David Smith

January 24, 2012
By

Last Friday I was fortunate enough to have the opportunity to travel to Boulder, CO and present R and Revolution Analytics to about 20 analysts as part of the Boulder BI Brain Trust (BBBT). It was a great pleasure to describe the vision for Revolution R Enterprise and got some great feedback from the particpants. During the event, I...

## It is "simply" the average value

January 24, 2012
By

for some obscure reasons, simple things are usually supposed to be simple. Recently, on the internet, I saw a lot of posts on the "average time in which you hold a stock", and two rather different values are mentioned "Take any stock in the United ...