Scraping Flora of North America

January 27, 2012
By

So Flora of North America is an awesome collection of taxonomic information for plants across the continent. However, the information within is not easily machine readable. So, a little web scraping is called for. rfna is an R package to collect inf...

Read more »

Parallel R Model Prediction Building and Analytics

January 26, 2012
By

Modifying R code to run in parallel can lead to huge performance gains. Although a significant amount of code can easily be run in parallel, there are some learning techniques, such as the Support Vector Machine, that cannot be easily parallelized. However, there is an often overlooked way to speed up these and other models. It...

Read more »

Text Mining and The Danish Immigration Debate

January 26, 2012
By
Text Mining and The Danish Immigration Debate

I have for a while now wanted to learn how to do text mining in R. I have played a bit around with the tm package before, but nothing really serious. Being a Danish expat that might soon move back to Denmark with a Swiss girlfriend and child, the Dani...

Read more »

Filtering a list with the Filter higher-order function

January 26, 2012
By

Last week markbulling over at Drunks & Lampposts posted a method of using sapply to filter a list by a predicate. Today the @RLangTip tip of the day was to use sapply similarly. This made makes me wonder if R‘s … Continue reading →

Read more »

Social Media Interest Maps of Newsnight and BBCQT Twitterers

January 26, 2012
By
Social Media Interest Maps of Newsnight and BBCQT Twitterers

I grabbed independent samples of 1500 recent users of the #newsnight and #bbcqt hashtags within a minute or two of each other about half an hour ago. Here’s who’s followed by 25 or more of the recent hashtaggers in each case. Can you distinguish the programmes each audience interest projection map relates to? Here’s the

Read more »

All I Don’t Know About Surveys

January 26, 2012
By
All I Don’t Know About Surveys

I believe my survey experiment Survey Time illustrated later in this post and in Google’s Visualization of Survey Responses mainly pointed out how ignorant I am about the very complicated science of surveying from building the survey to the visualiza...

Read more »

SAS Macro Simplifies SAS and R integration

January 26, 2012
By
SAS Macro Simplifies SAS and R integration

Many of us feel very enthusiastic about R. It's free, it features cutting edge applications, it has a large community of users contributing for mutual benefit, and on and on. There are also many things to like about SAS, including stability, backwards...

Read more »

Penny pinching yields pennies

January 26, 2012
By
Penny pinching yields pennies

A Greek Professor of Economics has written up a thought-provoking series of articles on complexity fetishism. As part of his …Continue reading »

Read more »

Parallel R Model Prediction Building and Analytics

January 26, 2012
By

Modifying R code to run in parallel can lead to huge performance gains. Although a significant amount of code can easily be run in parallel, there are some learning techniques, such as the Support Vector Machine, that cannot be easily parallelized. However, there is an often overlooked way to speed up these and other models. It involves executing the...

Read more »

A Short Example with R-Package osmar..

January 26, 2012
By
A Short Example with R-Package osmar..

Following up my last post in which I praised the capabilities of the osmar-package I give a short example...ps: You can also find this example at GitHub HERE.library(osmar) # this pulls the data from the OSM-Api:mydistrict <- get_osm(relation(85647)...

Read more »

Creative Commons license

January 26, 2012
By
Creative Commons license

Copyright notice: I hereby release all of the writing  content that is tagged by R, under the cc-by-sa copyrights (date: Jan 26 2012), as long as the copied content comes with proper attribution which also  includes a link to the source of the article .

Read more »

Exploring the functions in a package

January 26, 2012
By
Exploring the functions in a package

Sometimes it can be useful to list all the functions inside a package. This is done in the same way that you would list variables in your workspace. That is, using ls. The syntax is ls(pos = "package:packagename"), which is easy enough if you can remember it. Unfortunately, I never can, and have to type

Read more »

How to create the best Interactive R Language Online Learning Platform from the views of R community?

January 26, 2012
By

How to create the best Interactive R Language Online Learning Platform from the views of R community?: R offers a breadth and depth in statistical computing beyond what is available in commercial closed source products. Yet R remains, primarily, a ...

Read more »

Hottest 100 for 2011

January 26, 2012
By
Hottest 100 for 2011

Another year, another Australia Day. Another Australia Day, another Triple J Hottest 100. And that, of course, means an excellent excuse to  set R to work on the chart data. For those outside Australia, the Hottest 100 is a chart of the most popular songs of the previous year, as voted by the listeners of

Read more »

osmar – Don’t Miss this New R-Geo-Package!

January 26, 2012
By
osmar – Don’t Miss this New R-Geo-Package!

The osmar-package enables you to retrieve all geographic elements of OpenStreetMap via its API.I.e., you can retrieve a street, river, state-boundary or whatever and use this as a spatial object in R.It's overwhelming thinking of the endless playg...

Read more »

Paper on org-mode and reproducible research

January 26, 2012
By
Paper on org-mode and reproducible research

As I was talking recently about reproducible research, I have to post this. A new paper by Eric Schulte, Dan Davison, Thomas Dye, Carsten Dominik. If you haven’t heard about them, you haven’t been on the org-mode mailing list. They could be called the main contributors to org-mode and the part of org-mode called babel,

Read more »

Announcing the winners of the Applications of R in Business contest

January 25, 2012
By

It's with great pleasure that the team at Revolution Analytics announces the winners of the inaugural "Applications of R in Business" contest: Grand Prize ($10,000): Shannon Terry and Ben Ogorek, Nationwide Insurance for 'A Direct Marketing In-flight Forecasting System' Runner Up Prize ($5,000): Jeffrey Breen, Atmosphere Research Group for 'Mining Twitter for Airline Consumer Sentiment' Congratulations to both winners,...

Read more »

Random matrix theory and APT’s daily global model

January 25, 2012
By
Random matrix theory and APT’s daily global model

Someone emailed me recently and asked about how APT uses random matrix theory in their factor model. Another question I …Continue reading »

Read more »

ABC [PhD] course

January 25, 2012
By
ABC [PhD] course

As mentioned in the latest post on ABC, I am giving a short doctoral course on ABC methods and convergence at CREST next week. I have now made a preliminary collection of my slides (plus a few from Jean-Michel Marin’s), available on slideshare (as ABC in Roma, because I am also giving the course in

Read more »

ESPN Prediction Performance for the NFL

January 25, 2012
By
ESPN Prediction Performance for the NFL

Description:ESPN 'experts' predict the National Football League wins/losses each week.  The above chart shows the percentage of their correct guesses and an overall trend, week by week.Data:http://espn.go.com/nfl/picksAnalysis:The graph shows an i...

Read more »

RStudio v0.95 Released

January 25, 2012
By
RStudio v0.95 Released

The final version of RStudio v0.95 is now available for download from our website (thanks to everyone who put the preview release through its paces over the last couple of weeks!). Highlights of the new release include: Projects — A new system for managing R projects that enables easy switching between working directories and per-project contexts

Read more »

RStudio does not start when using a R version built from source

January 25, 2012
By

Because I do not have root privileges on some of the machines I use, I build part of my software from source and install it into my own homedirectory. Doing that requires one to download R from CRAN, e.g. R… See more ›

Read more »

A wordcloud comparison of the 2011 and 2012 #SOTU

January 24, 2012
By
A wordcloud comparison of the 2011 and 2012 #SOTU

I wrote a quick (and very dirty) R script for creating a comparison cloud and a commonality cloud for President Obama’s 2011 and 2012 State of the Union speeches*. The cloud on the left shows words that have different frequencies between the two spe...

Read more »

Project Euler in R: Problem 25

January 24, 2012
By

Solutions in R to Project Euler problems are generally hard to find, so I've recently started posting R solutions on this site.  Here are the previous problems: problem 22, problem 23 and problem 24Now let's...

Read more »

R Appears Among Top 20 Programming Languages

January 24, 2012
By
R Appears Among Top 20 Programming Languages

Speaking of R… On the 16th, TIOBE Software released the Tiobe Index of the most popular programming languages. For the first time ever, R is in the Top 20. The top spots are, no surprise, occupied by Java and C respectively. More after the jump. The way this index is assembled is…interesting. The full way

Read more »

Geocode your data using, R, JSON and Google Maps’ Geocoding APIs

January 24, 2012
By

Over the last year and half, I have faced numerous challenges with geocoding the data that I have used to showcase my passion for location analytics.  In 2012, I decided to take thing in my control and turned to R.  Here, I am sharing a simpl...

Read more »

BBBT Podcast with David Smith

January 24, 2012
By

Last Friday I was fortunate enough to have the opportunity to travel to Boulder, CO and present R and Revolution Analytics to about 20 analysts as part of the Boulder BI Brain Trust (BBBT). It was a great pleasure to describe the vision for Revolution R Enterprise and got some great feedback from the particpants. During the event, I...

Read more »

It is "simply" the average value

January 24, 2012
By
It is "simply" the average value

for some obscure reasons, simple things are usually supposed to be simple. Recently, on the internet, I saw a lot of posts on the "average time in which you hold a stock", and two rather different values are mentioned "Take any stock in the United ...

Read more »

CBC Reviews: Revolution R (in which this doesn’t go well)

January 24, 2012
By
CBC Reviews: Revolution R (in which this doesn’t go well)

Update!: The latest version of Revolution R, which added support for RHEL 6, appears to work (it appears to at least install, run, and perform basic tasks). See this post for more details. I’ve come to enjoy using R. I had dabbled with it in the past, but found it painfully opaque, and the Effort:Reward

Read more »