rOpenSci won 3rd place in the PLoS-Mendeley Binary Battle!

November 30, 2011
By
rOpenSci won 3rd place in the PLoS-Mendeley Binary Battle!

I am part of the rOpenSci development team (along with Carl Boettiger, Karthik Ram, and Nick Fabina).   Our website: http://ropensci.org/.  Code at Github: https://github.com/ropensciWe entered two of our R packages for integrating with ...

Read more »

rOpenSci won 3rd place in the PLoS-Mendeley Binary Battle!

November 30, 2011
By
rOpenSci won 3rd place in the PLoS-Mendeley Binary Battle!

I am part of the rOpenSci development team (along with Carl Boettiger, Karthik Ram, and Nick Fabina).   Our website: http://ropensci.org/.  Code at Github: https://github.com/ropensciWe entered two of our R packages for integrating with ...

Read more »

Free ggplot2 webinar from Hadley Wickham

November 30, 2011
By

The Orange County R Users Group is hosting a free webinar presented by Hadley Wickham, author of the ggplot2 graphics package for R. The webinar, "Advanced Visualizations in R with Hadley Wickham" is live from 6PM-7PM Pacific Time tomorrow, December 1. You can register at the LinkedIn event page below, as long as there are spaces left (it's limited...

Read more »

rOpenSci is a runner-up in the Mendeley Binary Battle!

November 30, 2011
By

We just got word that rOpenSci was a runner-up in the first Binary Battle!  Thank you for all the support so far! We entered two of our packages for integrating with PLoS Journals (rplos) and Mendeley (RMendeley) in the Mendeley-PLoS Binary Battle.  Get them at GitHub (rplos; RMendeley). These two packages allow users to search and retrieve

Read more »

GUI for sending email in R (using sendEmail)

November 30, 2011
By
GUI for sending email in R (using sendEmail)

After writing the last post on using sendEmail to send email from R I decided to create a simple GUI to enable this functionality. A snapshot image of the GUI is shown above. To use this GUI, you will need to install the following packages in R: gWidgets gWidgetsRGtk2 Windows GTK Bundle More information on

Read more »

Alpha decay in portfolios

November 30, 2011
By
Alpha decay in portfolios

How does the effect of our expected returns change over time?  This is not academic  curiosity, we want to know in the context of our portfolio if we can.  And we can — we visualize the effect of expected returns in situ. First step The idea is to look at the returns of portfolios that … Continue reading...

Read more »

Job Satisfaction in England – GGPlot #2

November 29, 2011
By
Job Satisfaction in England – GGPlot #2

I’ve recently been scouring the internet for a public opinion data set pertaining to job satisfaction. I was particularly interested in examining how gender, age, and socio-economic status influence how satisfied an individual is with their current employment situation. For example, existing research suggests that women and private-sector employees tend to have higher levels of

Read more »

The art of R programming

November 29, 2011
By
The art of R programming

This is a gem of a book. It will become the book I give PhD students when they are learning how to write good R code. That is, if I ever see it again. I had hoped to write a review of it, but I haven’t seen it since it arrived in the mail a

Read more »

Learning R as a language

November 29, 2011
By
Learning R as a language

Books written to teach a general purpose programming language are usually organized according to the features of the language and examples often show how a particular language feature is interpreted by a compiler. Books about domain specific languages are usually organized in a way that makes sense in the corresponding application domain and examples usually

Read more »

Ulam Spirals in R and ggplot

November 29, 2011
By
Ulam Spirals in R and ggplot

Having seen a twitter post speed by about Ulam Spirals I started to read up.  As the story goes in 1963 Stanislaw Ulam was bored at conference and started scribbling numbers in a spiral. What he discovered was a strange diaginal pattern of Prime Nu...

Read more »

Clearning up the sqldf confusion

November 29, 2011
By

Apparently I have issues with my reading comprehension and with Textmate (initially) when it comes to using the sqldf package. G. pointed out in the previous comments, I could have just used options(gsubfn.engine = "R") instead of going through the trouble of installing the tcltk binaries. If you’ve got a happy distribution of R that

Read more »

RcppArmadillo 0.2.31

November 29, 2011
By

Conrad Sanderson just released the second pre-release 2.3.92 of what will be Armadillo 2.4.*. This is now in RcppArmadillo release 0.2.31 which is already on CRAN as of this morning. The NEWS entries summarising the changes for both a...

Read more »

R still the preferred tool of predictive modelers competing at Kaggle

November 29, 2011
By
R still the preferred tool of predictive modelers competing at Kaggle

As reported on the Kaggle blog No Free Hunch, R remains the preferred tool for data scientists seeking to win the prizes in the predictive modeling competitions: More than 30% of Kaggle competitors report using R for their analysis, up from 22% a year ago. R's flexibility and the breadth of packages for machine learning and predictive modeling make...

Read more »

Relation Between Fires and Distanse to the Nearest Road (Recalculated)

November 29, 2011
By
Relation Between Fires and Distanse to the Nearest Road (Recalculated)

As you may already know, I'm a proud owner of AMD FX-8150 8-core CPU. And I've purchased it not for gaming reasons, but for science. My previous CPU was painfully slow with such calculations as determination of the relation between fires and distance t...

Read more »

Permanently Setting the CRAN repository

November 29, 2011
By

Setting the CRAN repository so that it does not ask every time you try to install a package  is something that I think few people bother to do, but it is so simple and can save a fair bit of frustration when working.  This is accomplished through a setting in one of the Rprofile files.  There

Read more »

Review of "The Art of R Programming" by Norman Matloff

November 29, 2011
By

By Joseph Rickert Anyone seeking to learn R faces two major challenges: (1) learning how to swim in the sea of information: R packages, books, websites, blog posts, message boards etc. that threatens to drown a newbie and (2) and coming to grips with the structure, syntax and features of the language itself. Having some idea of what one...

Read more »

Contributions to the R source

November 29, 2011
By

One of the nice things about tracking the R subversion repository using git instead of subversion is you can do git shortlog -s -n which gives you 19855 ripley 6302 maechler 5299 hornik 2263 pd 1153 murdoch 813 iacus 716 luke 6...

Read more »

Example 9.16: Small multiples

November 29, 2011
By
Example 9.16: Small multiples

Small multiples are one of the great ideas of graphics visionary Edward Tufte (e.g., in Envisioning Information). Briefly, the idea is that if many variations on a theme are presented, differences quickly become apparent. Today we offer general guida...

Read more »

Accessing and Visualising Sentencing Data for Local Courts

November 29, 2011
By
Accessing and Visualising Sentencing Data for Local Courts

A recent provisional data release from the Ministry of Justice contains sentencing data from English(?) courts, at the offence level, for the period July 2010-June 2011: “Published for the first time every sentence handed down at each court in the country between July 2010 and June 2011, along with the age and ethnicity of each

Read more »

outersect(): The opposite of R’s intersect() function

November 29, 2011
By
outersect(): The opposite of R’s intersect() function

The Objective To find the non-duplicated elements between two or more vectors (i.e. the ‘yellow sections of the diagram above) The Problem I needed the opposite of R’s intersect() function, an “outersect()“. The closest I found was setdiff() but the order of the input vectors produces different results, e.g. setdiff() produces all elements of the first

Read more »

A/B Testing in R – Part 1

November 29, 2011
By

A/B testing is a method for comparing the effectiveness of several different variations of a web page. For example, an online clothing retailer that specializes in mens’ streetwear may want to examine whether a black or pink background results in more purchases from visitors to the site. Lets say that our online store is just

Read more »

Trading Strategy Sensitivity Analysis

November 28, 2011
By
Trading Strategy Sensitivity Analysis

When designing a trading strategy, I want to make sure that small changes in the strategy parameters will not transform the profitable strategy into the loosing one. I will study the strategy robustness and profitability under different parameter scenarios using a sample strategy presented by David Varadi in the Improving Trend-Following Strategies With Counter-Trend Entries

Read more »

Dealing with R and HANA

November 28, 2011
By
Dealing with R and HANA

First things first...what's "R"? Simply put...is a programming language and software environment for statistical computing and graphics. More infomation can be found here R on WikipediaI have code in many programming languages, some of them very commer...

Read more »

How to speed up loops in R

November 28, 2011
By
How to speed up loops in R

As with any language, there are often several ways to code up the solution to a programming problem in R. If performance of the code is important (i.e. it's something you plan to run many times, or with a lot of data), how you code the solution can often have a big impact on how fast it runs. For...

Read more »

R’s Distrotheque

November 28, 2011
By
R’s Distrotheque

(Update: The csound package is now available on CRAN.) Do your random variables need to groove more? Of course they do. That's why I've been working on the upcoming csound package for R, which connects to Csound computer synthesis software to make any sound imaginable. Your computer'll be the hippest sample space on the randomized

Read more »

Retrieve GBIF Species Occurrence Data with Function from dismo Package

November 28, 2011
By
Retrieve GBIF Species Occurrence Data with Function from dismo Package

..The dismo package is awesome: with some short lines of code you can read & map species distribution data from GBIF (the global biodiversity information facility) easily:Read more »

Read more »

Course: Financial Data Modeling and Analysis in R

November 28, 2011
By

The University of Washington is holding a web-based course which will be of interest to anyone who wants to learn about financial modeling with R: Financial Data Modeling and Analysis in R (AMATH 542) is a comprehensive introduction to the R statistical programming language for computational finance offered by the University of Washington Computational Finance program and taught by...

Read more »

Where the Worlds of Dentistry and Cartography Collide

November 28, 2011
By
Where the Worlds of Dentistry and Cartography Collide

As I was getting a root canal last week, my dental X-Rays reminded me anew of an optical illusion that stumped us for a short time recently when we were developing our heatmapping engine.My X-Rays, before during and after a recent root canal.  The...

Read more »

Predicting Gender

November 28, 2011
By
Predicting Gender

If there are two (can be generalized to n) classes and both follow the same distribution (but with different parameters) it is possible to predict which class an observations comes from. Here I’ll try to predict a sample’s gender based on their height. The distribution of a person’s height is more or less normal. There

Read more »