13 videos for learning R

May 5, 2010
By

Jeromy Anglim has just posted a nice round-up of instructional videos for learning R. The videos are categorized into four levels: What is R? Introductory R? Intermediate and Advanced Materials on R and Related Resources (including a link to written resources for getting started with R). Check out all the links at Jeromy's post, linked below. Jeromy Anglim's blog:...

Read more »

Fun with R: Clustering and MDS

May 5, 2010
By
Fun with R: Clustering and MDS

I've seen K-means clustering, PCA, etc. done some over at Beyond the Boxscore and Baseball Analysts (and the now defunct Statspeak), but I thought I'd just check out some clustering on the young fantasy season using the traditional 5x5 categories with ...

Read more »

Videos on Data Analysis with R: Introductory, Intermediate, and Advanced Resources

May 4, 2010
By

If you want to learn about R through videos, there are now a large number of options. This post provides links to many of these video under the headings of: (a) What is R? (b) Introductory R, and (c) Intermediate and Advanced R. What is R?If you are...

Read more »

Videos on Data Analysis with R: Introductory, Intermediate, and Advanced Resources

May 4, 2010
By
Videos on Data Analysis with R: Introductory, Intermediate, and Advanced Resources

If you want to learn about R through videos, there are now a large number of options.This post provides links to many of these video under the headings of:(a) What is R?(b) Introductory R, and(c) Intermediate and Advanced R.What is R?If you are evaluat...

Read more »

You won, but how much was luck and how much was skill?

May 4, 2010
By
You won, but how much was luck and how much was skill?

In baseball, what are the chances the winner will win again against the same opponent the very next day?

Read more »

Modifying basic plots in R

May 4, 2010
By

Below is a walk-through of some of the basics of customizing plot in R. These are all based on the graphics package that comes in the base installation of R. Let’s start by making a basic plot in R. In the code snippets below, green text behind a...

Read more »

Difficulty with mcsm?

May 4, 2010
By
Difficulty with mcsm?

An email from Keith I got this morning: Professor Robert, I have loaded the mcsm package to windows. The following messages appear in the R console: trying URL 'http://cran.stat.ucla.edu/bin/windows/contrib/2.9/mcsm_1.0.zip' Content type 'application/zip' length 193590 bytes (189 Kb) opened URL downloaded 189 Kb package 'mcsm' successfully unpacked and MD5 sums checked But when I use the

Read more »

Compcache on Ubuntu on Amazon EC2

May 4, 2010
By
Compcache on Ubuntu on Amazon EC2

The following fully-automatic Bash script downloads, compiles, and initializes compcache version 0.6.2 on Ubuntu Karmic Koala (9.10). This script creates two swaps with a maximum of 4GB uncompressed size each. Two swaps are used to take advantage of 2 CPUs (or CPU cores in a multicore CPU). Compcache is a fascinating memory compression system. The

Read more »

R: directing output to file on the fly, output flushing

May 4, 2010
By

To start sending all output to a file, do this: sink("path/to/filename") # Direct all output to file print("Hi there") # Will be printed to file sink() # Turn off buffing to file Related to this I recently had to use: flush.console() This forces your console to print out any buffered content. Doing this will cost time, but if you are running

Read more »

Where do you want an R User Group?

May 4, 2010
By

Wishing there was a local R User's group in your area, but can't find one? New user groups are springing up all the time, but you can kickstart the process by registering interest for a group in your area at meetup.com. As you can see from the map below, there's interest in many places -- your vote could prompt...

Read more »

Developing a user-friendly regular expression function (easyGregexpr)

May 4, 2010
By

In the past few months, I've developed a set of functions for automating model estimation and interpretation using Mplus, an outstanding latent variable modeling program that has unparalleled flexibility for complex models (e.g., factor mixture models). I recently rolled these functions into an R package called MplusAutomation. Because the package focuses on extracting various parameters from text...

Read more »

Developing a user-friendly regular expression function (easyGregexpr)

May 4, 2010
By

In the past few months, I've developed a set of functions for automating model estimation and interpretation using Mplus, an outstanding latent variable modeling program that has unparalleled flexibility for complex models (e.g., factor mixture models). I recently rolled these functions into an R package called MplusAutomation. Because the package focuses on extracting various parameters from text...

Read more »

Virtual Conference: R the Language

May 3, 2010
By
Virtual Conference: R the Language

On Tuesday May 4th at 9:30 PM central, 10:30 eastern, I’ll be giving a live online presentation as part of the Vconf.org open conference series. I’ll be speaking about R and why I started using R a couple years ago. This is NOT going to be a technical presentation but rather an illustration of how

Read more »

Building Scoring and Ranking Systems in R

May 3, 2010
By
Building Scoring and Ranking Systems in R

This guest article was written by author and consultant Tristan Yates (see his bio below). It emphasizes R’s data object manipulation and scoring capabilities via a detailed financial analysis example. Scoring and ranking systems are extremely valuable management tools. They can be used to predict the future, make decisions, and improve behavior – sometimes all The post Building...

Read more »

Building Scoring and Ranking Systems in R

May 3, 2010
By

This guest article was written by author and consultant Tristan Yates (see his bio below). It emphasizes R's data object manipulation and scoring capabilities via a detailed financial analysis example. Scoring and ranking systems are extremely valuabl...

Read more »

Introduction to R, live virtual talk from JD Long tomorrow

May 3, 2010
By

JD Long (of Cerebral Mastication fame) will be giving a live "virtual talk" introducing R: "What it's best at and how you can use it to help you massage data". The talk will be at 7:30PM Pacific time tomorrow, Tuesday May 4. You can register at the link below. vconf.org: Presentation: R The Language

Read more »

First annual R plot replication prize

May 3, 2010
By
First annual R plot replication prize

$100 to the first person who can figure out how I created this plot and replicate it. Some hints: It was done in R. There is only one underlying probability distribution involved (one “rdist()“). Including the “plot” statement, I created this with 3 short lines of code. This is based on a random sampling of unstated size, so

Read more »

Example 7.35: Propensity score matching

May 3, 2010
By
Example 7.35: Propensity score matching

As discussed in example 7.34, it's sometimes preferable to match on propensity scores, rather than adjust for them as a covariate.SASWe use a suite of macros written by Jon Kosanke and Erik Bergstralh at the Mayo Clinic. The dist macro calculates the ...

Read more »

Displaying data using level plots

May 3, 2010
By
Displaying data using level plots

A level plot is a type of graph that is used to display a surface in two rather than three dimensions – the surface is viewed from above as if we were looking straight down and is an alternative to a contour plot – geographic data is an example of where this type of graph

Read more »

The new GUI for ggplot2 (using Deducer) – the designer wants your opinion

May 1, 2010
By

After discovering that R is expected (this summer) to have a GUI for ggplot2 (through deducer), I later found Ian’s gsoc proposal for this GUI.  Since the system is in it’s early stages of development, Ian has invited people to give comments, input and critique on his plans for the project. For your convenience (and with Ian’s permission), I...

Read more »

Quick and dirty parallel processing in R

April 30, 2010
By
Quick and dirty parallel processing in R

R has some powerful tools for parallel processing, which I discovered while searching for ways to fully utilize my 8-core computer at work. What surprised me is how easy it is…about 6 lines of code, if that. Given that I wasn’t allowed to install heavy duty parallel-processing systems like MPICH on the computer, I found

Read more »

How many girls, how many boys?

April 30, 2010
By
How many girls, how many boys?

I found this interesting question over here at mathoverflow.net. Here’s the question: If you have a country where every family will continue to have children until they get a boy, then they will stop. What is the proportion of boys to girls in the country. First off, there are some assumptions you need to make that aren’t

Read more »

Which font uses the most ink?

April 30, 2010
By
Which font uses the most ink?

If you're being particularly cost-conscious about your use of printer ink or toner, you may be wondering which font you should choose to minimize ink use. Here's an infographic with the answer: This is an interesting infographic in its own right, but what makes it cool is that these are not photoshopped images of Bic biros. Matt Robinson created...

Read more »

Data Manipulation with R – Spector (2008)

April 30, 2010
By
Data Manipulation with R – Spector (2008)

Title: Data Manipulation with RAuthor(s): Phil SpectorPublisher/Date: Springer/2008Statistics level: N/A Programming level: Intermediate Overall recommendation: Highly recommended If there is one book that every beginning R user coming from a programming background should have, it is Spector’s Data Manipulation with R. New R users with analytic backgrounds and experience with software packages such as SAS The post Data...

Read more »

Data Manipulation with R – Spector (2008)

April 30, 2010
By

Title: Data Manipulation with RAuthor(s): Phil SpectorPublisher/Date: Springer/2008Statistics level: N/A Programming level: Intermediate Overall recommendation: Highly recommended If there is one book that every beginning R user coming from a program...

Read more »

hash-2.0.0

April 30, 2010
By
hash-2.0.0

Come see my talk on hashes in R at useR! 2010. (http://user2010.org/) July 20-23 National Institute of Standards and Technology (NIST), Gaithersburg, Maryland, USA

Read more »

5 Minute Analysis in R: Case-Shiller Indices

April 29, 2010
By
5 Minute Analysis in R: Case-Shiller Indices

The Case-Shiller Home Price Indices measure residential home values for 20 cities in the US, with some indices going all the way back to the 80s. With housing prices all the rage these days, we should perform a quick-and-dirty analysis using R to see what we can glean from this rich dataset. First things

Read more »

Research in pair next summer

April 29, 2010
By
Research in pair next summer

Today I received the very good news that our proposal with Jean-Michel Marin to undertake “research in pair” in CIRM, Luminy, a fortnight next summer was accepted! This research centre in Mathematics is a southern and French version of the renowned German centre of Oberwolfach and, while I would have prefered the cool Black Forest

Read more »

Getting a Web application to talk to R

April 29, 2010
By

Let's face it: you can do some pretty awesome things with R -- statistical models, beautiful charts, you name it -- but if the only way to do those things is from the R command line you're limiting the audience of people who might make use of all this awesomeness to a limited subset: R programmers. What if you...

Read more »