Quick and dirty parallel processing in R

April 30, 2010
By
Quick and dirty parallel processing in R

R has some powerful tools for parallel processing, which I discovered while searching for ways to fully utilize my 8-core computer at work. What surprised me is how easy it is…about 6 lines of code, if that. Given that I wasn’t allowed to install heavy duty parallel-processing systems like MPICH on the computer, I found

Read more »

How many girls, how many boys?

April 30, 2010
By
How many girls, how many boys?

I found this interesting question over here at mathoverflow.net. Here’s the question: If you have a country where every family will continue to have children until they get a boy, then they will stop. What is the proportion of boys to girls in the country. First off, there are some assumptions you need to make that aren’t

Read more »

Which font uses the most ink?

April 30, 2010
By
Which font uses the most ink?

If you're being particularly cost-conscious about your use of printer ink or toner, you may be wondering which font you should choose to minimize ink use. Here's an infographic with the answer: This is an interesting infographic in its own right, but what makes it cool is that these are not photoshopped images of Bic biros. Matt Robinson created...

Read more »

Data Manipulation with R – Spector (2008)

April 30, 2010
By
Data Manipulation with R – Spector (2008)

Title: Data Manipulation with RAuthor(s): Phil SpectorPublisher/Date: Springer/2008Statistics level: N/A Programming level: Intermediate Overall recommendation: Highly recommended If there is one book that every beginning R user coming from a programming background should have, it is Spector’s Data Manipulation with R. New R users with analytic backgrounds and experience with software packages such as SAS

The post Data...

Read more »

Data Manipulation with R – Spector (2008)

April 30, 2010
By

Title: Data Manipulation with RAuthor(s): Phil SpectorPublisher/Date: Springer/2008Statistics level: N/A Programming level: Intermediate Overall recommendation: Highly recommended If there is one book that every beginning R user coming from a program...

Read more »

hash-2.0.0

April 30, 2010
By
hash-2.0.0

Come see my talk on hashes in R at useR! 2010. (http://user2010.org/) July 20-23 National Institute of Standards and Technology (NIST), Gaithersburg, Maryland, USA

Read more »

5 Minute Analysis in R: Case-Shiller Indices

April 29, 2010
By
5 Minute Analysis in R: Case-Shiller Indices

The Case-Shiller Home Price Indices measure residential home values for 20 cities in the US, with some indices going all the way back to the 80s. With housing prices all the rage these days, we should perform a quick-and-dirty analysis using R to see what we can glean from this rich dataset. First things

Read more »

Research in pair next summer

April 29, 2010
By
Research in pair next summer

Today I received the very good news that our proposal with Jean-Michel Marin to undertake “research in pair” in CIRM, Luminy, a fortnight next summer was accepted! This research centre in Mathematics is a southern and French version of the renowned German centre of Oberwolfach and, while I would have prefered the cool Black Forest

Read more »

UAH Channel 5 Anomaly Trends

April 29, 2010
By
UAH Channel 5 Anomaly Trends

Lucia at The Blackboard has on on-going bet where readers submit predictions of the monthly UAH anomaly, the April bet post is here. Lucia does some very interesting climate analysis, some of her charts, however,  can give me a headache … Contin...

Read more »

Getting a Web application to talk to R

April 29, 2010
By

Let's face it: you can do some pretty awesome things with R -- statistical models, beautiful charts, you name it -- but if the only way to do those things is from the R command line you're limiting the audience of people who might make use of all this awesomeness to a limited subset: R programmers. What if you...

Read more »

Response to Flowingdata Challenge: Graphing obesity trends

April 29, 2010
By
Response to Flowingdata Challenge: Graphing obesity trends

Nathan at Flowingata put up another interesting challenge today to improve the following graphic showing obesity trends in America. Here’s my attempt: I transposed the data so that the cohorts are on the X axis and each separate line represents an age group. So each line shows the percentage of obese people in a particular

Read more »

JAGS 2.0

April 29, 2010
By

Is out. On sourceforge. Along with a new rjags. Thanks Martyn. I’m looking forward to working my way through some of the improvements, which include block-updating of parameters in GLMs more use of R’s d/p/q family of functions for den...

Read more »

Tipping heuristics

April 28, 2010
By
Tipping heuristics

INCREDIBLY SIMPLE CALCULATIONS MADE SIMPLE Yes, we all know how to calculate 15% or 20% exactly, but it’s fun to use tipping heuristics and even more fun to make crowded graphs of how they compare to each other. (Sorry for the junky chart. Open for suggestions, in the words of Tom Waits.) Here are a

Read more »

Wavelet Spectrogram Non-Stationary Financial Time Series analysis using R (TTR/Quantmod/dPlR) with USDEUR

April 28, 2010
By
Wavelet Spectrogram Non-Stationary Financial Time Series analysis using R (TTR/Quantmod/dPlR) with USDEUR

I've been doing some research lately regarding types of spectral imaging and decomposition techniques that apply to non-stationary signals. As mentioned earlier, one of the major problems with the simple fourier analysis is that the basis functions ext...

Read more »

R Beginner’s Guide Book Update 4/28/2010

April 28, 2010
By
R Beginner’s Guide Book Update 4/28/2010

Update: Statistical Analysis with R is now available!I am writing to update you on the progress of my R Beginner's Guide book, which is to be published through Packt. I have really gotten to work over the past couple months and have recently completed ...

Read more »

R Beginner’s Guide Book Update 4/28/2010

April 28, 2010
By
R Beginner’s Guide Book Update 4/28/2010

Update: Statistical Analysis with R is now available!I am writing to update you on the progress of my R Beginner's Guide book, which is to be published through Packt. I have really gotten to work over the past couple months and have recently completed ...

Read more »

Sweave vs. pgfSweave

April 28, 2010
By
Sweave vs. pgfSweave

I think pgfSweave should replace the default of Sweave system because it brings significant improvements over the original Sweave. I certain have made it my default Sweave tool on my system.------------------------------- EDITED ON MAY 1, 2010 --------...

Read more »

Eclipse and StatET – a working environment for R

April 28, 2010
By

How to use Eclipse and StatET to create a handy-dandy development environment for R. Using the Eclipse environment provides a nice colored-syntax script editor, an R console, and an object browser to let you peruse the variables you've put into memory....

Read more »

Annotating Lattice Box and Whisker Plots

April 28, 2010
By
Annotating Lattice Box and Whisker Plots

bwplot annotation examplebwplot annotation example

Sometimes you want to add a little text to box and whisker plots produced by the lattice function bwplot(). Here is one approach. Could be optimized a bit more to reduce manual specification of some elements. Suggestions...

Read more »

Analysis of Covariance – Extending Simple Linear Regression

April 28, 2010
By
Analysis of Covariance – Extending Simple Linear Regression

The simple linear regression model considers the relationship between two variables and in many cases more information will be available that can be used to extend the model. For example, there might be a categorical variable (sometimes known as a covariate) that can be used to divide the data set to fit a separate linear

Read more »

Interview with Revolution CEO Norman Nie

April 28, 2010
By

Steve Miller has posted his interview with Revolution's CEO Norman Nie at Information Management blogs. In the interview, Steve digs into Norman's motivations for taking on a new venture around R after his successes with SPSS and how what he learned there applies to Revolution Computing. Also up for discussion: the benefits and challenges of an open-source business model;...

Read more »

Bhapkar V test

April 28, 2010
By

This is the code to perform the Bhapkar V test. I've rapidly wrote it, in 2 hours. The code is then quite brutal and it could be done better. As soon as possible, I will correct it.

WARNING: it works *ONLY* with 3 groups, for now!


bhapkar.test.3g
sample for(i in 1:length(data1)){
sample }

obs for(i in 1:length(data1)){
obs }
rank
cplets vec for(i in 1:length(data1])){
vec...

Read more »

Bhapkar V test

April 28, 2010
By

This is the code to perform the Bhapkar V test. I've rapidly wrote it, in 2 hours. The code is then quite brutal and it could be done better. As soon as possible, I will correct it.

WARNING: it works *ONLY* with 3 groups, for now!


bhapkar.test.3g
sample for(i in 1:length(data1)){
sample }

obs for(i in 1:length(data1)){
obs }
rank
cplets vec for(i in 1:length(data1])){
vec...

Read more »

Social Network Analysis using R and Gephis

April 28, 2010
By
Social Network Analysis using R and Gephis

After learning the basics of R, I decided to learn something harder last week. I picked Social Network Analysis (SNA) to learn the concepts of SNA and R. My primary interest in SNA is visual exploration of networks, so I needed to find a tool first.&n...

Read more »

Transitions in R redux

April 28, 2010
By

Previously, I shared with the world a function to create a pairwise matrix of the number of transitions and transversions between two DNA sequences. Klaus Schliep kindly pointed out the possibility of a bug in the function and offered a faster, more accurate version. Thanks Klaus!


titv mat res for(i...

Read more »

Transitions in R redux

April 28, 2010
By

Previously, I shared with the world a function to create a pairwise matrix of the number of transitions and transversions between two DNA sequences. Klaus Schliep kindly pointed out the possibility of a bug in the function and offered a faster, more accurate version. Thanks Klaus!


titv mat res for(i...

Read more »

Climate Oscillations and GISS Temperature Anomaly Trends

April 27, 2010
By
Climate Oscillations and GISS Temperature Anomaly Trends

In this post, I examine the combined impacts of Pacific Decadal Oscillation (PDO), Atlantic Multidecadal Oscillation (AMO) and El Nino – Southern Oscillation (ENSO)   on the long-term GISS Land and Ocean Temperature Anomaly (LOTA) trend. Introduction Professor Don Easterbrook of … Continue reading

Read more »

Using R for Introductory Statistics, Chapters 1 and 2

April 27, 2010
By
Using R for Introductory Statistics, Chapters 1 and 2

I'm working my way through Using R for Introductory Statistics, by John Verzani, a free version of which is available as SimpleR. Chapter 1 ...covers basics of R such as arithmetic, loading libraries and reading data. We also get an introduction to v...

Read more »

How to do this graph using R

April 26, 2010
By
How to do this graph using R

I am showing three examples that will help an average R user to create beautiful graphs. The interesting (may be useful) parts of these examples are the use of some very smart but tricky functions to, for example, add a Greek symbol on a plot, add a title to a plot with some mathematical symbols.

I am showing three...

Read more »

Contributing Blogs