New R User Group in Atlanta

June 11, 2010

There's a new R User Group in Atlanta, GA. Well, it's not exactly new -- they've already had their first meeting -- but due to a cock-up on my part I failed to announce it at the time. (Memo to self: don't forget to star emails in Gmail that you have to follow up on!) There's a Google Group...

The R Revolution on TV

June 11, 2010

I never thought I would ever embed videos from FOX on my blog, but this one needs to be covered. Watch SPSS co-founder Norman Nie talking about the “… unbelievably powerful open source language called R …” and “… I am not sure that SPSS is our biggest

Performance benefits of linking R to multithreaded math libraries

June 11, 2010

R wasn't originally designed as a multithreaded application -- multiprocessor systems were still rare when the R Project was first conceived in the mid-90s -- and so, by default, R will only use one processor of your dual-core laptop or quad-core desktop machine when doing calculations. For calculations that take a long time, like big simulations or modeling of...

The Deepwater Horizon, in context

June 11, 2010

The Earth is pretty big. Give or take, it's about 36,000 feet (11km) to the height of a 747 soaring above Everest, and (in pleasing symmetry) about the same distance down to the bottom of the Mariana Trench. (Nonetheless, if you shrank the Earth down to scale, it would still be smoother than a billiard ball.) So I was surprised...

Quantitative Candlestick Pattern Recognition (HMM, Baum Welch, and all that)

June 10, 2010

Fig 1. Clustering-based approach to candlestick pattern recognition.

I've been reading a book titled 'The Quants' that I'm sure will tantalize many traders with some of the ideas embedded within. Most notably (IMO), the notion that Renaissance's Jame...

R on TV

June 10, 2010

Had a fun day today in a TV studio for Fox Business News, where Revolution CEO Norman Nie was giving an interview to Liz Claman at Fox Business Network. As you might expect, the interview focused a lot more on the business side than on technical capabilities of Revolution R, but there was some good discussion of the impacts...

Plotting BP Oil Spill Testing Data using R

June 10, 2010

The Deepwater Horizon incident is in the U.S. headlines and on many people's minds. The folks over at Revolution Analytics were even discussing this subject today. Several government sites, including NOAA, have sections dedicated to the problem. In...

oro.nifti 0.1.5

June 10, 2010

The latest version of oro.nifti (0.1.5) has been released on CRAN. New features include: reduced overall size of the package by 50%; minor improvements, for example, the XML package is no longer required as indicated from the DESCRIPTION file ...

oro.dicom 0.2.6

June 10, 2010

The latest version of oro.dicom (0.2.6) has been released on CRAN. New features/improvements include: a 33% increase in the speed of DICOM file handling; automatic re-orientation/re-slicing of 3D and 4D volumes when converting from DICOM to NIfTI format; Wo...

Rcpp 0.8.2

June 10, 2010

A bug-fix release, Rcpp version 0.8.2, is now on CRAN and Debian. It contains some fixes for the Sun compiler but no user-visible changes, and it complements the Rcpp 0.8.1 release made Tuesday. Our thanks to Brian Ripley for help with the portability tests for...

RcppArmadillo 0.2.2

June 9, 2010

Following the Rcpp 0.8.1 release we made yesterday, we released RcppArmadillo 0.2.2 this morning. RcppArmadillo uses Rcpp (and some 'glue' code) to provide a transparent interface from R to Conrad Sanderson's impressive Armadillo library for li...

Data Mining with WEKA example implemented in R

June 9, 2010

IBM developerWorks has several new articles on Data Mining with WEKA by Michael Abernethy. I decided to implement the example provided in the first article in the series using R. I realize that I could have used WEKA through R (using the RWeka packa...

3 lines of R code to Process a Web Service

June 9, 2010

Ruby is well known for its terse syntax and ability to process web services. I prefer JSON (JavaScript Object Notation) to XML whenever possible. For example, a script to retrieve a list of World Bank Data takes all of three lines of code (after in...

Thoughts on Making Data Work

June 9, 2010

I really enjoyed all four talks at today's online conference, Making Data Work. (Disclosure: Revolution sponsored this conference.) I thought the four speakers together gave a great overview of issues related to the processing, analysis, and visualization of big data. Mike Driscoll started off with a useful categorization for data size. "Small Data" (<10GB) fits in the memory of...

Pegging your multicore CPU in Revolution R, Good and Bad

June 9, 2010

I take an almost unhealthy pleasure in pushing my computer to its limits. This has become easier with Revolution R and its free license for academic use. One of its best features is a debugger that allows you to step through …

Efficient Mixed-Model Association eXpedited (EMMAX) to Simultaneously Account for Relatedness and Stratification in Genome-Wide Association Studies

June 9, 2010

A few months ago I covered an algorithm called EMMA (Efficient Mixed-Model Association), implemented in R, that simultaneously corrects for both population stratification and relatedness in an association study. This method/software is very useful because ...

Stratified sampling

June 9, 2010

The recently arXived paper of Goldstein, Rinott and Scarsini studies the impact of refining a partition on the precision of a stratified maximising/integration Monte Carlo approach. Quite naturally, if the partition gets improved, simulating points in each set of the partition can only improve the quality of the approximation, whether the problem is in maximising

Go Guerrill-R on Your Data in August

June 8, 2010

Guerrill-R, get it? Register now for the Guerrilla Data Analysis Techniques (GDAT) class to be held during the week of August 9-13, 2010. The focus will be on using R and PDQ-R for computer performance analysis and capacity planning. (Click on the...

Plotting World Bank Data with R

June 8, 2010

Copy and paste the code listed below in this post, and you can create a plot of World Bank data using R with a call like the following: plotWorldBank('USA','NY.GDP.MKTP.KD.ZG'). There is a wealth of economic and demographic data available at the World Bank. ...

Rcpp 0.8.1

June 8, 2010

Early this morning I sent Rcpp version 0.8.1 off to CRAN and Debian. In the meantime, Romain has already provided a very nice blog post about it. There are a few fairly visible new things in this release. As we want to focus the next few minor release...

PostGIS in Action Book Review

June 8, 2010

I was recently asked to review a soon-to-be-published book on PostGIS, a spatial extension to the very popular PostgreSQL relational database. I was very excited about receiving an early copy of this book, as the authors have provided countless tips, ...

R is eve R ywhe R e

June 8, 2010

R definitely did not start out as THE statistical computing tool. The “two Rs” far down under just needed a tool that was not too expensive and was structured enough to support the elementary statistics classes filled with hundreds of students. Another constraint was the computing lab, which was large enough but “only” filled with Mac

Norman Nie talks R and statistics with CNET’s Dave Rosenberg

June 8, 2010

Revolution's CEO Norman Nie recently gave an interview to CNET's Dave Rosenberg, who blogs on disrupting the software market. Among the topics covered: What is R (Nie: "R is the most powerful statistical computing language on the planet"), what is predictive analytics ("statistical modeling by another name") and plans for Revolution R ("we intend to bring predictive analytics to...

Rcpp 0.8.1

June 8, 2010

We released Rcpp 0.8.0 almost a month ago. It finalized our efforts in designing a better, faster and more natural API than any version of Rcpp ever before. The journey from Rcpp 0.7.0 to
