Beancounter minor bug fix release 0.8.10

June 13, 2010

A few months after the 0.8.9 release, we have another small update to beancounter. This version fixes a minor infelicity in the manual page (thanks to an Ubuntu and then Debian bug report) as well as a small coding bug where 'USD' was hard-coded when th...

Read more »

Primes in R (Part III): Schoolmath is Broken!

June 13, 2010

Here we are in Part III. Wait!? What happened to Parts I and II? Well, I started to write an article about Amdahl's law, parallelism and prime numbers, but found myself buried three levels deep trying to resolve problems with prime numbers in R. My normal inclination is to use Mathematica for such things, but I happened to...

Read more »

Dynamic Modeling 3: When the first-order difference model doesn’t cut it

June 12, 2010

Data must be selected carefully. The predictive usefulness of the model is grossly diminished if outliers taint the available data. Figure 1, for instance, shows defense spending (as a fraction of the national budget) between 1948 and 1968. Note how the trend curve (as defined by our linear difference model from the last post: see

Read more »

R Tools for Dynamical Systems ~ bifurcation plot in R for system of ODEs

June 12, 2010

As per request, here is the code that I wrote to draw bifurcation plots in R. Bifurcation diagrams for discrete maps can be done using this code by James Jones. It is a little easier since approximation is not needed. In the following code, I used the deSolve library to draw bifurcation diagrams for a

Read more »

A different way to view probability densities

June 12, 2010

The standard, textbook way to represent a density function looks like this: Perhaps you have seen this before? (Plot created in R, all source code from this post is included at the end). Not only will you find this plot in statistics books, you’ll also see it in medical texts, sociology, and even economics books.

Read more »

Prototype: More Web-Friendly Visualizations in R

June 12, 2010

I've spent some more time thinking about how best to put together the package for creating web-friendly, interactive data visualizations in R. I have a pretty substantial JavaScript package that does a lot of basic visualizations now, and it's really exciting to see where this is going. With this in mind, I'm releasing a new

Read more »

New R User Group in Atlanta

June 11, 2010

There's a new R User Group in Atlanta, GA. Well, it's not exactly new -- they've already had their first meeting -- but due to a cock-up on my part I failed to announce it at the time. (Memo to self: don't forget to star emails in Gmail that you have to follow up on!) There's a Google Group...

Read more »

The R Revolution on TV

June 11, 2010

I never thought I would ever embed videos from FOX on my blog, but this one needs to be covered: watch SPSS co-founder Norman Nie talking about the “… unbelievably powerful open source language called R …” and “… I am not sure that SPSS is our biggest

Read more »

Performance benefits of linking R to multithreaded math libraries

June 11, 2010

R wasn't originally designed as a multithreaded application -- multiprocessor systems were still rare when the R Project was first conceived in the mid-90s -- and so, by default, R will only use one processor of your dual-core laptop or quad-core desktop machine when doing calculations. For calculations that take a long time, like big simulations or modeling of...

Read more »

The Deepwater Horizon, in context

June 11, 2010

The Earth is pretty big. Give or take, it's about 36,000 feet (11km) to the height of a 747 soaring above Everest, and (in pleasing symmetry) about the same distance down to the bottom of the Mariana Trench. (Nonetheless, if you shrank the Earth down to scale, it would still be smoother than a billiard ball.) So I was surprised...

Read more »

Quantitative Candlestick Pattern Recognition (HMM, Baum Welch, and all that)

June 10, 2010

Fig 1. Clustering-based approach to candlestick pattern recognition. I've been reading a book titled 'The Quants' that I'm sure will tantalize many traders with some of the ideas embedded within. Most notably (IMO), the notion that Renaissance's Jame...

Read more »

R on TV

June 10, 2010

Had a fun day today in a TV studio for Fox Business News, where Revolution CEO Norman Nie was giving an interview to Liz Claman at Fox Business Network. As you might expect, the interview focused a lot more on the business side than on technical capabilities of Revolution R, but there was some good discussion of the impacts...

Read more »

Plotting BP Oil Spill Testing Data using R

June 10, 2010

The Deepwater Horizon incident is in the U.S. headlines and on many people's minds. The folks over at Revolution Analytics were even discussing this subject today. Several government sites, including NOAA, have sections dedicated to the problem. In...

Read more »

oro.nifti 0.1.5

June 10, 2010

The latest version of oro.nifti (0.1.5) has been released on CRAN. New features include: reduced overall size of the package by 50%; minor improvements, for example, the XML package is no longer required as indicated from the DESCRIPTION file ...

Read more »

oro.dicom 0.2.6

June 10, 2010

The latest version of oro.dicom (0.2.6) has been released on CRAN. New features/improvements include: 33% increase in the speed of DICOM file handling; automatic re-orientation/re-slicing of 3D and 4D volumes when converting from DICOM to NIfTI format; Wo...

Read more »

Rcpp 0.8.2

June 10, 2010

A bug fix release, Rcpp version 0.8.2, is now on CRAN and Debian. It contains some fixes for the Sun compiler but no user-visible changes, and it complements the Rcpp 0.8.1 release made Tuesday. Our thanks to Brian Ripley for help with the portability tests for...

Read more »

RcppArmadillo 0.2.2

June 9, 2010

Following the Rcpp 0.8.1 release we made yesterday, we released RcppArmadillo 0.2.2 this morning. RcppArmadillo uses Rcpp (and some 'glue' code) to provide a transparent interface from R to Conrad Sanderson's impressive Armadillo library for li...

Read more »

Data Mining with WEKA example implemented in R

June 9, 2010

IBM developerWorks has several new articles on Data Mining with WEKA by Michael Abernethy. I decided to implement the example provided in the first article in the series using R. I realize that I could have used WEKA through R (using the RWeka packa...

Read more »

3 lines of R code to Process a Web Service

June 9, 2010

Ruby is well known for its terse syntax and ability to process web services. I prefer JSON (JavaScript Object Notation) to XML whenever possible. For example, a script to retrieve a list of World Bank Data takes all of three lines of code (after in...

Read more »

Thoughts on Making Data Work

June 9, 2010

I really enjoyed all four talks at today's online conference, Making Data Work. (Disclosure: Revolution sponsored this conference.) I thought the four speakers together gave a great overview of issues related to the processing, analysis, and visualization of big data. Mike Driscoll started off with a useful categorization for data size. "Small Data" (<10GB) fits in the memory of...

Read more »

Pegging your multicore CPU in Revolution R, Good and Bad

June 9, 2010

I take an almost unhealthy pleasure in pushing my computer to its limits. This has become easier with Revolution R and its free license for academic use. One of its best features is a debugger that allows you to step through …

Read more »

Efficient Mixed-Model Association eXpedited (EMMAX) to Simultaneously Account for Relatedness and Stratification in Genome-Wide Association Studies

June 9, 2010

A few months ago I covered an algorithm called EMMA (Efficient Mixed-Model Association), implemented in R, for simultaneously correcting for both population stratification and relatedness in an association study. This method/software is very useful because ...

Read more »

Stratified sampling

June 9, 2010

The recently arXived paper of Goldstein, Rinott and Scarsini studies the impact of refining a partition on the precision of a stratified maximisation/integration Monte Carlo approach. Quite naturally, if the partition gets improved, simulating points in each set of the partition can only improve the quality of the approximation, whether the problem is in maximising

Read more »
