Posts Tagged ‘ packages ’

R for Data Mining

June 6, 2011
By

Statistics and data mining often get bundled together, but (in my opinion), they're generally different practices with different goals. As a language designed for statistics, much of R's core functionality is focused on exploring and understanding data: model design, inference, and visualization. But when your goal is simply to get the best predictions from a big data set (without...

Read more »

Using R for Map-Reduce applications in Hadoop

May 4, 2011
By

Data Scientist Antonio Piccolboni recently published this comparison of the various language and interfaces available for programming Big Data analysis tasks in the map-reduce framework. The interfaces he reviewed included: Java Hadoop (mature and efficient, but verbose and difficult to program) Cascading (brings an SQL-like flavor to Java programming with Hadoop) Pipes/C++ (a C++ interface to programming on Hadoop)...

Read more »

Exporting R graphics as LaTeX code – version 0.6.1 of the tikzDevice package is out

April 17, 2011
By

(Guest post on R-bloggers by Charlie Sharpsteen) Cameron and I are pleased to announce version 0.6.0 of the tikzDevice package which should be available shortly at your local CRAN mirror! The tikzDevice makes it possible to export R graphics as LaTeX code that can be included in other documents or compiled into stand alone figures. The full power of...

Read more »

Slides and replay for PMML webinar

April 15, 2011
By

If you missed Wednesday's webinar on Deploying Predictive Analytics with Revolution R, PMML and ADAPA, the presentation slides and the replay (in WMV, reg. req'd) format are now available for download. Here are some additional references, from the closing slide: How Revolution R and ADAPA work together The PMML Package reference White Paper: Deploying Advanced Analytics Using R and...

Read more »

Journal of Statistical Software. Vol. 40

April 12, 2011
By

The latest edition of the Journal of Statistical Software is out, with plenty of interesting articles for R users. A must-read is Hadley Wickham's article on "The Split-Apply-Combine Strategy for Data Analysis", which makes a compelling argument for the use of the plyr package to partition datasets and apply sub-group analyses. Also, anyone who hasn't yet purchased a copy...

Read more »

Article about plyr published in JSS, and the citation was added to the new plyr (version 1.5)

April 11, 2011
By
Article about plyr published in JSS, and the citation was added to the new plyr (version 1.5)

The plyr package (by Hadley Wickham) is one of the few R packages for which I can claim to have used for all of my statistical projects. So whenever a new version of plyr comes out I tend to be excited about it (as was when version 1.2 came out with support for parallel processing)

Read more »

How to search the documentation of all CRAN packages

April 5, 2011
By
How to search the documentation of all CRAN packages

Ever tried to find a specific function or algorithm in R that you're sure that someone has implemented in a CRAN package, but you don't know which one? Now you can find it, in the R Packages section of R community site inside-R.org. The R Packages section now includes the ability to browse and search help files from R...

Read more »

“An R package” or “A R package”

March 26, 2011
By
“An R package” or “A R package”

I’m currently writing some lecture notes on R and I used the phrase “a R package” without thinking. Since the word following the article “a” was a consonant, I automatically went for “a” instead of “an”. The problem is that “R” sounds likes a vowel, so “a R package” grates on the listener. The correct

Read more »

Graphical Display of R Package Dependencies

March 23, 2011
By
Graphical Display of R Package Dependencies

In some work that I am currently involved in, we have to decide which GUI engine we should use. As an obvious starter, we decided to have a look at what other people are using in their packages. While cran helpfully displays all the R packages that are available, it doesn’t (I don’t think), give

Read more »

Parallel processing in R for Windows

March 4, 2011
By

The doSMP package (and its companion package, revoIPC), previously bundled only with Revolution R, is now available on CRAN for use with open-source R under the GPL2 license. In short, doSMP makes it easy to do SMP parallel processing on a Windows box with multiple processors. (It works on Mac and Linux too, but it's been relatively easy to...

Read more »