packages

Benchmarking bigglm

November 13, 2012 | Joseph Rickert

By Joseph Rickert In a recent blog post, David Smith reported on a talk that Steve Yun and I gave at STRATA in NYC about building and benchmarking Poisson GLM models on various platforms. The results presented showed that the rxGlm function from Revolution Analytics’ RevoScaleR package running on a ... [Read more...]

RStudio releases Shiny

November 12, 2012 | David Smith

RStudio has released a new package for R. Shiny allows R developers to build simple interactive Web-based interfaces for R scripts, using only R code (no JavaScript development required!). You can see some examples of Shiny in action in this blog post, and there are more details about Shiny's capabilities ... [Read more...]

3-D animation of the changing Antarctic ice sheet

September 27, 2012 | David Smith

Last month we shared an visualization showing the changing extent of Arctic sea-ice. This visualization by the multinational Commission for the Conservation of Antarctic Marine Living Resources (CCAMLR) switches the view to the Southern pole and takes the visualization to a whole new level, by animating it in 3-D: The ... [Read more...]

Visualize complex data with subplots

September 14, 2012 | David Smith

Today's guest post comes from Garrett Grolemund, a software developer at RStudio — ed. I think of graphs as a type of visual summary for data. Yet I rarely see graphs used this way within visualizations. Consider tile plots. They group data into 2d bins and then summarize each group with ... [Read more...]

Integrate data and reporting on the Web with knitr

September 11, 2012 | David Smith

Today's guest post comes from Yihui Xie, author of the knitr package — ed. Hi, this is Yihui Xie, and I'm guest posting on the Revolutions blog to talk about one aspect of the knitr package: how we can integrate data analysis and reporting in R with the Web. This post ... [Read more...]

RStan: Fast, multilevel Bayesian modeling in R

August 31, 2012 | David Smith

For the last decade or so, the go-to software for Bayesian statisticians has been BUGS (and later the open-source incarnation, OpenBugs, or JAGS). BUGS is used for multi-level modeling: using a specialized notation, you can define random variables of various distributions, set Bayesian priors for their parameters, and create the ... [Read more...]

Creating beautiful reports from R with knitr

August 21, 2012 | David Smith

People use the R language every day to create the elements of reports: tables, charts, analyses, and forecasts. But assembling all of that information into a print-ready document laid out with text can a hassle. You can cut-and-paste all of the elements into Word, but then what do you do ... [Read more...]

Getting Started with R and Hadoop

August 20, 2012 | David Smith

Last week's meeting of the Chicago area Hadoop User Group (a joint meeting the Chicago R User Group, and sponsored by Revolution Analytics) focused on crunching Big Data with R and Hadoop. Jeffrey Breen, president of Atmosphere Research Group, frequently deals with large data sets in his airline consulting work, ... [Read more...]

The top 10 critical packages on CRAN

August 8, 2012 | David Smith

While most R packages on CRAN are designed to be used by an R user directly, a few packages are designed to be used by other package developers. (And some packages are so useful that they're regularly used by both camps.) When a package author publishes a package to CRAN, ... [Read more...]

Faster R in Hadoop: rmr 1.3 now available

July 23, 2012 | David Smith

The RHadoop project continues the Big Data integration of R and Hadoop, with a new update to its rmr package. Version 1.3 of rmr improves the performance of map-reduce jobs for Hadoop written in R. New features include: An optional vectorized API for efficient R programming when dealing with small records. ... [Read more...]

R Journal, June 2012

July 20, 2012 | David Smith

The June 2012 issue of the R Journal, the peer-reviewed open-journal about R packages and applications of R, is now available. This issue includes articles about: Efficiently calling C functions from R without the need for wrapper code Using clusters of Macs running Apple Xgrid for parallel distributed processing with R ... [Read more...]

The R packages in a data scientist’s toolbox

July 17, 2012 | David Smith

John Myles White, self-described "statistics hacker" and co-author of "Machine Learning for Hackers" was interviewed recently by The Setup. In the interview, he describes his some of his go-to R packages for data science: Most of my work involves programming, so programming languages and their libraries are the bulk of ... [Read more...]

Using integer programming in R to optimize cargo loads

July 16, 2012 | David Smith

Linear Programming is a mathematical technique used to find the values of some variables (within the bounds of some defined constraints) to find the maximum value of a quantity. For example, consider this problem from the FishyOperations blog: A trading company is looking for a way to maximize profit per ... [Read more...]

Load Packages Automatically in RStudio

June 6, 2012 | inkhorn82

I recently finished a long stretch of work on a particular project that required me to draw upon four R packages.  Each time I got back to my work on the project, I’d have to load the packages manually, as … Continue reading → [Read more...]

Facebook-class social network analysis with R and Hadoop

May 25, 2012 | David Smith

In computing, social networks are traditionally represented as graphs: a connection of nodes (people), pairs of which may be connected by edges (friend relationships). Visually, the social networks can then be represented like this: Social network analysis often amounts to calculating the statistics on a graph like this: the number ... [Read more...]
1 2 3 5

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)