Blog Archives

Fast parallel computing with Intel Phi coprocessors

May 19, 2015
By
Fast parallel computing with Intel Phi coprocessors

by Andrew Ekstrom Recovering physicist, applied mathematician and graduate student in applied Stats and systems engineering We know that R is a great system for performing statistical analysis. The price is quite nice too ;-) . As a graduate student, I need a cheap replacement for Matlab and/or Maple. Well, R can do that too. I’m running a large...

Read more »

A first look at htmlwidgets

May 14, 2015
By

by Joseph Rickert A strong case can be made that base R graphics supplemented with either the lattice library or ggplot2 for plotting by subgroups provides everything a statistician might need for both exploratory data analysis and for developing clear, crisp for communicating results. However, it is abundantly clear that web based graphics, driven to a large extent by...

Read more »

Using Azure as an R data source, Part 1

May 12, 2015
By
Using Azure as an R data source, Part 1

by Gregory Vandenbrouck Software Engineer at Microsoft This post is the first in a series that covers pulling data from various Windows Azure hosted storage solutions (such as MySQL, or Microsoft SQL Server) to an R client on Windows or Linux. We’ll start with a relatively simple case of pulling data from SQL Azure to an R client on...

Read more »

Digging up embedded plots

May 7, 2015
By
Digging up embedded plots

by Joseph Rickert The following multi-panel graph, which graces the cover of the most recent issue of the Journal of Computational and Graphical Statistics ,JCGS, (Vol 24, Num 1, March 2015) is from the paper by Grolemund and Wickham entitled Visualizing Complex Data With Embedded Plots. The four plots are noteworthy for a couple or reasons: They present superb...

Read more »

Data Science in HR

May 5, 2015
By
Data Science in HR

by Joseph Rickert Last year in a post on interesting R topics presented at the JSM I described how data scientists in Google's human resources department were using R and predictive analytics to better understand the characteristics of its workforce. Google may very well have done the pioneering work, but predictive analytics for HR applications is going mainstream. In...

Read more »

The First NY R Conference

April 30, 2015
By
The First NY R Conference

by Joseph Rickert Last Friday and Saturday the NY R Conference briefly lit up Manhattan's Union Square neighborhood as the center of the R world. You may have caught some of the glow on twitter. Jared Lander, volunteers from the New York Open Statistical Programming Meetup along with the staff at Workbench (the conference venue) set the bar pretty...

Read more »

Situational Baseball: Analyzing Runs Potential Statistics

April 28, 2015
By
Situational Baseball: Analyzing Runs Potential Statistics

by Mark Malter After reading the book, Analyzing Baseball with R, by Max Marchi and Jim Albert, I decided to expand on some of their ideas relating to runs created and put them into an R shiny app . The Server and UI code are linked at the bottom of the Introduction tab. I downloaded the Retrosheet play-by-play data...

Read more »

The new science journalism and open science

April 23, 2015
By

by Joseph Rickert The New York Times is quietly changing the practice of science journalism. The Tuesday April 21, 2015 article: Ebola Lying in Wait, reports on "A growing body of scientific clues - some ambiguous, other substantive" that the Ebola virus may have lain dormant in West African rain forest for years before igniting last year's outbreak. In...

Read more »

R for more powerful clustering

April 21, 2015
By
R for more powerful clustering

by Vidisha Vachharajani Freelance Statistical Consultant R showcases several useful clustering tools, but the one that seems particularly powerful is the marriage of hierarchical clustering with a visual display of its results in a heatmap. The term “heatmap” is often confusing, making most wonder – which is it? A "colorful visual representation of data in a matrix" or "a...

Read more »

R User Group Meetings this week in the Bay Area and around the world

April 16, 2015
By
R User Group Meetings this week in the Bay Area and around the world

by Joseph Rickert Tracking R user group meetings is a good way to stay informed about what's happening in the R world. On Tuesday the Bay Area useR Group (BARUG) met at AdRoll in San Francisco. It was a mini-conference with 6 talks: Bryan Galvin our host at AdRoll (many thanks for the pizza and beer) kicked off the...

Read more »