February 2015

Aggregation

February 26, 2015 | 0 Comments

Aggregation splits data into subsets, computes summary statistics on each subset, and reports the results in a conveniently summarized form. The aggregate function is one of the most capable functions in the scidb package. The package overloads R’s standard aggregate function for SciDB arrays, using reasonably standard R syntax ... [Read more...]

Collaborative Computing with distcomp

February 26, 2015 | 0 Comments

by Joseph Rickert Distcomp, a new R package available on GitHub from a group of Stanford researchers has the potential to significantly advance the practice of collaborative computing with large data sets distributed over separate sites that may be unwilling to explicitly share data. The fundamental idea is to be ... [Read more...]

R: How to Layout and Design an Infographic

February 26, 2015 | 0 Comments

As promised from my recent article, here's my tutorial on how to layout and design an infographic in R. This article will serve as a template for more infographic design that I plan to share on future posts. Hence, we will go through the following sect...
[Read more...]

Generating ANOVA-like table from GLMM using parametric bootstrap

February 26, 2015 | 0 Comments

This article may also be found on RPubs: http://rpubs.com/hughes/63269 In the list of worst to best way to test for effect in GLMM the list on http://glmm.wikidot.com/faq state that parametric bootstrapping is among the best options. PBmodcomp in the pbkrtest package implement such ...
[Read more...]

Mapping religious adherence in the US with CartoDB

February 25, 2015 | 0 Comments

Before coming to TCU, I worked as a data analyst for the Church Pension Group, which manages the retirement funds and provides other financial services for the Episcopal Church. I was part of a small research group that completed both internal and public-facing studies using the company’s data. You ... [Read more...]

RMySQL version 0.10.2: Full SSL Support

February 25, 2015 | 0 Comments

RMySQL version 0.10.2 has appeared on CRAN. This is a maintenance release to streamline the build process on various platforms. Most importantly, the Windows/OSX binary packages from CRAN are now built with full SSL support. On Linux, the configure script has been updated a bit to automatically find the mysql ...
[Read more...]

Talking about R, Data Science and Microsoft on theCUBE

February 25, 2015 | 0 Comments

It was a pleasure to appear live on theCUBE last week while attending the Strata conference. In my interview with Jeff Kelly and John Furrier, I talked about the rising popularity of R, the applications of data science, and the recent announcement of Microsoft acquiring Revolution Analytics. I also gushed ... [Read more...]

Max Kuhn’s Talk on Predictive Modeling

February 25, 2015 | 0 Comments

Max Kuhn, Director of Nonclinical Statistics of Pfizer and also the author of Applied Predictive Modeling joined us on February 17, 2015 and shared his experience with Data Mining […] [Read more...]

Announcing: Introduction to Data Science video course

February 25, 2015 | 0 Comments

Win-Vector LLC’s Nina Zumel and John Mount are proud to announce their new data science video course Introduction to Data Science is now available on Udemy. We designed the course as an introduction to an advanced topic. The course description is: Use the R Programming Language to execute data ... [Read more...]

How Big Is The Vatican City?

February 24, 2015 | 0 Comments

Dici che il fiume trova la via al mare e come il fiume giungerai a me (Miss Sarajevo, U2) One way to calculate approximately the area of some place is to circumscribe it into a polygon of which you know its area. After that, generate coordinates inside the polygon and ...
[Read more...]

Visualizing Clusters

February 24, 2015 | 0 Comments

Consider the following dataset, with (only) ten points x=c(.4,.55,.65,.9,.1,.35,.5,.15,.2,.85) y=c(.85,.95,.8,.87,.5,.55,.5,.2,.1,.3) plot(x,y,pch=19,cex=2) We want to get – say – two clusters. Or more specifically, two sets of observations, each of them sharing some similarities. Since the number of observations is rather small, it is actually possible to ... [Read more...]

RStudio v0.99 Preview: Data Viewer Improvements

February 24, 2015 | 0 Comments

RStudio’s data viewer provides a quick way to look at the contents of data frames and other column-based data in your R environment. You invoke it by clicking on the grid icon in the Environment pane, or at the console by typing View(mydata). As part of the RStudio ...
[Read more...]

Monitoring progress of a foreach parallel job

February 24, 2015 | 0 Comments

by Andrie de Vries R has strong support for parallel programming, both in base R and additional CRAN packages. For example, we have previously written about foreach and parallel programming in the articles Tutorial: Parallel programming with foreach and Intro to Parallel Random Number Generation with RevoScaleR. The foreach package ... [Read more...]

Rare snowmelt estimation (GB)

February 24, 2015 | 0 Comments

I read Hough and Hollis’ 1997 paper recently which uses Met Office synoptic stations to estimate a magnitude – recurrence relationship for snowmelt in the UK. i.e. how often do we get how… Continue reading →
[Read more...]

Minimal examples help

February 24, 2015 | 0 Comments

The other day I got stuck working with a huge data set using data.table in R. It took me a little while to realise that I had to produce a minimal reproducible example to actually understand why I got stuck in the first place. I know, this is the ... [Read more...]

Strata 2015: Keynote roundup

February 23, 2015 | 0 Comments

I spent last week at the Strata 2015 Conference in San José, California. As always, Strata made for a wonderful conference to catch up on the latest developments on big data and data science, and to connect with colleagues and friends old and new. Having been to every Strata conference since ... [Read more...]
1 2 3 4 11

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)