RcppArmadillo 0.4.650.1.1 (and also 0.4.650.2.0)

February 26, 2015
By

A new Armadillo release 4.650.1 was released by Conrad a few days ago. Armadillo is a powerful and expressive C++ template library for linear algebra aiming towards a good balance between speed and ease of use with a syntax deliberately close to a Matlab. It turned out that this release had one shortcoming with respect to the C++11...

Read more »

Compiling CoffeeScript in R with the js package

February 26, 2015
By
Compiling CoffeeScript in R with the js package

A new release of the js package has made it’s way to CRAN. This version adds support for compiling Coffee Script....

Read more »

reshape: from long to wide format

February 26, 2015
By
reshape: from long to wide format

This is to continue on the topic of using the melt/cast functions in reshape to convert between long and wide format of data frame. Here is the example I found...

Read more »

Why I think twice before editing plots in Powerpoint, Illustrator, Inkscape, etc.

February 26, 2015
By
Why I think twice before editing plots in Powerpoint, Illustrator, Inkscape, etc.

Thanks to a nice post by Meghan Duffy on the Dynamic Ecology blog (How do you make figures?), we have some empirical evidence that many figures made in...

Read more »

Using and Abusing Data Visualization: Anscombe’s Quartet and Cheating Bonferroni

February 26, 2015
By
Using and Abusing Data Visualization: Anscombe’s Quartet and Cheating Bonferroni

Anscombe’s quartet comprises four datasets that have nearly identical simple statistical properties, yet appear very different when graphed. Each dataset consists of eleven (x,y) points. They were

Read more »

Announcing shinyapps.io General Availability

February 26, 2015
By
Announcing shinyapps.io General Availability

RStudio is excited to announce the general availability (GA) of shinyapps.io. Shinyapps.io is an easy to use, secure, and scalable hosted service already being used by thousands of professionals...

Read more »

Aggregation

February 26, 2015
By

Aggregation splits data into subsets, computes summary statistics on each subset, and reports the results in a conveniently summarized form. The aggregate function is one of the most capable...

Read more »

The Downside of Rankings-Based Strategies

February 26, 2015
By
The Downside of Rankings-Based Strategies

This post will demonstrate a downside to rankings-based strategies, particularly when using data of a questionable quality (which, unless one … Continue reading →

Read more »

Collaborative Computing with distcomp

February 26, 2015
By
Collaborative Computing with distcomp

by Joseph Rickert Distcomp, a new R package available on GitHub from a group of Stanford researchers has the potential to significantly advance the practice of collaborative computing with...

Read more »

Fuzzy String Matching – a survival skill to tackle unstructured information

February 26, 2015
By
Fuzzy String Matching – a survival skill to tackle unstructured information

“The amount of information available in the internet grows every day” thank you captain Obvious! by now even my grandma is aware of that!. Actually, the internet...

Read more »

R: How to Layout and Design an Infographic

February 26, 2015
By
R: How to Layout and Design an Infographic

As promised from my recent article, here's my tutorial on how to layout and design an infographic in R. This article will serve as a template for more infographic...

Read more »

Adobe Sitecatalyst API and R: integrate reports with the SAINT classification file

February 26, 2015
By
Adobe Sitecatalyst API and R: integrate reports with the SAINT classification file

From original post @ http://analyticsblog.mecglobal.it/

Read more »

RMySQL version 0.10.2: Full SSL Support

February 25, 2015
By
RMySQL version 0.10.2: Full SSL Support

RMySQL version 0.10.2 has appeared on CRAN. This is a maintenance release to streamline the build process on various platforms. Most importantly,...

Read more »

Talking about R, Data Science and Microsoft on theCUBE

February 25, 2015
By

It was a pleasure to appear live on theCUBE last week while attending the Strata conference. In my interview with Jeff Kelly and John Furrier, I talked about the...

Read more »

Announcing: Introduction to Data Science video course

February 25, 2015
By
Announcing: Introduction to Data Science video course

Win-Vector LLC’s Nina Zumel and John Mount are proud to announce their new data science video course Introduction to Data Science is now available on Udemy. We designed the...

Read more »

Using Hadoop Streaming API to perform a word count job in R and C++

February 25, 2015
By

by Marek Gagolewski, Maciej Bartoszuk, Anna Cena, and Jan Lasek (Rexamine). Introduction In a recent blog post we explained how we managed to set up a working Hadoop environment...

Read more »

How Big Is The Vatican City?

February 24, 2015
By
How Big Is The Vatican City?

Dici che il fiume trova la via al mare e come il fiume giungerai a me (Miss Sarajevo, U2) One way to calculate approximately the area of some place is to...

Read more »

Visualizing Clusters

February 24, 2015
By
Visualizing Clusters

Consider the following dataset, with (only) ten points x=c(.4,.55,.65,.9,.1,.35,.5,.15,.2,.85) y=c(.85,.95,.8,.87,.5,.55,.5,.2,.1,.3) plot(x,y,pch=19,cex=2) We want to get – say – two clusters. Or more specifically, two sets of observations, each of...

Read more »

RStudio v0.99 Preview: Data Viewer Improvements

February 24, 2015
By
RStudio v0.99 Preview: Data Viewer Improvements

RStudio’s data viewer provides a quick way to look at the contents of data frames and other column-based data in your R environment. You invoke it by clicking on...

Read more »

Monitoring progress of a foreach parallel job

February 24, 2015
By
Monitoring progress of a foreach parallel job

by Andrie de Vries R has strong support for parallel programming, both in base R and additional CRAN packages. For example, we have previously written about foreach and parallel...

Read more »

Minimal examples help

February 24, 2015
By

The other day I got stuck working with a huge data set using data.table in R. It took me a little while to realise that I had...

Read more »

Strata 2015: Keynote roundup

February 23, 2015
By

I spent last week at the Strata 2015 Conference in San José, California. As always, Strata made for a wonderful conference to catch up on the latest developments on...

Read more »

How to learn R: A flow chart

February 23, 2015
By
How to learn R: A flow chart

I often find myself giving people suggestions about how to learn R, so I decided to put together a flow chart. This is geared toward typical psychology or cognitive...

Read more »

Make prettier documents by reusing chunks in RMarkdown

February 23, 2015
By
Make prettier documents by reusing chunks in RMarkdown

No revelations here, just a little R tip for generating more readable documents. There are times when I want to show code in a document, but I don’t want...

Read more »

RStudio v0.99 Preview: Vim Mode Improvements

February 23, 2015
By
RStudio v0.99 Preview: Vim Mode Improvements

RStudio’s code editor includes a set of lightweight Vim key bindings. You can turn these on in Tools | Global Options | Code | Editing: For those not familiar,...

Read more »

RStudio v0.99 Preview: Code Completion

February 23, 2015
By
RStudio v0.99 Preview: Code Completion

We’re busy at work on the next version of RStudio (v0.99) and this week will be blogging about some of the noteworthy new features. If you want to try...

Read more »

Resources for Learning Data Manipulation in R, SAS and Microsoft Excel

Resources for Learning Data Manipulation in R, SAS and Microsoft Excel

I had the great pleasure of speaking to the Department of Statistics and Actuarial Science at Simon Fraser University on last Friday to share my career advice with its...

Read more »

The Logical-Invest “Universal Investment Strategy”–A Walk Forward Process on SPY and TLT

February 23, 2015
By
The Logical-Invest “Universal Investment Strategy”–A Walk Forward Process on SPY and TLT

I’m sure we’ve all heard about diversified stock and bond portfolios. In its simplest, most diluted form, it can be … Continue reading →

Read more »

Launching DataScience.Vegas Blog

February 23, 2015
By

We are glad to announce the launch of DataScience.Vegas as a blog that aggregates all the events, news and information impacting the Las Vegas data science community. Our community has...

Read more »