yorkr crashes the IPL party! – Part 3!

April 2, 2016
By
yorkr crashes the IPL party! – Part 3!

Introduction “I’m sorry, if you were right, I’d agree with you.” Robin Williams Get your facts first. Then you can distort them as you please. Mark Twain Do not take life too seriously. You will never get out of it alive. Elbert Hubbard This is the 3rd post in the “yorkr crashes the IPL party!”

Read more »

yorkr crashes the IPL party ! – Part 1

April 2, 2016
By
yorkr crashes the IPL party ! – Part 1

  Where tireless striving stretches its arms towards perfection Where the clear stream of reason has not lost its way Into the dreary desert sand of dead habit Rabindranath Tagore Introduction In this post, my R package yorkr crashes the IPL party! In my earlier posts I had already created functions for handling Twenty20 matches.

Read more »

yorkr crashes the IPL party! – Part 2

April 2, 2016
By
yorkr crashes the IPL party! – Part 2

Most people say that it is the intellect which makes a great scientist. They are wrong: it is character. Albert Einstein *Science is organized knowledge. Wisdom is organized life.“* Immanuel Kant If I have seen further, it is by standing on the shoulders of giants Isaac Newton Valid criticism does you a favor. Carl Sagan

Read more »

Introducing cricket package yorkr: Part 1- Beaten by sheer pace!

April 2, 2016
By
Introducing cricket package yorkr: Part 1- Beaten by sheer pace!

“We need to regard statistical intuition with proper suspicion and replace impression formation by computation wherever possible” “We are pattern seekers, believers in a coherent world” “The hot hand is entirely in the eyes of the beholders, who are consistently” “too quick to perceive order and causality in randomeness. The hot hand is a” “massive

Read more »

Election analysis contest entry part 1 – introducing the nzelect R package

April 2, 2016
By
Election analysis contest entry part 1 – introducing the nzelect R package

The contest Inspired by Ari Lamstein’s R Election Analysis Contest, I’ve fast-tracked a project that’s been at the back of my mind for a while, to make available in a friendly, tidy R package a range of data about New Zealand elections. My entry for the contest will involve 3 or 4 posts over the next week or...

Read more »

Diabetes Drug Switching: Why do people switch from one Drug to another?

April 1, 2016
By

(This article was first published on sweissblaug, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on their blog: sweissblaug. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

Read more »

The adblockr package: block ads from the monetizr package

April 1, 2016
By

I was horrified to learn of the existence of the monetizr package, which adds advertisements to R functions. The package goes against the entire philosophy of open source and the spirit of the R community. Luckily, I was able to construct a fix- the a...

Read more »

WVPlots: example plots in R using ggplot2

April 1, 2016
By
WVPlots: example plots in R using ggplot2

Nina Zumel and I have been working on packaging our favorite graphing techniques in a more reusable way that emphasizes the analysis task at hand over the steps needed to produce a good visualization. The idea is: we sacrifice some of the flexibility and composability inherent to ggplot2 in R for a menu of prescribed … Continue reading...

Read more »

Two fun plots with R

April 1, 2016
By
Two fun plots with R

Data visualization with R doesn't always have to be serious. Here are a couple of fun charts created recently by R users. First, here's a minimalist rendition of the characters in The Simpsons, by an anonymous blogger: And from Alex Whan, here's a near-perfect recreation of the classic cover of the Joy Division album Unknown Pleasures, based on simulated...

Read more »

satRdays are go!

April 1, 2016
By
satRdays are go!

Steph Locke has already written about this last week, so just a quick follow-up on R-bloggers to share the great news with a wider audience: the R Consortium agreed to the support satRdays!A quick reminder on the project:satRdays are SQLSaturday-i...

Read more »

Version 2.1.1 Released

March 31, 2016
By
Version 2.1.1 Released

Hi Folks, ggtern v2.1.1 has just been submitted to CRAN. This includes a number of minor bug fixes, and an additional annotation, permitting the rendering of raster data, say for the inclusion of images. This post will demonstrate the use of this requested feature. Demonstration of Raster Annotation Several people have asked me how they The post

Read more »

RStudio addins manager

March 31, 2016
By
RStudio addins manager

RStudio addins let you execute a bit of R code or a Shiny app through the RStudio IDE, either via the Addins dropdown menu or with a keyboard shortcut. This package is an RStudio addin for managing other addins. To run these addins, you need the latest version of RStudio. Installation The package can be

Read more »

The monetizr package: make money on your open source R packages

March 31, 2016
By

I’ve had the great privilege to be a small part of the R open source community, contributing packages like broom, gganimate, fuzzyjoin, and ggfreehand. In the process I’ve become friends and colleagues with brilliant statisticians and data scientists and learned to engage with data in powerful ways. But there’s one thing that my colleagues and I...

Read more »

densratio: New R Package for Density Ratio Estimation

March 31, 2016
By
densratio: New R Package for Density Ratio Estimation

1. OverviewDensity ratio estimation is described as follows: for given two data samples $x$ and $y$ from unknown distributions $p(x)$ and $q(y)$ respectively, estimate $$ w(x) = frac{p(x)}{q(x)} $$ where $x$ and $y$ are $d$-dimensional real numbers.The estimated density ratio function $w(x)$ can be used in many applications such as the inlier-based outlier detection and...

Read more »

What’s new on CRAN: March 2016

March 31, 2016
By
What’s new on CRAN: March 2016

by Joseph Rickert Packages continue to flood into CRAN at a rate the challenges the sanity of anyone trying to keep up with what's new. So far this month, more than 190 packages have been added. Here is a my view of what's interesting in this March madness. The launch_tutorial() function from the RtutoR package by Anup Nair launches...

Read more »

Another Route to Jupyter Notebooks – Azure Machine Learning

March 31, 2016
By
Another Route to Jupyter Notebooks – Azure Machine Learning

In much the same way that the IBM DataScientist Workbench seeks to provide some level of integration between analysis tools such as Jupyter notebooks and data access and storage, Azure Machine Learning studio also provides a suite of tools for accessing and working with data in one location. Microsoft’s offering is new to me, but

Read more »

Updated R & BLAS Timings

March 30, 2016
By
Updated R & BLAS Timings

With the recent releases of R 3.2.4 and OpenBLAS 2.17, I decided it was time to re-benchmark R speed. I’ve settled on a particular set of tests, based on my experience as well as some of Simon Urbanek’s work which I separated into two groups: those focusing on BLAS-heavy operations and those which do not. Read the full...

Read more »

Real-time model scoring for streaming data – a prototype based on Oracle Stream Explorer and Oracle R Enterprise

March 30, 2016
By
Real-time model scoring for streaming data – a prototype based on Oracle Stream Explorer and Oracle R Enterprise

Whether applied to manufacturing, financial services, energy, transportation, retail, government, security or other domains, real-time analytics is an umbrella term which covers a broad spectrum of capabilities (data integration, analytics, business intelligence) built on streaming input from multiple channels. Examples of such channels are: sensor data, log data, market data, click streams, social media and monitoring imagery. Key metrics...

Read more »

pacman Ver 0.4.1 Release

March 30, 2016
By
pacman Ver 0.4.1 Release

It was just over a year ago that Dason Kurkiewicz and I released pacman to CRAN.  We have been developing the package on GitHub in the past 14 months and are pleased to announce these changes have made their way … Continue reading →

Read more »

Unit testing with R

March 30, 2016
By

I've been introduced to unit testing while working with colleagues on quite a big project for which we use Python. At first I was a bit skeptical about the need of writing unit tests, but now I must admit that I am seduced by the idea and by the huge time savings it allows. Naturally, I was wondering...

Read more »

List of useful RStudio addins made by useRs

March 30, 2016
By

RStudio’s new feature of supporting addins is great. It’s about 2 months old now, and I think one of the main problems right now is discoverability: there’s just no easy way to find addins or publish your own. As a first attempt at making addins more discoverable, I started a GitHub repo listing some addins, with the hope that other...

Read more »

Jupyter Notebooks with R in Azure Machine Learning Studio

March 30, 2016
By
Jupyter Notebooks with R in Azure Machine Learning Studio

by Andrie de Vries Earlier today Microsoft announced that Jupyter Notebooks are now available with the R Kernel as a service in Azure Machine Learning (ML) Studio. I wrote about Jupyter Notebooks in September 2015 (Using R with Jupyter Notebooks), where I noted some of the great benefits of using notebooks: Jupyter is an easy to use and convenient...

Read more »

Introducing a Weekly R / Python / JS / etc Vis Challenge!

March 30, 2016
By
Introducing a Weekly R / Python / JS / etc Vis Challenge!

Per a suggestion, I’m going to try to find a neat data set (prbly one from @jsvine) to feature each week and toss up some sample code (99% of the time prbly in R) and offer up a vis challenge. Just reply in the comments with a link to a gist/repo/rpub/blog/etc (or post directly, though

Read more »

An awesome RStudio addin for selecting colours, and another for adding marginal density plots to ggplot2

March 30, 2016
By
An awesome RStudio addin for selecting colours, and another for adding marginal density plots to ggplot2

TL;DR: There’s a colour picker addin in shinyjs and a ggplot2 marginal plots addin in ggExtra. Any R user who hasn’t been spending the past 2 months under a rock should know by now about RStudio’s new exciting features: addins and gadgets. (In case you don’t know, here’s a summary: Gadgets are simply Shiny apps that return a value,...

Read more »

How to check Likert scale summaries for plausibility

March 30, 2016
By

Suppose you are reading a paper that uses Likert scale responses. The paper reports the mean, standard deviation, and number of responses. If we are -- for some reason -- suspicious of a paper, we might ask, "Are these summary statistics possible for t...

Read more »

european R users meeting (eRum) 2016 – Call for Workshops deadline (by 15th of April)

March 30, 2016
By
european R users meeting (eRum) 2016 – Call for Workshops deadline (by  15th of April)

deadline: 15th of April   submission form and details: https://docs.google.com/forms/d/1ZZwSPUB4-31nUYOhjdhArOF1kjdXrQW_XPWpfd0nqWU/viewform Info about the conference: eRum 2016 will take place in the beautiful city of Poznań, Poland, betweenOctober 12th and 14th, and we already have confirmed invited speakers such asRasmus Bååth, Romain Francois,  Ulrike Grömping, Matthias Templ, andHeather Turner, as well as high-level local R users. European R users meeting (eRum)...

Read more »

GTC 2016

March 29, 2016
By
GTC 2016

I will be an invited speaker at GTC 2016, a large conference on GPU computation. The main topic will be usage of GPU in conjunction with R, and I will also speak on my Software Alchemy method, especially in relation to GPU computing.. GTC asked me to notify my “network” about the event, and this … Continue reading...

Read more »

Election tRends: An interactive US election tracker (using Shiny and Plotly)

March 29, 2016
By

Guest post by Jonathan Sidi Introduction The US primaries are coming on fast with almost 120 days left until the conventions. After building a shinyapp for the Israeli Elections I decided to update features in the app tried out plotly in the shiny framework. As a casual voter trying to gauge the true temperature of … Continue reading...

Read more »

Feather: A Fast On-Disk Format for Data Frames for R and Python, powered by Apache Arrow

March 29, 2016
By
Feather: A Fast On-Disk Format for Data Frames for R and Python, powered by Apache Arrow

Wes McKinney, Software Engineer, Cloudera Hadley Wickham, Chief Scientist, RStudio This past January, we (Hadley and Wes) met and discussed some of the systems challenges facing the Python and R open source communities. In particular, we wanted to see if there were some opportunities to collaborate on tools for improving interoperability between Python, R, and

Read more »

Sponsors