March 2016

Confidence Intervals for Random Forests

March 3, 2016 | Joseph Rickert

Random Forests, the "go to" classifier for many data scientists, is a fairly complex algorithm with many moving parts that introduces randomness at different levels. Understanding exactly how the algorithm operates requires some work, and assessing how well a Random Forests model fits the data is a ... [Read more...]

Where People Live

March 3, 2016 | arthur charpentier

There was an interesting map on reddit this morning, with a visualisation of the latitude and longitude of where people live on Earth. So I tried to reproduce it. To compute the density, I used a kernel-based approach: library(maps); data("world.cities"); X=world.cities[,c("lat","pop")]; liss=...
[Read more...]

Declutter a shiny report’s code v2.0

March 3, 2016 | Steph

A year ago I wrote about a way to declutter shiny report code that involved putting objects into a sourced file; however, at that point in time the solution was a bit brittle and clunky. Now there’s a better way to develop shiny applications – shiny modules. In October, RStudio ... [Read more...]

How do I re-arrange??: Ordering a plot re-revisited

March 2, 2016 | tylerrinker

Several years back I wrote a two-part blog series in response to seeing questions about plotting and reordering on listservs, talkstats.com, and stackoverflow. Part I discussed the basics of reordering plots by reordering factor levels. The essential gist was: …
[Read more...]

Semi-finished

March 2, 2016 | R on Gianluca Baio

I’ve finally managed to have a reasonably functional release for SWSamp, my package for simulation-based sample size calculations, specifically (but not necessarily just!) for a Stepped Wedge design trial. There are still a few details that we need to polish and more importantly we need to work on the ...
[Read more...]

New R Package: rmote

March 2, 2016 | ryan hafen

I often find myself in the situation where I need to log into a remote machine to do analysis in R but do not have a satisfactory way to view output of graphics being generated in the remote session. There are many possible ways to work graphically wi... [Read more...]

Introducing Package ‘fuzztest’

March 2, 2016 | cloudcell

This is a tool for code fault analysis. I built it to automate the most boring part of my debugging process. The package automates test setup and logging and visualizes function exit states in a way that simplifies identification of root causes of software defects. Fuzzing is implemented by random ...
[Read more...]

Win-Vector video courses: price/status changes

March 2, 2016 | John Mount

Win-Vector LLC has been offering a couple of online video courses on the topics of data science and A/B testing (both using R). These are high quality courses and well worth the money and time needed to work through them closely (with all materials distributed on GitHub). Our current ... [Read more...]

How to Use R to Scrape Tweets: Super Tuesday 2016

March 2, 2016 | Kris Eberwein

Super Tuesday 2016 has come and gone, and we have most of the election results, but what was the American public saying on Twitter? The twitteR package for R allows you to scrape tweets from Twitter’s API and use them to perform sentiment analysis. The Plotly chart below shows what the ... [Read more...]

Data Science Virtual Machine updated with Microsoft R Server

March 2, 2016 | David Smith

Microsoft has updated the Data Science Virtual Machine, a data science toolkit-in-a-box that you can easily spin up on the Microsoft Azure cloud service. The virtual machine now comes pre-configured with Microsoft R Server Developer Edition (upgraded from Microsoft R Open), Anaconda Python, Jupyter notebooks for Python and R, Visual ... [Read more...]

mefa4 R package update

March 2, 2016 | Peter Solymos

The mefa4 R package is aimed at efficient manipulation of very big data sets, leveraging sparse matrices thanks to the Matrix package. The recent update (version 0.3-3) of the package includes a bug fix and a few new functions for comparing sets and finding dominant features in compositional data as described in ... [Read more...]

R projects at Google Summer of Code 2016

March 1, 2016 | Gergely Daróczi

This is the 12th year Google is offering grants to students living (almost) anywhere in the world for working on various open-source projects over the summer, and R was again selected as a GSoC mentor organization, just like in e.g. 2008, 2009, 2010, 2011, 2012, 2013, 2014 or 2015. In short, Google offers $5,000 to the accepted students to work ... [Read more...]

Innumeracy, Statistics and R

March 1, 2016 | matloff

A couple of years ago, when an NPR journalist was interviewing me, the conversation turned to quantitative matters. The reporter said, only half jokingly, “We journalists are innumerate and proud.” :-) Sometimes it shows, badly. This morning a radio reporter stated, “Hillary Clinton beat Bernie Sanders among South Carolina ... [Read more...]

RcppEigen 0.3.2.8.1

March 1, 2016 | Thinking inside the box

A good week after the update to Eigen 3.2.8 in our RcppEigen package 0.3.2.8.0, we have another (local) update with RcppEigen 0.3.2.8.1 which should fix another UBSAN issue. This new version is now on CRAN and in Debian. Big thanks once again to Yixu... [Read more...]