Understanding data.table Rolling Joins

June 5, 2016
By

Understanding data.table Rolling JoinsRobert NorbergJune 5, 2016IntroductionRolling joins in data.table are incredibly useful, but not that well documented. I wrote this to help myself figure out how to use them and perhaps it can help you too.library(data.table)The SetupImagine we have an eCommerce website that uses a third party (like PayPal) to handle payments. We track user sessions on our website and...

Read more »

Exploring Quantum Gate operations with QCSimulator

June 5, 2016
By
Exploring Quantum Gate operations with QCSimulator

Introduction: Ever since I was initiated into Quantum Computing, through IBM’s Quantum Experience I have been hooked. My initial encounter with domain made me very excited and all wound up. The reason behind this, I think, is because there is an air of mystery around ‘Quantum’ anything.  After my early rush with the Quantum Experience,

Read more »

Bootstrap and cross-validation for evaluating modelling strategies

Bootstrap and cross-validation for evaluating modelling strategies

Modelling strategies I’ve been re-reading Frank Harrell’s Regression Modelling Strategies, a must read for anyone who ever fits a regression model, although be prepared - depending on your background, you might get 30 pages in and suddenly become convinced you’ve been doing nearly everything wrong before, which can be disturbing. I wanted to evaluate three simple modelling strategies in dealing...

Read more »

Curated list of R tutorials for Data Science

June 3, 2016
By
Curated list of R tutorials for Data Science

Here is topic wise list of R tutorials for Data Science, Time Series Analysis, Natural Language Processing and Machine Learning. This list also serves as a reference guide for several common data analysis tasks. The R Language Awesome-R Repository on GitHub R Reference Card: Cheatsheet R bloggers: blog aggregator R Resources on GitHub Awesome R

Read more »

Using geom_step

June 3, 2016
By
Using geom_step

geom_step is an interesting geom supplied by the R package ggplot2. It is an appropriate rendering option for financial market data and we will show how and why to use it in this article. Let’s take a simple example of plotting market data. In this case we are plotting the "ask price" (the publicly published … Continue reading...

Read more »

Visualizing a flood with R

June 3, 2016
By
Visualizing a flood with R

As more settlements in Texas and France are impacted by severe flooding, this is a good time to thank the hydrologists at the NOAA who forecast river level rises in advance and give residents in affected areas time to move to higher ground. Along with topgraphic, rainfall, and weather data, monitoring stations maintained by NOAA and the USGS along...

Read more »

RQGIS – integrating R with QGIS

June 3, 2016
By
RQGIS – integrating R with QGIS

This is the excerpt for your very first post.

Read more »

Images as x-axis labels (updated)

June 2, 2016
By
Images as x-axis labels (updated)

They say "if you want to find an answer on the internet, just present a wrong one as fact. Then wait." It didn't take long, actually. Despite my searches while trying to get images into x-axis labels it seems I...Continue Reading →

Read more »

Using caret to compare models

June 2, 2016
By
Using caret to compare models

by Joseph Rickert The model table on the caret package website lists more that 200 variations of predictive analytics models that are available withing the caret framework. All of these models may be prepared, tuned, fit and evaluated with a common set of caret functions. All on its own, the table is an impressive testament to the utility and...

Read more »

methylKit v0.9.6

June 2, 2016
By

We released a new version of methylKit, which is a package for DNA methylation analysis with bisulfite-seq data. This version comes with many changes summarized below. you can also have a look at the release notes. The vignette is now converted to...

Read more »

Remote Pair Programming in R

June 2, 2016
By
Remote Pair Programming in R

Recently I’ve been doing a lot of remote pair programming with clients. A few people have asked how this works. Rather than try to explain it, I recorded one of my sessions with a client last week, and you can watch it below: The actual session was about an hour long, though I edited it The post

Read more »

Images as x-axis labels

June 2, 2016
By
Images as x-axis labels

Open-source software is awesome. If I found that a piece of closed-source software was missing a feature that I wanted, well, bad luck. I probably couldn't even tell if was actually missing or if I just didn't know about it....Continue Reading →

Read more »

R for Publication by Page Piccinini: Lesson 2 – Linear Regression

June 2, 2016
By
R for Publication by Page Piccinini: Lesson 2 – Linear Regression

This is our first lesson where we actually learn and use a new statistic in R. For today’s lesson we’ll be focusing on linear regression. I’ll be taking for granted some of the set-up steps from Lesson 1, so if you haven’t done that yet be sure to go back and do it. By the Lesson 2: Linear...

Read more »

A demonstration of vtreat data preparation

June 1, 2016
By
A demonstration of vtreat data preparation

This article is a demonstration the use of the R vtreat variable preparation package followed by caret controlled training. In previous writings we have gone to great lengths to document, explain and motivate vtreat. That necessarily gets long and unnecessarily feels complicated. In this example we are going to show what building a predictive model … Continue reading...

Read more »

Le Monde puzzle [#964]

June 1, 2016
By
Le Monde puzzle [#964]

A not so enticing Le Monde mathematical puzzle: Find the minimal value of a five digit number divided by the sum of its digits. This can formalised as finding the minimum of N/(a+b+c+d+e) when N writes abcde. And solved by brute force. Using a rough approach to finding the digits of a five-digit number, the

Read more »

Reference semantics in R

June 1, 2016
By

Question I recently got a mail from Václav on reference semantics in data.tree, reading as follows: Dear Christoph, I am rather inexperienced when it comes to environments in R and henceforth I apologize if my question is basic; however, my colleagues are no better than me to answer my question. I would have a question iro The post

Read more »

Covcalc: Shiny App for Calculating Coverage Depth or Read Counts for Sequencing Experiments

June 1, 2016
By
Covcalc: Shiny App for Calculating Coverage Depth or Read Counts for Sequencing Experiments

How many reads do I need? What's my sequencing depth? These are common questions I get all the time. Calculating how much sequence data you need to hit a target depth of coverage, or the inverse, what's the coverage depth given a set amount of sequenci...

Read more »

Trisurf Plots in R using Plotly

June 1, 2016
By

In this post we’ll show how to create Triangular Surface Plots in R. This post is based on timelyportfolio’s gist. Moebius Strip 2D Surface over a disk Chopper from python

Read more »

Scripting Loops In R

June 1, 2016
By
Scripting Loops In R

An R programmer can determine the order of processing of commands, via use of the control statements; repeat{}, while(), for(), break, and next Answers to the exercises are available here. Exercise 1 The repeat{} loop processes a block of code until the condition specified by the break statement, (that is mandatory within the repeat{} loop),

Read more »

EARL London 2016 – Speakers Announced

June 1, 2016
By
EARL London 2016 – Speakers Announced

LONDON  13th – 15th SEPTEMBER 2016 EARL is an exciting cross-sector Conference dedicated to the real business usage of R.   One day of Workshops and two days devoted to the most innovative R implementations by the world’s leading practitioners. … Continue reading →

Read more »

MilanoR meeting | Call for presentations

June 1, 2016
By
MilanoR meeting | Call for presentations

A MilanoR meeting is an occasion to bring together R users from the Milano area to share R tips and experience: the next one will be Thursday, October 27th. We are looking for volunteers to present at the next meeting: if you feel you have something to input or you can recommend someone, please contact us! The post

Read more »

NLP on NPR’s Commencement Addresses

June 1, 2016
By
NLP on NPR’s Commencement Addresses

Vocativ did an interesting analysis of the President’s State of the Union (SOTU) speeches. They showed that across the past couple hundred years and many Presidents, SOTU speeches have been targeted at audiences with lower and lower education levels. Vocativ’s in-print interpretation of the downward sloping trend was that a speeches have gotten less sophisticated. Their recommended share-tweet for the article...

Read more »

Recent presentations

June 1, 2016
By
Recent presentations

The last month or so has been a whirlwind of awesomeness with a veritable bevvy of user group and conference talks on my part! I thought I would share the materials with you and provide some brief thoughts on how each presentation went. Sessions SQL Saturday Exeter : Stats 101 London Business Analytics (LBAG) : The post

Read more »

Scientific RMarkdown

May 31, 2016
By

Recently, in my own little scientific community bubble there was increasing interest in markdown and its use for science. As a big fan of markdown and espacially rmarkdown, I created the following cheat sheet and shared it at a couple of events. Sinc...

Read more »

heatmaply: interactive heat maps (with R)

May 31, 2016
By

I am pleased to announce heatmaply, my new R package for generating interactive heat maps, based on the plotly R package. tl;dr By running the following 3 lines of code: install.packages("heatmaply") library(heatmaply) heatmaply(mtcars, k_col = 2, k_row = 3) %>% layout(margin = list(l = 130, b = 40)) You will get this output in your browser … Continue reading...

Read more »

Happy New Year, Mr. President. Data and Sentiment Analysis of Presidential New Year Speeches

May 31, 2016
By
Happy New Year, Mr. President. Data and Sentiment Analysis of Presidential New Year Speeches

Salvino A. Salvaggio At a moment where many are preparing for the December 31st evening cocktail, the End of Year speech of the President of the Italian Republic is broadcast right on time at 8:30pm. A tradition which came to be with the constitutional establishment...

Read more »

Principal Components Regression in R: Part 3

May 31, 2016
By
Principal Components Regression in R: Part 3

by John Mount Ph. D. Data Scientist at Win-Vector LLC In her series on principal components analysis for regression in R, Win-Vector LLC's Dr. Nina Zumel broke the demonstration down into the following pieces: Part 1: the proper preparation of data and use of principal components analysis (particularly for supervised learning or regression). Part 2: the introduction of y-aware...

Read more »

Predictive Bookmaker Consensus Model for the UEFA Euro 2016

May 31, 2016
By

(By Achim Zeileis) From 10 June to 10 July 2016 the best European football teams will meet in France to determine the European Champion in the UEFA European Championship 2016 tournament. For the first time 24 teams compete, expanding the format from 16 teams as in the previous five Euro tournaments. For forecasting the winning probability of each team...

Read more »

Understanding beta binomial regression (using baseball statistics)

May 31, 2016
By
Understanding beta binomial regression (using baseball statistics)

Previously in this series: Understanding the beta distribution Understanding empirical Bayes estimation Understanding credible intervals Understanding the Bayesian approach to false discovery rates Understanding Bayesian A/B testing In this series we’ve been using the empirical Bayes method to estimate batting averages of baseball players. Empirical Bayes is useful here because when we...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.