Parametric variable names and dplyr

December 3, 2016

When writing reusable code or packages, you often do not know the names of the columns or variables you need to work over. This is what I call “parametric treatment of variables.” This can be a problem when using R libraries that assume you know the variable names. The R data manipulation library dplyr currently …

Read more »

Highlighting R code for the web

December 3, 2016

When blogging about statistics and R, it is very useful to differentiate the body text from R code. I used to manage this issue by highlighting the code and...

Read more »

How important is that variable?

December 3, 2016

When modeling any phenomenon by including explanatory variables that are highly related to the variable of interest, one question arises: which of the auxiliary variables have a higher influence on the...

Read more »

R Course Finder December update!

December 3, 2016

A few months ago we launched R Course Finder, an online directory that helps you find the right R course quickly. With so many R courses available online,...

Read more »

k-fold cross validation with modelr and broom

December 3, 2016

@drsimonj here to discuss how to conduct k-fold cross validation, with an emphasis on evaluating models supported by David Robinson’s broom package. Full credit also goes...

Read more »

Non-linear dynamics and chaos using R (1)

December 3, 2016

I have been teaching the course "Mathematics" for the PhD students at Ca' Foscari University for a few years. In the lectures I cover some "scattered" material that may...

Read more »

Be careful evaluating model predictions

December 2, 2016

One thing I teach is: when evaluating the performance of regression models you should not use correlation as your score. This is because correlation tells you if a re-scaling...

Read more »

Forecast double seasonal time series with multiple linear regression in R

December 2, 2016

I will continue describing forecasting methods that are suitable for seasonal (or multi-seasonal) time series. In the previous post, smart meter data of electricity consumption were introduced...

Read more »

Stylometry: Identifying authors of texts using R

December 2, 2016

Few people expect politicians to write every word they utter themselves; reliance on speechwriters and spokespersons is a long-established political practice. Still, it's interesting to know which statements are...

Read more »

Understanding SQL and R Training Video

December 2, 2016

Just had a video series released by O'Reilly Media on infiniteskills.com: https://www.infiniteskills.com/training/understanding-sql-and-r.html This course shows you how to combine two of the most popular technologies...

Read more »

Writing Good R Code and Writing Well

December 2, 2016
By Joseph Rickert

If you are aspiring to write good R code, you may find it helpful to occasionally spend some time reading about writing: reading about writing R...

Read more »

Model Evaluation Exercises 1

December 2, 2016

We are committed to bringing you 100% authentic exercise sets. We even try to include datasets that are as different as possible to give you an understanding of different problems. No...

Read more »

Announcing bookdown: Authoring Books and Technical Documents with R Markdown

December 2, 2016

We have released the R package bookdown (v0.3) to CRAN. It may be old news to some users, but we are happy to make an official announcement today. To install the package...

Read more »

D3PartitionR 0.3.0 is available on CRAN

December 2, 2016

Yesterday, the new version of D3partitionR was uploaded to CRAN. This version contains some major improvements. 1. General layout: Before this update, the height and width of the widget...

Read more »

High Performance CommonMark and Github Markdown Rendering in R

December 2, 2016

This week the folks at Github have open sourced their fork of libcmark (based on the extensive PR by Mathieu Duponchelle), which they use to render markdown...

Read more »

Relational operators for intervals with the intrval R package

December 2, 2016

I recently posted a piece about how to write and document special functions in R. I meant that as a prelude for the topic I am writing about in...

Read more »

Introduction of the pdfsearch package

December 2, 2016

I'm happy to introduce an add-on package, pdfsearch, that adds the ability to do keyword searches on pdf files. This add-on package uses the excellent pdftools package from the...

Read more »

Extracting Tables from PDFs in R using the Tabulizer Package

December 1, 2016

Recently I wanted to extract a table from a pdf file so that I could work with the table in R. Specifically, I wanted to get data on layoffs...

Read more »

Replicating NYT Weather App

December 1, 2016

It has been a long time since my last post, so I want to post something, no matter what it is, but I hope this will...

Read more »

Extreme Gradient Boosting and Preprocessing in Machine Learning – Addendum to predicting flu outcome with R

December 1, 2016

In last week’s post I explored whether machine learning models can be applied to predict flu deaths from the 2013 outbreak of influenza A H7N9 in China. There, I...

Read more »

ratio-of-uniforms [#4]

December 1, 2016

Possibly the last post on random number generation by Kinderman and Monahan’s (1977) ratio-of-uniforms method. After fiddling with the Gamma(a,1) distribution when a<1 for a while, I indeed figured...

Read more »

RStudio in the cloud with Amazon Lightsail and docker

December 1, 2016

About two years ago we published a quick and easy guide to setting up your own RStudio server in the cloud using the Docker service and Digital Ocean. ...

Read more »

An Analysis of Contributions to PubMed Commons

December 1, 2016

I recently saw a tweet floating by which included a link to some recent statistics from PubMed Commons, the NCBI service for commenting on scientific articles in PubMed. Perhaps...

Read more »

Efficiently Saving and Sharing Data in R

December 1, 2016

I spent a day the other week struggling to make sense of a federal data set shared in an archaic format (ASCII fixed-format dat file). It is essential...

Read more »

Using R to Gain Insights into the Emotional Journeys in War and Peace

December 1, 2016
By Wee Hyong Tok, Senior Data Scientist Manager at Microsoft

How do you read a novel in record time, and gain insights into the emotional journey of main characters,...

Read more »

Scripting Loops exercises Vol.2

Answers to the exercises are available here. Exercise 1 Using the following...

Read more »

Analysis of software developers in New York, San Francisco, London and Bangalore

December 1, 2016

(Note: Cross-posted with the Stack Overflow Blog.) When I tell someone Stack Overflow is based in New York City, they’re often surprised: many people assume it’s in San Francisco....

Read more »

New Course: Joining Data in R with dplyr

December 1, 2016

We just launched Joining Data in R with dplyr taught by Garrett Grolemund, the author of Hands-On Programming with R and R for Data Science from O'Reilly Media. This course builds on what you...

Read more »

How to send bulk email to your students using R

December 1, 2016

In this post I write about a technique I use to send personalized bulk emails to my students at the open and distance education university where I teach (the...

Read more »
