SatRday and visual inference of vine copulas

February 19, 2017
By
SatRday and visual inference of vine copulas

SatRday From the 16th to the 18th of February, satRday was held in the City of Cape Town in South Africa. The programme kicked off with two days of workshops and then the conference on Saturday. The workshops were divided up into three large sections:...

Read more »

Building Shiny App exercises part 7

February 19, 2017
By
Building Shiny App exercises part 7

Connect widgets & plots In the seventh part of our journey we are ready to connect more of the widgets we created before with our k-means plot in order...

Read more »

Factoextra R Package: Easy Multivariate Data Analyses and Elegant Visualization

February 19, 2017
By
Factoextra R Package: Easy Multivariate Data Analyses and Elegant Visualization

factoextra is an R package making easy to extract and visualize the output...

Read more »

Predicting food preferences with sparklyr (machine learning)

February 18, 2017
By
Predicting food preferences with sparklyr (machine learning)

This week I want to show how to run machine learning applications on a Spark cluster. I am using the sparklyr package, which provides a handy interface to access...

Read more »

Bar bar plots but not Babar plots

February 18, 2017
By
Bar bar plots but not Babar plots

You might have heard of the “bar bar plots” movement whose goal is to prevent using (let’s use ggplot2 language shall we) geom_bar when you could have used e.g....

Read more »

Using Rcpp with C++11, C++14 and C++17

February 18, 2017
By
Using Rcpp with C++11, C++14 and C++17

Background When we started the Rcpp Gallery in late 2012, a few of us spent the next four weeks diligently writing articles ensuring that at least one new article would be...

Read more »

Analytical and Numerical Solutions to Linear Regression Problems

February 18, 2017
By
Analytical and Numerical Solutions to Linear Regression Problems

This exercise focuses on linear regression with both analytical (normal equation) and numerical (gradient descent) methods. We will start with linear regression with one variable. From this part of...

Read more »

Putting It All Together

February 18, 2017
By
Putting It All Together

The kind folks over at @RStudio gave a nod to my recently CRAN-released epidata package in their January data package roundup and I thought it might be useful to...

Read more »

cricketr and yorkr books – Paperback now in Amazon

February 18, 2017
By
cricketr and yorkr books – Paperback now in Amazon

My books – Cricket Analytics with cricketr – Beaten by sheer pace!: Cricket analytics with yorkr are now available on Amazon in both Paperback and Kindle versions The cricketr...

Read more »

Accessing MSSQL Server with R (RSQLServer with dplyr)

February 18, 2017
By
Accessing MSSQL Server with R (RSQLServer with dplyr)

Recently I have been starting to use dplyr for handling my data in R. It makes everything a lot smoother! My previous workflow – running an SQL query, storing...

Read more »

RPushbullet 0.3.1

February 17, 2017
By
RPushbullet 0.3.1

A new release 0.3.1 of the RPushbullet package, following the recent 0.3.0 release is now on CRAN. RPushbullet is interfacing the neat Pushbullet service for inter-device messaging, communication,...

Read more »

Using Armadillo with SuperLU

February 17, 2017
By
Using Armadillo with SuperLU

Armadillo is very versatile C++ library for linear algebra, brough to R via the RcppArmadillo package. It has proven to be very useful and popular, and is (as of February...

Read more »

A plot against the CatterPlots complot

February 17, 2017
By
A plot against the CatterPlots complot

In these terrible times, we R people have more important subjects to debate/care about than ggplot2 vs. base R graphics (isn’t even worth discussing anyway, ggplot2 is clearly the...

Read more »

Optimization and Operations Research in R

February 17, 2017
By

Authors: Stefan Feuerriegel and Joscha Märkle-Huß R is widely taught in business courses and, hence, known by most data scientists with business background. However, when it comes to optimization...

Read more »

Catterplots: Plots with cats

February 17, 2017
By
Catterplots: Plots with cats

As a devotee of Tufte, I'm generally against chartjunk. Graphical elements that obscure interpretation of the data occasionally have a useful role to play, but more often than not...

Read more »

January New Data Packages

February 17, 2017
By
January New Data Packages

by Joseph Rickert As forecast, the number of R packages hosted on CRAN exceeded 10,000 in January. Dirk Eddelbuettel, who tracks what’s happening on CRAN with his CRANberries site,...

Read more »

Naive Bayes Classification in R (Part 2)

February 17, 2017
By
Naive Bayes Classification in R (Part 2)

Following on from Part 1 of this two-part post, I would now like to explain how the Naive Bayes classifier works before applying it to a classification problem involving...

Read more »

Naive Bayes Classification in R (Part 1)

February 17, 2017
By
Naive Bayes Classification in R (Part 1)

Introduction A very useful machine learning method which, for its simplicity, is incredibly successful in many real world applications is the Naive Bayes classifier. I am currently taking a...

Read more »

How to perform PCA with R?

February 17, 2017
By
How to perform PCA with R?

This post shows how to perform PCA with R and the package FactoMineR. If you want to learn more on methods such as PCA, you can enroll in this...

Read more »

Moving largish data from R to H2O – spam detection with Enron emails

February 17, 2017
By
Moving largish data from R to H2O – spam detection with Enron emails

Moving around sparse matrices of text data - the limitations of as.h2o This post is the resolution of a challenge I first wrote about in late 2016, moving large sparse...

Read more »

Exploratory Multivariate Data Analysis with R- enroll now in the MOOC

February 17, 2017
By
Exploratory Multivariate Data Analysis with R- enroll now in the MOOC

Exploratory multivariate data analysis is studied and has been taught in a “French-way” for a long time in France. You can enroll in a MOOC (completely free) on Exploratory Multivariate Data...

Read more »

The Steep Slide of NFL Draft Salaries

February 16, 2017
By
The Steep Slide of NFL Draft Salaries

Some friends and I got into a conversation about rookies in the NFL and how much their salaries were. We eventually started guessing how much more the first overall...

Read more »

littler 0.3.2

February 16, 2017
By
littler 0.3.2

The third release of littler as a CRAN package is now available, following in the now more than ten-year history as a...

Read more »

A knapsack riddle [#2]?

February 16, 2017
By
A knapsack riddle [#2]?

Still about this allocation riddle of the past week, and still with my confusion about the phrasing of the puzzle, when looking at a probabilistic interpretation of the game,...

Read more »

Factor Analysis with the Principal Component Method Part Two

February 16, 2017
By

In the first post on factor analysis, we examined computing the estimated covariance matrix of the rootstock data and proceeded to find two factors that fit most of the...

Read more »

Should I Learn R or Python? … It Doesn’t Matter

February 16, 2017
By

Should I learn R or Python for data science? I am asked this question regularly, both online and in person. There is a simple answer: it doesn’t matter. There...

Read more »

Chop It: Look up the Generating Data Frame Columns of a Formula Term

February 16, 2017
By
Chop It: Look up the Generating Data Frame Columns of a Formula Term

We the moody Gucci, Louis and Pucci men Escada, Prada The chopper it got the Uzi lens Bird’s-eye view The birds I knew, flip birds Bird gangs, it was...

Read more »

Interactive plots in PCA with Factoshiny

February 16, 2017
By
Interactive plots in PCA with Factoshiny

A beautiful graph tells more than a lenghtly speach!! So it is crucial to improve the graphs obtained by Principal Component Analysis or (Multiple) Correspondence Analysis. The package Factoshiny allows...

Read more »

Six Articles on using R with SQL Server

February 16, 2017
By

Tomaž Kaštrun is developer and data analyst working for the IT group at SPAR (the ubiquitous European chain of convenience stores) in Austria. He blogs regularly about using Microsoft...

Read more »

Sponsors

Mango solutions







Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de







CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.