## Edinbr: Text Mining with R

February 23, 2018
By

During a very quick tour of Edinburgh (and in particular some distilleries), Dave Robinson (Tidytext fame), was able to drop by the Edinburgh R meet-up group to give a very neat talk on tidy text. The first part of the talk set the scene What does does text mean? Why make text tidy? What sort of problems can you solve. This was a...

## How to set up a sparklyr cluster in 5 minutes

February 23, 2018
By

If you've ever wanted to play around with big data sets in a Spark cluster from R with the sparklyr package, but haven't gotten started because setting up a...

## Is R base::subset() really that bad?

February 23, 2018
By

Is R base::subset() really that bad? Notes discussing subset() often refer to the following text (from help(subset), referred to in examples: 1, 2): Warning This is a convenience function...

## Do tweeps with more followers follow tweeps with more followers? by @ellis2013nz

February 23, 2018
By

Branko Milanovic asked on Twitter: Idea for a paper: “Homogamy” on Twitter. Do people with more followers follow people with more followers? I don’t have time to write...

## Xie Yihui, R Superstar and Mensch

February 23, 2018
By

Yesterday a friend told me, “Yihui has written the most remarkably open blog post, and you’ve got to read it.” I did and it was. Though my post here...

## R’s S3 generic-function object-oriented system

February 22, 2018
By

Just like your mom's chippity chipper recipe - In Data Science, there are numerous instances where different techniques call for the use of different tools....

## An introduction to joint modeling in R

February 22, 2018
By

By J Espasandin, O Lado, A Bouzas, A Baluja. You can also check this post, written in #blogdown, here: intro-joint-modeling-r. These days, between the 19th and 21st of February,...

## Analyzing accelerometer data with R

February 22, 2018
By

Using your smartphone (any modern phone with a built-in accelerometer should work), visit the Cast Your Spell page created by Nick Strayer. (If you need to type it to...

## R Tip: Force Named Arguments

February 22, 2018
By

R tip: force the use of named arguments when designing function signatures. R’s named function argument binding is a great aid in writing correct programs. It is a good...

## Whys and Hows of Apply Family of Functions in R

February 22, 2018
By

Introduction to Looping system Imagine you were to perform a simple task, let’s say calculating sum of columns for 3X3 matrix, what do you think is the best way?...

February 22, 2018
By

R-Ladies Remote is kicking off and we want YOU! Do you want to be part of the R community but can’t attend meetups? There are many R-Ladies across the...

February 21, 2018
By

About a year and a half back I was working in Python a bit and became accustomed to the explicit importing of modules (akin to R packages) and functions....

## Jan 2018: “Top 40” New Package Picks

February 21, 2018
By

Here are my “Top 40” picks from the two hundred or so new packages that stuck to CRAN in January, listed under seven categories: Data, Data Science, Science, Statistics,...

## Combine your hex stickers with magic(k)

February 21, 2018
By

Hex stickers remind me of Pogs, except they’re cooler because you can combine them together! Some people do that very smartly. Now when I forget how to do an #rstats...

## Topological Tomfoolery in R: Plotting a Möbius Strip

February 21, 2018
By
$Topological Tomfoolery in R: Plotting a Möbius Strip$

Topology is the silly putty of mathematics. One of the strangest shapes in topology is the Möbius strip. This strip is a surface with only one side. This article...

## Update: “Difficult to Reproduce Choroplethr Bug” Isolated and Fixed

February 21, 2018
By

Yesterday I wrote that I’ve received a number of choroplethr bug reports recently that I simply cannot reproduce. Due to a large number of people... The post Update: “Difficult to...

## Machine Learning in R with TensorFlow

February 21, 2018
By

Modern machine learning platforms like Tensorflow have to date been used mainly by the computer science crowd, for applications like computer vision and language understanding. But as JJ Allaire...

## R Tip: Use [[ ]] Wherever You Can

February 21, 2018
By

R tip: use ] wherever you can. In R the ] is the operator that (when supplied a scalar argument) pulls a single element out of lists...

## Markdown based web analytics? Rectangle your blog

February 21, 2018
By

Locke Data’s great blog is Markdown-based. What this means is that all blog posts exist as Markdown files: you can see all of them here. They then get rendered...

## “I have to randomize by cluster. Is it OK if I only have 6 sites?”

February 20, 2018
By

The answer is probably no, because there is a not-so-low chance (perhaps considerably higher than 5%) you will draw the wrong conclusions from the study. I have heard variations...

## Creating a Table of Monthly Returns With R and a Volatility Trading Interview

February 20, 2018
By

This post will cover two aspects: the first will be a function to convert daily returns into a table of … Continue reading →

## Effortless but Powerful Exception Logging in R: loggit! 1.0.0 Released on CRAN

February 20, 2018
By

Frustrated with the lack of pain-free logging in R, a colleague inspired me to write my first public R package: loggit. What follows is the README that you can find...

## Difficult to Reproduce Choroplethr Bug: Can You Help?

February 20, 2018
By

Since November 2017 I have received three bug reports from users who see this error when running any command in the choroplethr package: Theme element panel.border missing   Error in if (theme\$panel.ontop)... The post Difficult to Reproduce...

## Speeding up spatial analyses by integrating `sf` and `data.table`: a test case

February 20, 2018
By

The problem Last week, I replied to this interesting question posted by @Tim_K over stackoverflow. He was seeking efficient solutions to identify all points falling within a maximum distance of xx...

## Deep Learning Image Classification with Keras and Shiny

February 19, 2018
By

I have to admit my initial thoughts of deep learning were pessimistic and in order to not succumb to impostor syndrome, I put off learning...

## webmockr: mock HTTP requests

webmockr webmockr is an R library for stubbing and setting expectations on HTTP requests. It is a port of the Ruby gem webmock. webmockr works by plugging in to another R package...

## BIKE SERVICES API + SHINY = NICE APP

Hi everyone, In this blog post, I will be short and I will introduce our shiny application on bike self-service stations. Your browser does not support the video tag. The...

## Blog about something you just learned

February 19, 2018
By

Great effort has recently been made to encourage also the not-so-experienced to jump into the water and blog about data science. Some of the community’s hot shots gracefully draw...

## R Tip: Use seq_len() to Avoid The Backwards Sequence Bug

February 19, 2018
By

Another R tip. Use seq_len() to avoid The backwards seqeunce bug. Many R users use the “colon sequence” notation to build sequences. For example: for(i in 1:5) { print(paste(i,...