My RStudio Configuration

May 30, 2019
By
My RStudio Configuration

Whenever I need to install RStudio on a new machine, I have to think a bit about the configuration options I’ve tweaked. Invariably, I miss a checkbox that leaves me with slightly different RStudio behavior on each system. This post includes screenshots of my RStudio configuration and custom keyboard shortcuts for RStudio 1.3, MacOS, so … Continue reading My...

Read more »

Star Wars Fandom Survey, Part 5: Importance of Movie Characteristics

May 30, 2019
By
Star Wars Fandom Survey, Part 5: Importance of Movie Characteristics

Welcome from Part 1, where I talked mainly about methods; Part 2, where I discussed the three major types of Star Wars fans; Part 3, where I discussed sexism and political attitudes; and Part 4, where I discussed age and nostalgia. In this part, I will focus on age and nostalgia. As always, email [email protected] with questions about analyses,...

Read more »

Star Wars Fandom Survey, Part 4: Age and Nostalgia

May 30, 2019
By
Star Wars Fandom Survey, Part 4: Age and Nostalgia

Welcome from Part 1, where I talked mainly about methods; Part 2, where I discussed the three major types of Star Wars fans; and Part 3, where I discussed sexism and political attitudes. In this part, I will focus on age and nostalgia. As always, email [email protected] with questions about analyses, methods, results, and so on. Star Wars was a...

Read more »

Star Wars Fandom Survey, Part 3: Sexism and Political Attitudes

May 30, 2019
By
Star Wars Fandom Survey, Part 3: Sexism and Political Attitudes

Welcome from Part 1, where I talked mainly about methods, and Part 2, where I discussed the three major types of Star Wars fans. In this part, I will focus on sexism and political attitudes. As always, email [email protected] with questions about analyses, methods, results, and so on. It is not inherently sexist to dislike Disney’s Star Wars films. There...

Read more »

How to start a new package with testing in R

May 30, 2019
By

# Navigate where you want your folder to be located setwd("C:/Users/chief/Documents/Github") # Assumes usethis is installed usethis::create_package("foo") # Say yes or no to next (annoying) popup window, it doesn't matter. # Add a test environment setw...

Read more »

How to start a new package with testing in R

May 30, 2019
By

# Navigate where you want your folder to be locatedsetwd("C:/Users/chief/Documents/Github")# Assumes usethis is installedusethis::create_package("foo")# Say yes or no to next (annoying) popup window, it doesn't matter.# Add a test environmentsetwd("foo")usethis::use_testthat()# Add first test function to at least get something in that folder.# Go to foo\tests\testthat# and add this file.context("foo")library(foo)test_that("I'm testing something", {  # do something with your code  expect_equal(1:4,...

Read more »

More Bayes and multiple comparisons

More Bayes and multiple comparisons

In my last post I had a little fun comparing perspectives among Bayesian, frequentist and programmer methodologies. I took a nice post from Anindya Mozumdar from the R Bloggers feed and investigated the world’s fastest man. I’ve found that in writing these posts two things always happen. I learn a lot, and I have follow-on questions or thoughts. This time is no exception, the last post made...

Read more »

78th #TokyoR Meetup Roundup!

May 30, 2019
By
78th #TokyoR Meetup Roundup!

With the arrival of summer, another TokyoR User Meetup! On May 25th, useRs from all over Tokyo (and some even from further afield - including Kan Nishida of Exploratory, all the way from California!) flocked to Jimbocho, Tokyo for...

Read more »

How to Become a Data Scientist

May 30, 2019
By
How to Become a Data Scientist

This question and its variations are the most searched topics on Google. As a practicing datascience professional, and manager to boot, dozens of people ask me this question every week. This post is my honest and detailed answer. Step 1 – Coding & ML skills You need to master programming in either R or Python.

Read more »

Quick and easy t-SNE analysis in R

May 30, 2019
By
Quick and easy t-SNE analysis in R

t-SNE is a useful dimensionality reduction method that allows you to visualise data embedded in a lower number of dimensions, e.g. 2, in order to see patterns and trends in the data. It can deal with more complex patterns of Gaussian clusters in multidimensional space compared to PCA. Although is not suited to finding outliers

Read more »

Bayesian modeling using Stan: A case study

May 30, 2019
By
Bayesian modeling using Stan: A case study

Practice makes better. And faster. But what exactly is the relation between practice and reaction time? In this blog post, we will focus on two contenders: the power law and exponential function. We will implement these models in Stan and extend them to account for learning plateaus and the fact that, with increased practice, not only the mean reaction...

Read more »

xaibot – conversations with predictive models!

May 30, 2019
By
xaibot – conversations with predictive models!

If you could talk to a predictive machine learning model, what would you ask for? Try! Michał Kuźba is developing a mind-blowing project – xai chat-bot. Dialog based system that helps to explore and understand predictive models through natural language conversations (type, speak or phone the model 😉 ). For example, imagine that you have … Czytaj dalej xaibot...

Read more »

Cognitive capitalism chapter reworked

May 29, 2019
By

The Cognitive capitalism chapter of my evidence-based software engineering book took longer than expected to polish; in fact it got reworked, rather than polished (which still needs to happen, and there might be more text moving from other chapters). Changing the chapter title, from Economics to Cognitive capitalism, helped clarify lots of decisions about the

Read more »

An Introduction to Modelling Soccer Matches in R (part 1)

An Introduction to Modelling Soccer Matches in R (part 1)

For anyone watching football, being able to predict matches is a key aspect of the hobby. Whether explicitly (e.g. when betting on matches, or deciding on recruitment for an upcoming season), or more implicitly when discussing favourites to win the league in the pub, almost all discussion of the sport on some level require predictions about some set of upcoming...

Read more »

April 2019: “Top 40” New CRAN Packages

May 29, 2019
By
April 2019: “Top 40” New CRAN Packages

One hundred eighty-seven new packages made it to CRAN in April. Here are my picks for the “Top 40”, organized into ten categories: Biotechnology, Data, Econometrics, Machine Learning, Medicine, Science, Statistics, Time Series, Utilities, and Visualization. Biotechnology genpwr v1.00: Provides functions for power and sample size calculations for genetic association studies allowing for mis-specification of the model of genetic susceptibility....

Read more »

A New Release of rIP (v1.2.0) for Detecting Fraud in Online Surveys

May 29, 2019
By
A New Release of rIP (v1.2.0) for Detecting Fraud in Online Surveys

We are excited to announce the latest major release of rIP (v1.2.0), which is an R package that detects fraud in online surveys by tracing, scoring, and visualizing IP addresses. Essentially, rIP takes an array of IP addresses, which are always captured in online surveys (e.g., MTurk), and the keys for the services the user … Continue reading A...

Read more »

Interactive charts with chartbookR

“There is no such thing as information overload. There is only bad design.” (— Edward Tufte). There is nothing worse than charts overladed with information. One solution to this are interactive charts that let users select the time series they’re interested in, zoom in on them, and focus on individual data points. The chartbookR package makes the creation of interactive charts very...

Read more »

Momentum Investing with R

May 28, 2019
By
Momentum Investing with R

After an extended hiatus, Reproducible Finance is back! We’ll celebrate by changing focus a bit and coding up an investment strategy called Momentum. Before we even tiptoe in that direction, please note that this is not intended as investment advice and it’s not intended to be a script that can be implemented for trading. The goal is to explore...

Read more »

Employee flight risk modeling behavior

May 28, 2019
By
Employee flight risk modeling behavior

An analytical model for predicting employee flight risk behaviour “People are the nucleus of any organization. So, how can you find, engage and retain top performers who’ll contribute to your goals, your future?” There is no dearth of Enterpris...

Read more »

How to keep up with CRAN policies and processes?

May 28, 2019
By

CRAN, the Comprehensive R Archive Network, changes its rules and workflow every so often: see for instance the new encoding setting of one of its check flavors. As a package developer, you’d better keep up with CRAN policies and processes to be able to safely retain your package(s) on CRAN and to prepare your next submissions with as much...

Read more »

Data Science Jobs Report 2019: Python Way Up, Tensorflow Growing Rapidly, R Use Double SAS

May 28, 2019
By
Data Science Jobs Report 2019: Python Way Up, Tensorflow Growing Rapidly, R Use Double SAS

In my ongoing quest to track The Popularity of Data Science Software, I've just updated my analysis of the job market. To save you from reading the entire tome, I'm reproducing that section here. Continue reading →

Read more »

What single step does with relationship

May 28, 2019
By
What single step does with relationship

We had a journal club about the single step GBLUP method for genomic evaluation a few weeks ago. In this post, we’ll make a few graphs of how the single step method models relatedness between individuals. Imagine you want to use genomic selection in a breeding program that already has a bunch of historical pedigree

Read more »

Automate Your KPI Forecasts With Only 1 Line of R Code Using AutoTS

Automate Your KPI Forecasts With Only 1 Line of R Code Using AutoTS

If you are having the following symptoms at your company when it comes to business KPI forecasting, then maybe you need to look at automated forecasting: Ugly Excel spreadsheets with multiple tabs and 2000s style pastel formatting Business unit managers, store managers, operations managers, sales teams, and finance teams who give convoluted and indirect answers

Read more »

Package Spotlight: anim.plots

May 28, 2019
By

The package anim.plots behaves like a sort of user-friendly shell on top of animate that makes animations of some of the most common types of plots in base R in a more intuitive fashion that animate. This package depends on two other important packages: -   magick, which is an R implementation of imageMagick, which itself is software used to create animated...

Read more »

Learning R: Painting with Fire

May 28, 2019
By
Learning R: Painting with Fire

A few months ago I published a post on recursion: To understand Recursion you have to understand Recursion…. In this post we will see how to use recursion to fill free areas of an image with colour, the caveats of recursion and how to transform a recursive algorithm into a loop-based version using a queue … Continue reading "Learning...

Read more »

Individuals Control Charts

May 27, 2019
By
Individuals Control Charts

This post is a follow-up two my two recent posts on generating control charts in R, and animating them. One thing that I’ve been wondering about is how to calculate a range chart without using a package like ggQC or qcc. I knew that I could do it using a loop, but I was looking for a dplyr method...

Read more »

How to save (and load) datasets in R: An overview

May 27, 2019
By
How to save (and load) datasets in R: An overview

What I will show you In this post, I want to show you a few ways how you can save your datasets in R. Maybe, this seems like a dumb question to you. But after giving quite a few R courses mainly - but not only - for R beginners, I came to acknowledge that the answer to this question...

Read more »

ramlegacy: a package for RAM Legacy Database

Introduction ramlegacy is a new R package to download, cache and read in all the different versions of the RAM Legacy Stock Assessment Database, a public database containing stock assessment results of commercially exploited marin...

Read more »

Job @ Oxford

May 27, 2019
By

Boby Mihaylova has two exciting posts available at the Health Economics Research Centre at the University of Oxford. In particular, she is looking for two R-minded researchers/analysts to develop work on disease modelling/cost-effectiveness using large individual-patients databases. In fact, I think it’s really good that they are explicitly including knowledge of R as part of the job specification —...

Read more »

Search R-bloggers

Sponsors