Tech Dividends, Part 2

August 16, 2019

In a previous post, we explored the dividend history of stocks in the S&P 500, and we followed that by exploring the dividend history of some NASDAQ tickers. Today’s post is a short continuation of that tech dividend theme, with the aim of demonstrating how we can take our previous work and use it to quickly visualize research from...

Read more »

Modern R with the tidyverse is available on Leanpub

Yesterday I released an ebook on Leanpub, called Modern R with the tidyverse, which you can also read for free here. In this blog post, I want to give some context. Modern R...

Read more »

3 tidyverse tricks for most commonly used Excel Features

August 16, 2019

In this post, we’re simply going to see 5 tricks that could help improve your tooling using {tidyverse}. Create a difference variable between the current value and the next value: This...

Read more »

We’re RStudio Trainers!

August 16, 2019

We're RStudio Trainers! Big news. RStudio recently started certifying trainers in three areas: the tidyverse, Shiny and teaching. To be certified to teach a topic you have to pass...

Read more »

EARL London – speaker interview

August 16, 2019

Robert Duff (Transport for London) and Rahulan Chandrasekaran (Department for Transport). Robert and Rahulan are giving a joint presentation titled ‘Let me in! Let me on! Quantifying highly frustrating...

Read more »

What the MAPE is FALSELY blamed for, its TRUE weaknesses and BETTER alternatives!

August 16, 2019

In a time series context, one of the most commonly used measures is the MAPE. In this blog post, I evaluate critical arguments and weaknesses concerning the MAPE and demonstrate...

Read more »

Really large numbers in R

August 15, 2019

This post will discuss ways of handling huge numbers in R using the gmp package. The gmp package provides us a way of dealing with really...

Read more »

Converting lines in an svg image to csv

August 15, 2019

During a search for data on programming language usage I discovered Stack Overflow Trends, showing an interesting plot of language tags appearing on Stack Overflow questions (see below). Where...

Read more »

Insurance data science : Networks

August 15, 2019

At the Summer School of the Swiss Association of Actuaries, in Lausanne, I will start talking about networks and insurance this Friday. Slides are available online.

Read more »

Fun with progress bars: Fish, daggers and the Star Wars trench run

August 15, 2019

If you’re like me, when running a process through a loop you’ll add in counters and progress indicators. That way...

Read more »

EARL London – agenda highlights

August 15, 2019

There are so many wonderful EARL talks happening this year – it’s hard to highlight them all! But we thought we’d share some that the Mango team are really...

Read more »

Big Data: Wrangling 4.6M Rows with dtplyr (the NEW data.table backend for dplyr)

August 14, 2019

Wrangling Big Data is one of the best features of the R programming language, which boasts a Big Data Ecosystem that contains fast in-memory tools (e.g. data.table) and distributed...

Read more »

2 Months in 2 Minutes – rOpenSci News, August 2019

rOpenSci HQ rOpenSci received a $678K award from the Sloan Foundation to expand Software Peer Review. We are hiring for a new position in statistical software testing and peer review. Join our...

Read more »

Insurance data science : Text

August 14, 2019

At the Summer School of the Swiss Association of Actuaries, in Lausanne, I will start talking about text-based data and NLP this Thursday. Slides are available online. Ewen...

Read more »

Prepping data for #rstats #tidyverse and a priori planning

August 14, 2019

Much, if not most, data clean-up, tidying, wrangling, and joining can be done directly in R. There are many advantages to this approach – i.e. read in data...

Read more »

Using the lpSolve package in R to optimise an electricity system

August 14, 2019

Reducing carbon emissions is maybe the world's most pressing challenge at the moment. One obvious avenue for action is the reduction of carbon emissions from electricity generation, which are...

Read more »

What is vtreat?

August 14, 2019

vtreat is a DataFrame processor/conditioner that prepares real-world data for supervised machine learning or predictive modeling in a statistically sound manner. vtreat takes an input DataFrame that has a...

Read more »

Plotting Pairs (GC4W6HJ)

August 14, 2019

Planning for a trip to Cornwall, I was churning through some mysteries there, largely picking them at random from the map. I worked my way through this one by...

Read more »

Using linear models with binary dependent variables, a simulation study

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 8, in which I discuss...

Read more »

(Bootstrapping) Follow-Up Contrasts for Within-Subject ANOVAs (part 2)

Contents: 1. Fit your repeated-measures model with lmer; 2. Define the contrast(s) of interest; 3. Run the bootstrap; Summary. A while back I wrote a post demonstrating how to bootstrap follow-up contrasts for repeated-measure...

Read more »

Insurance data science : Pictures

August 13, 2019

At the Summer School of the Swiss Association of Actuaries, in Lausanne, following the part of Jean-Philippe Boucher (UQAM) on telematic data, I will start talking about pictures this...

Read more »

Speaking at BARUG

August 13, 2019

We will be speaking at BARUG on Tuesday, September 3, 2019. If you are in the Bay Area, please come see us. Nina Zumel & John Mount: Practical Data...

Read more »

Be careful of NA/NaN/Inf values when using base R’s plotting functions!

August 13, 2019

I was recently working on a supervised learning problem (i.e. building a model using some features to predict some response variable) with a fairly large dataset. I used base...

Read more »

Cyclists – London Ride 100 – Analysis for riders and clubs using Shiny/R

August 13, 2019

The Prudential London-Surrey 100 is an annual cycling event for amateur cyclists. This article describes the public Shiny page riders use to analyse how they and their club performed...

Read more »

How to Automate EDA with DataExplorer in R

August 13, 2019

EDA (Exploratory Data Analysis) is one of the key steps in any Data Science project. The better the EDA, the better the Feature Engineering can be. From...

Read more »

Do you love Data Science? I mean, the Data part in it

August 13, 2019

Last week, we talked all about Artificial Intelligence (also Artificial Stupidity), which led me to think about the foundation of Data Science: the Data itself. I think, Data...

Read more »

Basic Quantile Regression

August 12, 2019

By Gabriel Vasconcelos. Introduction: Today we are going to talk about quantile regression. When we use the lm command in R we are fitting a linear regression using Ordinary...

Read more »

Local randomness in R

August 12, 2019

One approach to using random number generation inside a function without affecting the outer state of the random number generator. Prologue: Let’s say we...

Read more »

Plumber Logging

August 12, 2019

The plumber R package is used to expose R functions as API endpoints. Due to plumber’s incredible flexibility, most major API design decisions are left up to the developer....

Read more »
