rOpenSci Hiring for New Position in Statistical Software Testing and Peer Review

Are you passionate about statistical methods and software? If so, we would love for you to join our team to dig deep into the world of statistical software packages. You’ll develop standards for evaluating and reviewing statistical t...

Combining momentum and value into a simple strategy to achieve higher returns

July 17, 2019

In this post I'll introduce a simple investing strategy that is well diversified and has been shown to work across different markets. In short, buying cheap and uptrending stocks...

An Ad-hoc Method for Calibrating Uncalibrated Models

July 16, 2019

In the previous article in this series, we showed that common ensemble models like random forest and gradient boosting are uncalibrated: they are not guaranteed to estimate aggregates or...

Three Strategies for Working with Big Data in R

July 16, 2019

For many R users, it’s obvious why you’d want to use R with big data, but not so obvious how. In fact, many people (wrongly) believe that R just...

101 Machine Learning Algorithms for Data Science with Cheat Sheets

July 16, 2019

Your one-stop-shop for machine learning algorithms. Each algorithm is complete with a short description and links to examples. If you would like to take the algorithms with you, click...

shinymeta — a revolution for reproducibility

July 16, 2019

Joe Cheng presented shinymeta, which enables reproducibility in Shiny, at useR! in July 2019. I am really thankful for this. This article shows a…

Shiny Modules

July 16, 2019

Tidiness is half the life .. this is a German saying that you might not necessarily...

eRum2020 in Milan

July 16, 2019

The European R conference will visit Milan in 2020! Mirai Solutions is delighted to actively support and participate in the organization of the event. The European R Users Meeting (eRum)...

Reinforcement Learning: Life is a Maze

July 16, 2019

It can be argued that most important decisions in life are some variant of an exploitation-exploration problem. Shall I stick with my current job or look for a new...

Bojack Horseman and Tidy Data Principles (Part 1)

July 15, 2019

Motivation: After reading The Life Changing Magic of Tidying Text and A tidy text analysis of Rick and Morty I wanted to do something similar for Rick and Morty and...

Pricing floating legs of interest rate swaps


In this post we will close the trilogy on (old style) swap pricing. In particular, we will look at how downloading the data for the variable rate needed to...

Aggregating spatial data with the grainchanger package


The grainchanger package provides functionality for data aggregation to a coarser resolution via moving-window or direct methods. Why do we need new methods for data aggregation? As landscape ecologists and macroecologists,...

Estimating treatment effects (and ICCs) for stepped-wedge designs

July 15, 2019

In the last two posts, I introduced the notion of time-varying intra-cluster correlations in the context of stepped-wedge study designs. (See here and here). Though I generated lots of...

Quick Hit: A Different (Diminutive) Look At Distributions With {ggeconodist}

July 15, 2019

Despite being a full-on denizen of all things digital, I receive a fair number of dead-tree print magazines as there’s nothing quite like seeing an amazing, large, full-color print...

Is Scholarly Use of R Beating SPSS Already?

July 15, 2019

by Bob Muenchen & Sean Mackinnon One of us (Muenchen) has been tracking The Popularity of Data Science Software using a variety of different approaches. One approach is to...

Twitter coverage of the useR! 2019 conference

July 15, 2019

Very briefly: Last week was useR! conference time again, coming to you this time from Toulouse, France. I’ve retrieved 8 318 tweets that mention #user2019 and run them through...

Looking at flood insurance claims with choroplethr

July 14, 2019

I recently learned how to use the choroplethr package through a short tutorial by the package author Ari Lamstein (youtube link here). To cement what I learned, I thought...

Recreating ‘Unknown Pleasures’ graphic

July 14, 2019

For some time I’ve wanted to recreate the cover art from Joy Division’s Unknown Pleasures album. The visualisation depicts successive pulses from the pulsar PSR B1919+21, discovered by Jocelyn...

Distribution of Headline Sentiment

July 14, 2019

My web scraping project explored the distribution of headline sentiment by news source. To do this, I scraped the Nasdaq latest market headlines page and applied sentiment analysis to...

rOpenSci Announces $678K Award from the Sloan Foundation to Expand Software Peer Review

We’re delighted to announce that we have received new funding from the Alfred P. Sloan Foundation. The $678K grant, awarded through the Foundation’s Data & Computational Research program, will...

Experimenting with Hierarchical Clustering in a galaxy far far away…

July 14, 2019

Introduction: This post will be taking a bit of an unexpected diversion. As I was experimenting with hierarchical clustering I ran into the issue of how many clusters to assume....

rstudio::conf(2020) is open for registration!

July 14, 2019

rstudio::conf, the conference for all things R and RStudio, will take place January 29 and 30, 2020 in San Francisco, California. It will be preceded by Training Days on...

Yet Another R Package for General Regression Neural Network

July 14, 2019

Compared with other types of neural networks, the General Regression Neural Network (GRNN; Specht, 1991) is advantageous in several respects. Being a universal approximation function, GRNN has only one tuning parameter...

Forecast Combination in R – slides

July 14, 2019

useR! 2019, held in Toulouse, ended a couple of days ago. I spoke about the recent R Journal publication on forecast combinations (joint work with Christoph Weiss and Gernot...

Some Details on Running xgboost

July 14, 2019

While reading Dr. Nina Zumel’s excellent note on bias in common ensemble methods, I ran the examples to see the effects she described (and I think it is very important...

Writing Functions in R: Example One

July 13, 2019

A. Background: In previous posts, I covered a number of useful functions and packages for writing reusable code. I wanted to expand on that information by providing a working...

Back from useR! 2019

July 13, 2019

I’m back from useR! 2019 in Toulouse, where I gave one talk and a workshop. Here are the links to the materials. 2019-07-08 Contributing to the R ecosystem useR! newbie session A short...

Simulating Data in R: Examples in Writing Modular Code

July 13, 2019

Simulating data is an invaluable tool. I use simulations to conduct power analyses, probe how robust methods are to violating assumptions, and examine how different methods handle different types...

Quick Hit: {waffle} 1.0 Font Awesome 5 Pictograms and More

July 12, 2019

The {waffle} package got some 💙 this week and now has a substantially improved geom_waffle() along with a brand new sibling function geom_pictogram() which has all the powerful new...
