Speedmining the Cubing Community with dbplyr

Speedmining the Cubing Community with dbplyr

Abstract: We use the RMariaDB and dbplyr packages to analyze the results database of the World Cubing Association. In particular we are interested in finding unofficial world records of fastest 3x3x3 solves, countries with large proportion of female cubers as well as acceptable solving times before entering a WCA competition. This work is licensed under a Creative Commons Attribution-ShareAlike 4.0...

Read more »

R Weekly on Github

May 5, 2019
By
R Weekly on Github

Check out this repo from Rweekly.org on Github: https://github.com/rweekly/rweekly.org This is a great place to start with some coding fun and contribute to the community. Also hold you accountable. Here’s an excerpt to their page: R Weekly R weekly provides weekly updates from the R community. You are welcome to contribute as long as you … Continue reading "R...

Read more »

Repeated cross-validation in cvms and groupdata2

May 5, 2019
By
Repeated cross-validation in cvms and groupdata2

I have spent the last couple of days adding functionality for performing repeated cross-validation to cvms and groupdata2. In this quick post I will show an example. In cross-validation, we… Read More → Indlægget Repeated cross-validation in cvms and groupdata2 blev først udgivet på .

Read more »

Hierarchical Customer Lifetime Value

May 4, 2019
By
Hierarchical Customer Lifetime Value

Hierarchical Customer Lifetime Value Posted on 5 May, 2019 by Brian Tags: customer lifetime value, recency frequency, hierarchical model, centred parameterisation, non-centred parameterisation, prior-predictive distribution, stan, e-bfmi, energy Category: customer_lifetime_value In a previous post,...

Read more »

Benchmark of popular graph/network packages

May 4, 2019
By
Benchmark of popular graph/network packages

In this post I benchmark the performance of 5 popular graph/network packages. This was inspired by two questions I had: Recently, I have been working with large networks (millions of vertices and edges) and often wonder what is the best currently a...

Read more »

rTRNG on CRAN now!

May 4, 2019
By
rTRNG on CRAN now!

After dwelling happily on our GitHub repositories for an extended period of time, rTRNG has now finally made it to CRAN. We are very happy to get this out just ahead of the R in Finance in Chicago, where the functionality had been show-cased with an applied example two years ago. The main changes since then are related to dependency upgrades...

Read more »

Deep (learning) like Jacques Cousteau – Part 2 – Scalars

Deep (learning) like Jacques Cousteau – Part 2 – Scalars

(TL;DR: Scalars are single numbers.)

Read more »

Free online r course

May 4, 2019
By

Recently a young relative mentioned that the campus R course she hoped to attend was full. What online alternatives did she have? So, I decided to start one of my own! https://github.com/matloff/fasteR  Designed for complete beginners. I now have six lessons up on the site. I hope to add one new lesson per week. Advertisements

Read more »

Why Use Weight of Evidence?

May 4, 2019
By
Why Use Weight of Evidence?

I had been asked why I spent so much effort on developing SAS macros and R functions to do monotonic binning for the WoE transformation, given the availability of other cutting-edge data mining algorithms that will automatically generate the prediction with whatever predictors fed in the model. Nonetheless, what really distinguishes a good modeler from

Read more »

How do casinos make money?

May 4, 2019
By
How do casinos make money?

This blog was written by Sacha Epskamp. In my class on Structural Equation Modeling, I introduce the concepts of expected values and variances through the game of roulette. A french roulette game looks like this: A ball will roll in the left basin and land on one of the 37 values. You can bet on

Read more »

Fast food, causality and R packages, part 2

Fast food, causality and R packages, part 2

I am currently working on a package for the R programming language; its initial goal was to simply distribute the data used in the Card and Krueger 1994 paper that you can read here (PDF warning). However, I decided that I would add code to perform diff-in-diff. In my previous blog post I showed how to set up the structure of your new package....

Read more »

One Step to Quickly Improve the Readability and Visual Appeal of ggplot Graphs

May 3, 2019
By
One Step to Quickly Improve the Readability and Visual Appeal of ggplot Graphs

There's something wonderful about a graph that communicates a point clearly. You know it when you see it. It's the kind of graph that makes you pause and say 'wow!'. There are all kinds of different graphs that fit this description, but they usually have a few things in common: Clarity: The message of the graph is clear Simplicity: Extraneous details...

Read more »

Availability of Microsoft R Open 3.5.2 and 3.5.3

May 2, 2019
By

It's taken a little bit longer than usual, but Microsoft R Open 3.5.2 (MRO) is now available for download for Windows and Linux. This update is based on R 3.5.2, and accordingly fixes a few minor bugs compared to MRO 3.5.1. The main change you will note is that new CRAN packages released since R 3.5.1 can now be...

Read more »

Deep (learning) like Jacque Cousteau – Part 1 – Sets

Deep (learning) like Jacque Cousteau – Part 1 – Sets

(TL;DR: I’m going to go deep into deep learning. Sets are collections of things.)

Read more »

R for Data Science in a Day

May 2, 2019
By
R for Data Science in a Day

Want to get hands-on experience with R and explore new Data Science project? On 13 May we're hosting a R for Data Science workshop, come and join us! The post R for Data Science in a Day appeared first on MilanoR.

Read more »

Under Pi : gganimate test around quadrature of the circle

May 2, 2019
By
Under Pi : gganimate test around quadrature of the circle

An updated look on squaring the circle using gganimate and R code. It gives a geometric and visual construction, a good and practical representation of what Pi is. As n becomes larger, segments become smaller and smaller, Pi can then be se...

Read more »

Queensland road accidents mapped with Shiny and leaflet in R

May 2, 2019
By
Queensland road accidents mapped with Shiny and leaflet in R

The Queensland government collects data on road accidents dating back to 1st January 2001 and details characteristics of the incident The post Queensland road accidents mapped with Shiny and leaflet in R appeared first on Daniel Oehm | Gradient Descending.

Read more »

[R]eady for production: The Data Science Event 2019 with eoda and RStudio

May 2, 2019
By
[R]eady for production: The Data Science Event 2019 with eoda and RStudio

eoda and RStudio invite the German speaking R-community to the Data Science Event 2019 in Frankfurt on June 13th – the event for the productive use of R. Learn how…

Read more »

Earthquake Analysis (2/4): Categorical Variables Exploratory Analysis

May 1, 2019
By
Earthquake Analysis (2/4): Categorical Variables Exploratory Analysis

CategoriesBasic Statistics Tags Data Visualisation Exploratory Analysis R Programming This is the second part of our post series about the exploratory analysis of a publicly available dataset reporting earthquakes and similar events within a specific time window of 30 days. In the following, we are going to analyze the categorical variables of our dataset. The categorical variables can take on one of a limited,...

Read more »

cranlogs 2.1.1 is on CRAN!

May 1, 2019
By
cranlogs 2.1.1 is on CRAN!

Version 2.1.1 of the cranlogs package has been released on CRAN! cranlogs queries a web API maintained by R-hub that contains the daily download numbers for each package and for R itself from the RStudio CRAN mirror. 📈 Get cranlogs’s latest version from CRAN, hopefully from RStudio CRAN mirror so we can monitor counts. 😉 install.packages("cranlogs", repos = "https://cran.rstudio.com/") The changes brought by...

Read more »

Practical Introduction to Market Basket Analysis – Asociation Rules

May 1, 2019
By
Practical Introduction to Market Basket Analysis – Asociation Rules

Introduction Ever wondered why items are displayed in a particular way in retail/online stores. Why certain items are suggested to you based on what you have added to the cart? Blame it on market basket analysis or association rule mining. Resources Below are the links to all the resources related to this post: Slides Code & Data RStudio Cloud What? Market basket analysis uses association rule mining under the...

Read more »

Process Mining (Part 3/3): More analysis and visualizations

May 1, 2019
By
Process Mining (Part 3/3): More analysis and visualizations

Intro A week ago, Havard Business Review published an article on process mining and provided reasons for companies to adopt it. If you need a refresher on the concepts of process mining, you can refer to my first post. Conducting process mining is easy with R’s bupaR package. bupaR allows you to create a variety of visualizations as you analyse...

Read more »

Bayesian models in R

May 1, 2019
By
Bayesian models in R

If there was something that always frustrated me was not fully understanding Bayesian inference. Sometime last year, I came across an article about a TensorFlow-supported R package for Bayesian analysis, called greta. Back then, I searched for greta tutorials and stumbled on this blog post that praised a textbook called Statistical Rethinking: A Bayesian Course with Examples in … Continue reading Bayesian...

Read more »

Detailed Guide to the Bar Chart in R with ggplot

May 1, 2019
By
Detailed Guide to the Bar Chart in R with ggplot

When it comes to data visualization, flashy graphs can be fun. Believe me, I'm as big a fan of flashy graphs as anybody. But if you're trying to convey information, especially to a broad audience, flashy isn't always the way to go. Whether it's the line graph, scatter plot, or bar chart (the subject of this guide!), choosing a...

Read more »

Set Analysis: A face off between Venn diagrams and UpSet plots

April 30, 2019
By
Set Analysis: A face off between Venn diagrams and UpSet plots

A tutorial to explore the differences between two practices used for set analysis: Venn diagrams and UpSet plots. Tutorial coded in R but similar concepts and packages are available in many tool sets.

Read more »

Risk and return for B3

April 30, 2019
By
Risk and return for B3

One of the subjects that I teach in my undergraduate finance class is the relationship between risk and expected returns. In short, the riskier the investment, more returns should be expected by the investor. It is not a difficult argument to make. All that you need to understand is to remember that people are not naive in financial markets....

Read more »

Building a Shiny App as a Package

April 30, 2019
By

Shiny App as a Package In a previous post, I’ve introduced the {golem} package, which is an opinionated framework for building production-ready Shiny Applications. This framework starts by creating a package skeleton waiting to be filled. But, in a world where Shiny Applications are mostly created as a series of files, why bother with a package? This is the...

Read more »

Machine Learning explained for statistics folk

April 30, 2019
By

I’m running a one-day workshop called “From Statistics To Machine Learning” in central London on 28 October, for anyone who … More

Read more »

Spatial data and maps conference – FOSS4G

April 30, 2019
By
Spatial data and maps conference – FOSS4G

I’m helping organise a conference on (geo)spatial open source software – FOSS4G. We’re hosting it in the great city of Edinburgh, Scotland in September 2019. Abstract submissions: https://uk.osgeo.org/foss4guk2019/talks_workshops.html We’re very interested in hearing… Continue reading →

Read more »

Search R-bloggers

Sponsors