December 2019

Practice R and Python on the Cloud for Free

December 16, 2019 | George Mount

R and Python, the “dynamic duo” of data science, are both free, open-source programming languages. That means that there’s no “vendor” in the sense that, say, Microsoft owns Excel. This can make getting started with these programs a little trickier: there are several ways to install them, often multi-step, ...
[Read more...]

BH 1.72.0-1 on CRAN

December 16, 2019 | Thinking inside the box

The BH package provides a sizeable portion of the Boost C++ libraries as a set of template headers for use by R. It is quite popular, and frequently used together with Rcpp. The BH CRAN page shows e.g. that it is used by rstan, dplyr as well as a ...
[Read more...]

knn model

December 15, 2019 | Modeling with R

Introduction Classification Data partition Train the model Prediction and confusion matrix Fine tuning the model Comparison between knn and svm model Regression Introduction In this paper we will explore the k nearest neighbors model using two data sets, the first is Tiatanic data to which we will fit this model ...
[Read more...]

Reordering bars in GGanimate visualization

December 15, 2019 | R | TypeThePipe

Last week several gganimate visualizations came to my feed. Some R users were wondering about reordering gganimate and ggplot2 bars as long as them are evolving (over animation time). Then, we came up with this R viz where several bars are not only evolving and reordering over time but leaving ... [Read more...]

tidyposterior’s Bayesian Approach to Model Comparison

December 15, 2019 | R Views

A task common to many machine learning workflows is to compare the performance of several models with respect to some metric such as accuracy or area under the ROC curve. Standard practice is to try out several different algorithms on a training data set and see which works better. Unfortunately, ...
[Read more...]

The Renzo Pomodoro dataset

December 15, 2019 | Derek Jones

Estimating how long it will take to complete a task is hard work, and the most common motivation for this work comes from external factors, e.g., the boss, or a potential client asks for an estimate to do a job. People also make estimates for their own use, e.... [Read more...]

The Renzo Pomodoro dataset

December 15, 2019 | Derek Jones

Estimating how long it will take to complete a task is hard work, and the most common motivation for this work comes from external factors, e.g., the boss, or a potential client asks for an estimate to do a job. People also make estimates for their own use, e.... [Read more...]

The significance of education on the salary of engineers in Sweden

December 14, 2019 | n n R Analystatistics Swedenn

In my last posts, I analysed the significance of experience for different occupational groups. In this post, I will turn the interest towards education. I will again start with engineers and see if I can expand my analysis to all occupational groups. First, define libraries and functions.
library (tidyverse)
## -- Attaching packages -------------------------------------------- tidyverse 1.2.1 --
## v ggplot2 3.2.0     v purrr   0.3.2
## v tibble  2.1.3     v dplyr   0.8.3
## v tidyr   0.8.3     v stringr 1.4.0
## v readr   1.3.1     v forcats 0.4.0
{{... [Read more...]

Git Hosting for the Distraught and the Restless

December 14, 2019 | Noam Ross

It’s generally impossible to only use services, private or government, that perfectly align with one’s values, so one must opt to choose one’s battles. The controversy over GitHub’s contract with U.S. Immigration and Customs Enforcement is the latest such battle in the open-source software world. ...
[Read more...]

A large repository of networkdata

December 14, 2019 | schochastics

There are many network repositories out there that offer a large variety of amazing free data. (See the awesome network analysis list on github for an overview.) The problem is, that network data can come in many formats. Either in plain text as edgelist or adjacency matrix, or in a ... [Read more...]

How H2O propels data scientists ahead of itself: enhancing Driverless AI with advanced options, recipes and visualizations

December 14, 2019 | Gregory Kanevsky

H2O engineers continually innovate and implement latest techniques by following and adopting latest research, working on cutting edge use cases, and participating and winning machine learning competitions like Kaggle. But thanks to explosion of AI research and applications even most advanced automated machine learning platforms like H2O.ai ...
[Read more...]

Why R? 2019 Conference – Summary

December 14, 2019 | Why R? Foundation

Why R? conferences have been the hallmark of the Why R? Foundation (whyr.pl). Our goal has been to establish a series of international R-related events in Poland. After three years, we are happy to announce that our main event, the Why R? conference, ... [Read more...]
1 5 6 7 8 9 13

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)