Failure Pressure Prediction Using Machine Learning

December 10, 2018
By
Failure Pressure Prediction Using Machine Learning

CategoriesRegression Models Tags ggplot2 Machine Learning Prediction R Programming In this post, the failure pressure will be predicted for a pipeline containing a defect based solely on burst test results and learning machine models. For this purpose, various Machine Learning models will be fitted to test data under R using the caret package, and in the process compare the accuracy of the models in Related...

Read more »

10 R functions for Linux commands and vice-versa

December 10, 2018
By
10 R functions for Linux commands and vice-versa

This post will go through 10 different Linux commands and their R alternatives. If you’re interested in learning more R functions for working with files like some of those below, also check out this post. How to list all the files in a directory Linux R What does it do? ls list.files() Lists all the The post 10 R...

Read more »

{remedy} is now on CRAN

December 10, 2018
By
{remedy} is now on CRAN

After living for more than a year on GitHub, we are pleased to announce that the {remedy} package is now on CRAN.  A package for easier Markdown writing About {remedy} Have you ever been frustrated about having to manually add markdown tags to your Rmd? If you are an avid markdown user, no doubt you’ve been complaining about what...

Read more »

Day 10 – little helper %nin%

December 10, 2018
By
Day 10 – little helper %nin%

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

The Need for Speed Part 1: Building an R Package with Fortran (or C)

December 9, 2018
By
The Need for Speed Part 1: Building an R Package with Fortran (or C)

Searching for Speed Everyone who has ever used R has, at one time or another, wished for an increase in R’s speed. If you haven’t, you’re not using R hard enough! Recently, as part of some research on credibility, I was calculating layer loss costs for millions of simulated loss observations. As I progressed, the Read the full article... The...

Read more »

Parallel processing to add a little zip to power simulations (and other replication studies)

December 9, 2018
By
Parallel processing to add a little zip to power simulations (and other replication studies)

It’s always nice to be able to speed things up a bit. My first blog post ever described an approach using Rcpp to make huge improvements in a particularly intensive computational process. Here, I want to show how simple it is to speed things up by using the R package parallel and its function mclapply. I’ve been using this...

Read more »

Network Centrality in R: Neighborhood Inclusion

December 9, 2018
By
Network Centrality in R: Neighborhood Inclusion

This is the second post of a series on the concept of “network centrality” with applications in R and the package netrankr. The first part briefly introduced the concept itself, relevant R package, and some reoccurring issues for applications. This post will discuss some theoretical foundations and common properties of indices, specifically the neighborhood-inclusion preorder and what we can learn from them. library(igraph) library(ggraph) library(tidyverse) library(netrankr) Introduction When looking...

Read more »

Introduction to chartbookR

Introduction to chartbookR

”“Data scientists … spend from 50 percent to 80 percent of their time … collecting and preparing unruly digital data, before it can be explored for useful nuggets.” (— New York Times). The chartbookR package allows the convenient creation of economic and financial data chartbooks. It handles most of the data wrangling and can thus create large chartbooks with few...

Read more »

An 8-hour course on R and Data Mining

December 9, 2018
By

I will run an 8-hour course on R and Data Mining at Black Mountain, CSIRO, Australia on 10 & 13 December 2018. The course materials, incl. slides, R scripts and datasets, are available at http://www.rdatamining.com/training/course. Below is outline of the … Continue reading →

Read more »

CRAN Release of R/exams 2.3-2

December 9, 2018
By
CRAN Release of R/exams 2.3-2

New minor release of the R/exams package to CRAN, containing a range of smaller improvements and bug fixes. Notably scanning of written NOPS exams is enhanced and made more reliable and a new exercise template demonstrates how to use advanced...

Read more »

Interesting packages taken from R/Pharma

December 9, 2018
By
Interesting packages taken from R/Pharma

A few month ago I joined the R/Pharma conference in Cambridge, MA. As a take away I thought of my project and how I can improve, with solutions others provided. Mainly solutions in R are R-packages. So I’m a R-Shiny programmer in a regulated environment, so the list of the solutions I took are mainly helping you, if you are...

Read more »

Canada Map

December 9, 2018
By
Canada Map

I taught my Data Visualization seminar in Philadelphia this past Friday and Saturday. It covers most of the content of my book, including a unit on making maps. The examples in the book are from the United States. But what about other places? Two of the participants were from Canada, and so here’s an example that walks through the...

Read more »

Automatic Dashboard visualizations with Time series visualizations in R

December 9, 2018
By
Automatic Dashboard visualizations with Time series visualizations in R

CategoriesProgramming Tags R Programming RMarkdown Time Series In this article, you learn how to make Automatic Dashboard visualizations with Time series visualizations in R. First you need to install the `rmarkdown` package into your R library. Assuming that you installed the `rmarkdown`, next you create a new `rmarkdown` script in R. After this you type the following code in order to create Related Post Automated...

Read more »

Smartly select and mutate data frame columns, using dict

December 9, 2018
By
Smartly select and mutate data frame columns, using dict

Motivation Column operations Add Modify Remove Benchmark Summary Motivation The dplyr functions select and mutate nowadays are commonly applied to perform data.frame column operations, frequently combined with magrittrs forward %__% pipe. While working well interactively, however, these methods often would require additional checking if used in “serious” code, for example, to catch column name clashes. In principle, the container package provides a dict-class...

Read more »

Day 09 – little helper object_size_in_env

December 9, 2018
By
Day 09 – little helper object_size_in_env

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

Having bit of party with Material Colour Palette

December 8, 2018
By
Having bit of party with Material Colour Palette

Continuing on with my “slight?” obsession with colours… I love colours in “Material Colour Palette”. There various website that will lets you grab the colours by clicking, such as this one, but I just wanted to have little handy cheet sheet for myself, so I’ve decided I’ll do that using R & my favourite ggplot2. Getting colours out of image...

Read more »

transformr: Age of Spatial

December 8, 2018
By
transformr: Age of Spatial

Once again, I gives me great pleasure to announce a new package has joined CRAN. transformr is the spatial brother of tweenr and as with the tweenr update a few months ago, this package is very much driven by the infrastructural needs of gganimate. It is probably the last piece needed before I can begin preparing gganimate for CRAN, so if you are...

Read more »

Interactive panel EDA with 3 lines of code

Interactive panel EDA with 3 lines of code

Exploratory data analysis is important, everybody knows that. With R, it is also easy. Below you see three lines of code that allow you to interactively explore the Preston Curve, the prominent association of country level real income per capita with life expectancy. install.packages("ExPanDaR") library(ExPanDaR) wb

Read more »

confint3: 2-Sided Confidence Interval (Extended Moodle Version)

December 8, 2018
By
confint3: 2-Sided Confidence Interval (Extended Moodle Version)

Exercise template for computing the 2-sided confidence interval (with extended Moodle processing) for the mean based on a random sample. Name: confint3 Type: cloze Related: confint2 Description: Computing the 2-sided confidence interval at 95% level for...

Read more »

It was twenty years ago …

December 8, 2018
By

… this week that I made a first cameo in the debian/changelog for the Debian R package: r-base (0.63.1-1) unstable; urgency=low New upstream release Linked html directory to /usr/doc/r-base/doc/html (Dirk Eddelbuettel) – Douglas Bates [email protected]

Read more »

Timing Grouped Mean Calculation in R

December 8, 2018
By
Timing Grouped Mean Calculation in R

This note is a comment on some of the timings shared in the dplyr-0.8.0 pre-release announcement. The original published timings were as follows: With performance metrics: measurements are marketing. So let’s dig in the above a bit. These timings are of the kind of small task large number of repetition breed that Matt Dowle writes … Continue reading Timing...

Read more »

Automated Dashboard visualizations with distribution in R

December 8, 2018
By
Automated Dashboard visualizations with distribution in R

CategoriesProgramming Tags Data Visualisation R Markdown R Programming In this article, you learn how to make Automated Dashboard visualizations with distribution in R. First you need to install the `rmarkdown` package into your R library. Assuming that you installed the `rmarkdown`, next you create a new `rmarkdown` script in R. After this you type the following code in order to create a dashboard Related...

Read more »

R:case4base – Sorting data with base R

December 8, 2018
By
R:case4base – Sorting data with base R

Introduction In this post in the R:case4base series we will examine sorting (ordering) data in base R. We will learn to sort our data based on one or multiple columns, with ascending or descending order and as always look at alternatives to base R, namely the tidyverse’s dplyr and data.table to show how we can achieve the same results. It is...

Read more »

Day 08 – little helper intersect2

December 8, 2018
By
Day 08 – little helper intersect2

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

“Increase sample size until statistical significance is reached” is not a valid adaptive trial design; but it’s fixable.

December 7, 2018
By

TLDR: Begin with N of 10, increase by 10 until p __ 0.05 or max N reached. This design has inflated type-I error. Lower p-value threshold needed to ensure specified type-I error rate. The number of interim analyses and max N affect the type-I error rate. Threshold can be identified using simulation. A recent Facebook … Continue reading "Increase...

Read more »

Automated Dashboard Visualizations with Ranking in R

December 7, 2018
By
Automated Dashboard Visualizations with Ranking in R

CategoriesProgramming Tags Data Visualisation R Markdown R Programming In this article, you learn how to make Automated Dashboard Visualizations with Ranking in R. First you need to install the `rmarkdown` package into your R library. Assuming that you installed the `rmarkdown`, next you create a new `rmarkdown` script in R. After this you type the following code in order to create a dashboard Related...

Read more »

Shinyfit: Advanced regression modelling in a shiny app

December 7, 2018
By
Shinyfit: Advanced regression modelling in a  shiny app

Many of our projects involve getting doctors, nurses, and medical students to collect data on the patients they are looking after. We want to involve many of them in data analysis, without the requirement for coding experience or access to statistical software. To achieve this we have built Shinyfit, a shiny app for linear, logistic, … Continue reading "Shinyfit:...

Read more »

R community update: announcing useR Delhi December meetup and CFP

December 7, 2018
By
R community update: announcing useR Delhi December meetup and CFP

Time really does fly. It’s been 5 months since Delhi NCR useR group had come into being and our first meetup. It was a successful event which included sessions featuring an R-core member and a veteran data scientist. More importantly, …

Read more »

Day 07 – little helper count_na

December 7, 2018
By
Day 07 – little helper count_na

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



wiley.com/learn/datascience

datasciencego.com

Quantide: statistical consulting and training

ODSC boston

datasociety

http://www.eoda.de









Six Sigma Online Training

mljar.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.