Maybe monad in R

May 24, 2019

A monad is a mysterious entity from the ivory towers of category theory, an idea that turned out to be quite useful in programming. Part of the myth surrounding monads is that as soon as you understand them, you lose the ability to explain the concept. Since I’m not a mathematician, not even a trained programmer, I won’t even try to explain anything. Instead,...

How do we combine errors, in biology? The delta method

In a recent post I showed that we can build linear combinations of model parameters (see here). For example, if we have two parameter estimates, say Q and W, with standard errors respectively equal to \(\sigma_Q\) and \(\sigma_W\), we can build a linear combination as follows: \[Z = AQ + BW + C\] where A, B and C are...
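
As a rough illustration of combining errors, the sketch below computes a linear combination of two estimates and its standard error using the usual variance rule for linear combinations. The numbers, the label Z and the covariance term `cov_QW` are assumptions made up for this example; they do not come from the post.

```r
# Hypothetical estimates and standard errors (made-up values for illustration)
Q <- 12.5
W <- 3.2
se_Q <- 0.8
se_W <- 0.3
cov_QW <- 0  # assumption: the two estimates are uncorrelated

# Hypothetical coefficients of the linear combination
A <- 2
B <- -1
C <- 5

# Linear combination of the two estimates
Z <- A * Q + B * W + C

# Standard error of the combination:
# Var(Z) = A^2 Var(Q) + B^2 Var(W) + 2 A B Cov(Q, W); the constant C adds no variance
se_Z <- sqrt(A^2 * se_Q^2 + B^2 * se_W^2 + 2 * A * B * cov_QW)

print(c(estimate = Z, std_error = se_Z))
```

For a linear combination this rule is exact; the delta method in the title extends the same idea to nonlinear functions of the estimates.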

Predicting Car Battery Failure With R And H2O – Study

May 24, 2019

Using R and H2O Isolation Forest to predict car battery failures. Carlos Kassab, 2019-May-24. This is a study about what might happen if car makers started using machine learning in our cars to predict failures.

# Loading libraries
suppressWarnings( suppressMessages( library( h2o ) ) )
suppressWarnings( suppressMessages( library( data.table ) ) )
suppressWarnings( suppressMessages( library( plotly ) ) )
suppressWarnings( suppressMessages( library( DT ) ) )
#...

Random Autocorrelation Sequences R version

May 24, 2019

What is an autocorrelation sequence? Autocorrelation sequences (ACSs) are super common when doing anything in probability and statistics. Autocorrelation is a sequence of measurements of how similar a sequence is to itself. In math, the autocorrelation sequence r is \(r[k] = E[x[n]\,x[n+k]]\) for k={0,1,...N-1}, where N is the number of...
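
A minimal sketch of estimating such a sequence from data, assuming the lag-k autocorrelation is the expected product of the sequence with a copy of itself shifted by k. The expectation is replaced here by a sum of the available products divided by N (a biased estimator, which is one common convention). The function name `acs` is hypothetical.

```r
# Hypothetical helper: sample autocorrelation sequence for lags k = 0, ..., N - 1.
# Assumption: E[.] is approximated by summing the overlapping products and
# dividing by N (biased estimator).
acs <- function(x) {
  N <- length(x)
  sapply(0:(N - 1), function(k) {
    sum(x[1:(N - k)] * x[(1 + k):N]) / N
  })
}

x <- c(1, 2, 3, 4)
r <- acs(x)
print(r)
```

Lag 0 compares the sequence with itself exactly, so the first element is always the largest in absolute value.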

Practical Data Science with R, half off sale!

May 24, 2019

Our publisher, Manning, is running a Memorial Day sale this weekend (May 24-27, 2019), with a new offer every day. Fri: half off all eBooks; Sat: half off all MEAPs; Sun: half off all pBooks and liveVideos; Mon: half off everything. The discount code is: wm052419au. Many great opportunities to get Practical Data Science with …

RStudio & ThinkR roadshow – June 6 – Paris

May 24, 2019

On June 6th, 2019, RStudio is partnering with ThinkR to offer you a one-day event around “R in production”. See you in Paris! If you’re an experienced developer or a decision-maker looking to learn more about what R and RStudio have to offer, then this event is made for you! During the first part of the event,...

News from ggiraph

May 23, 2019

ggiraph 0.6.1 has evolved; this post presents work that has been done recently. ggiraph, what is it? The ggiraph package lets you work with ggplot and produce interactive graphics. The number of features is low and ggiraph usage is simple. The three features to be aware of are: the ability to animate points, polygons or lines; the ability to display tooltips when the mouse is over these elements; the ability to...

[R]eady for Production: a Joint Event with RStudio and EODA

May 23, 2019

We’re excited to team up with EODA, an RStudio Full Service Certified Partner, to host a free data science in production event in Frankfurt, Germany, on June 13. This one-day event will be geared for data science and IT teams that want to ...

Estimates on training vs. validation samples

May 23, 2019

Before moving to cross-validation, it was natural to say “I will burn 50% (say) of my data to train a model, and then use the remaining data to fit the model”. For instance, we can use training data for variable selection (e.g. using some stepwise procedure in a logistic regression), and then, once variables have been selected, fit the model...

Royal Society of Biology: Introduction to Reproducible Analyses in R

May 23, 2019

Learn to experiment with R to make analyses and figures more reproducible. If you’re in the UK and not too far from York, you might be interested in a Royal Society of Biology course which forms part of the Industry Skills Certificate. More details at this link: Introduction to Reproducible Analyses in R, 24 June…

Spotlight on: Julia Silge, Stack Overflow

May 23, 2019

Julia Silge is joining us as one of our keynote speakers at EARL London 2019. We can’t wait to hear Julia’s full keynote, but until then she kindly answered a few questions. Julia shared with us what we can expect from her address – which will focus on how Stack Overflow uses R and their recent developer survey. Hi...

Bayesian estimation of fatality rates and accidents involving cyclists on Queensland roads

May 23, 2019

In my previous post I built a Shiny app mapping accidents on Queensland roads which was great at showing the... The post Bayesian estimation of fatality rates and accidents involving cyclists on Queensland roads appeared first on Daniel Oehm | Gradient Descending.

Save a flextable as an image

May 22, 2019

flextable 0.5.4 is now on CRAN. It contains a new output option that some users were asking for: image output. You can now save a flextable as a png or pdf file with the function save_as_image. The solution had existed for a long time but was buried in a Stack Overflow question instead of being provided in flextable as an option. This functionality allows other options to be...

Comparing Frequentist, Bayesian and Simulation methods and conclusions

So, a programmer, a frequentist, and a Bayesian walk into a bar. No, this post isn’t really on the path to some politically incorrect stereotypical humor. Just trying to make it fun and catch your attention. As the title implies, this post is really about applying the differing viewpoints and methodologies inherent in those approaches to statistics. To be honest I’m not even...

Analysing the HIV pandemic, Part 4: Classification of lab samples

May 22, 2019

Andrie de Vries is the author of “R for Dummies” and a Solutions Engineer at RStudio. Phillip (Armand) Bester is a medical scientist, researcher, and lecturer at the Division of Virology, University of the Free State, and National Health Laboratory Service (NHLS), Bloemfontein, South Africa. In this post we complete our series on analysing the HIV pandemic in Africa. Previously we...

MRAN snapshots, and you

May 22, 2019

For almost five years, the entire CRAN repository of R packages has been archived on a daily basis at MRAN. If you use CRAN snapshots from MRAN, we'd love to hear how you use them in this survey. If you're not familiar with the concept, or just want to learn more, read on. Every day since September 17, 2014,...

Deep (learning) like Jacques Cousteau – Part 5 – Vector addition

(TL;DR: You can add vectors that have the same number of elements.)
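
A small sketch of what this looks like in R. Note that base R will silently recycle the shorter vector when lengths differ (warning only if one length is not a multiple of the other), so the helper `add_vectors` below, a hypothetical name introduced for this example, checks the lengths explicitly.

```r
a <- c(1, 2, 3)
b <- c(10, 20, 30)

# Element-wise addition of two vectors of equal length
print(a + b)

# Hypothetical helper: refuse to add vectors of different lengths,
# instead of relying on R's recycling behaviour
add_vectors <- function(u, v) {
  stopifnot(length(u) == length(v))
  u + v
}

s <- add_vectors(a, b)

# A length mismatch raises an error, which is caught here for demonstration
mismatch <- tryCatch(
  add_vectors(a, c(1, 2)),
  error = function(e) "lengths differ"
)
print(mismatch)
```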

New Color Palette for R

May 22, 2019

As I was preparing some graphics for a presentation recently, I started digging into some of the different color palette options. My motivation was entirely about creating graphics that weren't too visually overwhelming, which I found the default "rain...

Easy quick PCA analysis in R

May 22, 2019

Principal component analysis (PCA) is very useful for doing some basic quality control (e.g. looking for batch effects) and for assessing how the data is distributed (e.g. finding outliers). A straightforward way is to make your own wrapper function for prcomp and ggplot2; another way is to use the one that comes with M3C (https://bioconductor.org/packages/devel/bioc/html/M3C.html)
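
One possible shape for such a home-made wrapper is sketched below. The function name `quick_pca`, its arguments, and the use of the built-in `iris` data are assumptions for illustration only; this is not the M3C implementation. The ggplot2 step is optional and only runs if that package is installed.

```r
# Hypothetical wrapper around prcomp: samples in rows, numeric variables in columns
quick_pca <- function(x, scale. = TRUE) {
  x <- as.matrix(x)
  fit <- prcomp(x, center = TRUE, scale. = scale.)

  # Proportion of variance explained by each principal component
  var_explained <- fit$sdev^2 / sum(fit$sdev^2)

  # Scores on the first two components, ready for plotting
  scores <- data.frame(
    sample = seq_len(nrow(x)),
    PC1 = fit$x[, 1],
    PC2 = fit$x[, 2]
  )

  list(scores = scores, var_explained = var_explained)
}

res <- quick_pca(iris[, 1:4])
print(round(res$var_explained, 3))

# Optional: build a scatter plot of the first two components with ggplot2
if (requireNamespace("ggplot2", quietly = TRUE)) {
  p <- ggplot2::ggplot(res$scores, ggplot2::aes(x = PC1, y = PC2)) +
    ggplot2::geom_point()
}
```

Points far from the main cloud in the PC1/PC2 plot are candidate outliers, and clusters that line up with processing batches hint at batch effects.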

Create a CLI for R with npm

May 21, 2019

How to build a CLI for R, with npm. Background This blog post was triggered by a discussion on Twitter with Martin Skarzynski, who was looking for a way to build a CLI that launches an RScript. Here’s a way to do this using npm. Please note that t...

Bug when Creating Reference Maps with Choroplethr

May 21, 2019

Last week the Census Bureau published a free course I created on using Choroplethr to map US Census Data. Unfortunately, a few people have reported problems when following one of the examples in the course. This post describes that issue and provides instructions for working around it. Where does the bug occur? The bug occurs...

Timing hash functions with the bench package

May 21, 2019

This blog post has two goals: to investigate the bench package for timing R functions and, consequently, to explore the different algorithms in the digest package using bench. What is digest? The digest package provides a hash function to summarise R objects. Standard hashes are available, such as md5, crc32, sha-1, and sha-256. The key function in the...

Introductory course on Bayesian data analysis for beginners with Rasmus Bååth

May 21, 2019

On 25 November 2019, BayesCamp will be hosting a one-day course called Get up and running with Bayesian data analysis, …

Add Dressbarn to the Continued Retailpocalypse

May 21, 2019

I’ve talked about the retailpocalypse before and this morning I was greeted with the news about Dressbarn closing all 650 stores as I fired up a browser. I tweeted some pix and data but not everyone is on Twitter so I’m just posting a blog-blurb here with the code and data links. Code is below...

rsync as R package

May 21, 2019

In this article we present our R package rsync, which serves as an interface between R and the popular Linux command line tool rsync. Originally rsync is an open source tool for efficiently synchronizing files. Published by Paul Mackerras and Andrew Tridgell under the GNU General...

Learning R: The Ultimate Introduction (incl. Machine Learning!)

May 21, 2019

There are a million reasons to learn R (see e.g. Why R for data science – and not Python?), but where to start? I present to you the ultimate introduction to bring you up to speed! So read on… I call it ultimate because it is the essence of many years of teaching R… or …

Modern Data Science with R: A review

May 20, 2019

Some say data is the new oil. Others equate its worth to water. And then there are those who believe that data scientists will be (in fact, they already are) one of the most sought-after workers in knowledge economies. Millions of data-centric jobs requ...

Free Video Lecture: Vectors for Programmers and Data Scientists

May 20, 2019

We have just released two new free video lectures on vectors from a programmer’s point of view. I am experimenting with which ideas programmers find interesting about vectors, which concepts they consider safe starting points, and how to condense and present the material. Please check the lectures out. Vectors for Programmers and Data …
