Adding macOS Touch Bar Support to RStudio

April 6, 2018
By
Adding macOS Touch Bar Support to RStudio

Modern MacBook Pros have a fairly useless (c’mon, admit it!) “Touch Bar” that did little more than cause severe ire in the developer community after turning a full-fledged, tactile Escape key into a hollow version if its former self. Having said, that, some apps do make OK use of it, with Fantastical and Omnigraffle being... Continue reading →

Read more »

Writing better R functions part one – April 6, 2018

April 5, 2018
By
Writing better R functions part one – April 6, 2018

One of the nicest things about working with R is that with very little effort you can customize and automate activities to produce the output you want – just the way you want it. You can contrast that with more monolithic packages that may allow you to do a bit of scripting, but for the most part, the price...

Read more »

Where is the value in package peer review?

Where is the value in package peer review?

If you read my reflection #1 on rOpenSci Onboarding, then you know I see value in the Onboarding process. A LOT of value even. This post is about where that value lies. This question has important corollaries which I will explore here based on my experience as a reviewer of bowerbird: How is a package peer reviewer’s time best spent? When is...

Read more »

A few podcast recommendations

April 5, 2018
By

After avoiding the entire medium for years, I've been rather getting into listening to podcasts lately. As a worker-from-home I don't have a commute (the natural use case of podcasts, I guess), but I have been travelling a lot more recently and it's been great to listen to during long flights. It turns out there are a lot of...

Read more »

Advanced Raster Data: Exercises

April 5, 2018
By
Advanced Raster Data: Exercises

Geospatial data is becoming increasingly used to solve numerous ‘real-life’ problems (check out some examples here.) In turn, R is becoming a powerful open-source solution to handle this type of data, currently providing an exceptional range of functions and tools for GIS and Remote Sensing data analysis. In particular, raster data provides support for representing Related exercise sets:Advanced Techniques...

Read more »

Laminar flow with ggplot2 and gganimate

April 5, 2018
By
Laminar flow with ggplot2 and gganimate

Preface I’ve realized that all my previous posts were quite substantial in length and took quite a long time to create them. From this point forward I’ll be generating posts of shorter length (partially for my sanity and more for my impulsivity with ideas). A few of these posts won’t be public health related (like... Continue Reading →

Read more »

Posterior probability of the null hypothesis being true, given a significant effect

April 5, 2018
By

For some reason, I am unable to load this post to google blogger. I have linked the post to an html file on my home page. Please comment here on this blog. Here is the post: Posterior probability of the null hypothesis being true, given a significant...

Read more »

Clojure Integration with R

April 4, 2018
By

(require ' ' ' ') ;; CREATE A TOY DATA (def ds [{:id 1.0 :name "name1"} {:id 2.0 :n...

Read more »

Not Hotdog: An R image classification application, using the Custom Vision API

April 4, 2018
By
Not Hotdog: An R image classification application, using the Custom Vision API

If you're a fan of the HBO show Silicon Valley, you probably remember the episode where Jian Yang creates an application to identify food using a smartphone phone camera: Surprisingly, the app in that scene isn't just a pre-recorded special effect: the producers actually developed a smartphone application using Tensorflow (and you can even download the app for your...

Read more »

The Travelling Salesman Portrait

April 4, 2018
By
The Travelling Salesman Portrait

I have noticed even people who claim everything is predestined, and that we can do nothing to change it, look before they cross the road (Stephen Hawking) Imagine a salesman and a set of cities. The salesman has to visit each one of the cities starting from a certain one and returning to the same … Continue reading The...

Read more »

Four Years of Practical Data Science with R

April 4, 2018
By
Four Years of Practical Data Science with R

Four years ago today authors Nina Zumel and John Mount received our author’s copies of Practical Data Science with R! It has its imitators, but it remains the best “I have R, now what do I do with it?” book (as it works the user through non-trivial projects, analyses, presentations, predictive analytic, data science, and … Continue reading Four...

Read more »

Constricted development with reticulate

April 4, 2018
By

I’ve been using the reticulate package occasionally for a while now, so I was surprised to see that it had only just been officially released. reticulate: R interface to Python https://t.co/qVWmwoMQAP. Comprehensive set of interoperability tools including R Markdown Python...Continue Reading →

Read more »

Exploring R-Bloggers Posts with the Feedly API

April 4, 2018
By
Exploring R-Bloggers Posts with the Feedly API

There’s a yuge chance you’re reading this post (at least initially) on R-Bloggers right now (though you should also check out R Weekly and add their live feed to your RSS reader pronto!). It’s a central “watering hole” for R folks and is read by many (IIRC over 20,000 Feedly users have it in their... Continue reading →

Read more »

Moving from RPubs to Github documents

April 4, 2018
By

If you still follow my Twitter feed – I pity you, as it’s been rather boring of late. Consisting largely of Github commit messages, many including the words “knit to github document”. Here’s why. RPubs, an early offering from RStudio, has been a great platform for easy and free publishing of HTML documents generated from … Continue reading Moving...

Read more »

Design Patterns in R

April 4, 2018
By

These notes are inspired by a talk by Stuart Sierra on Design Patterns in Functional Programming and some thoughts I found on F# for fun an profit and are reflection on how I use different strategies to solve things in R. Design Pattern seems...

Read more »

What is tidy eval and why should I care?

April 3, 2018
By
What is tidy eval and why should I care?

Nic Crane, Data Scientist This article was first published on Nic Crane's Blog and kindly contributed to the Mango Blog. I’m going to begin this post somewhat backwards, and start with the conclusion: tidy eval is important to anyone who writes R functions and uses dplyr and/or tidyr. I’m going to load a couple of packages, and then show you exactly why. library(dplyr) library(rlang) Data...

Read more »

Dominik is coming back on-board to manage Appsilon’s Open Source

Dominik is coming back on-board to manage Appsilon’s Open Source

Meet Dominik, our Open Source Tech Lead Before Dominik took a leadership role he used to be a Data Scientist at Appsilon. He has five years of experience in Python and R programming, mostly from data science and machine learning related project...

Read more »

Mathematical art in R

April 3, 2018
By
Mathematical art in R

Who says there's no art in mathematics? I've long admired the generative art that Thomas Lin Peterson occasionally posts (and that you can see on Instagram), and though he's a prolific R user I'm not quite sure how he makes his art. Marcus Volz has another beautiful portfolio of generative art, and has also created an R package you...

Read more »

My Data Science Tool Box

April 3, 2018
By

This post describes the tools I currently use for working with data. People often ask me to recommend specific tools, and I always hesitate, because so much boils down to personal preference. I recently added a workshop to the DSS lineup providing an ...

Read more »

A new RStudio addin to facilitate inserting tables in rmarkdown documents

April 3, 2018
By

In the last months, I started increasingly using Rmd documents for preparing scientific reports, blog posts, etcetera. While I really like the flexibility offered by the system, one thing that I thought could be improved is the support for easily inserting tables. So, “inspired” also by the recent addition of the excellent insert image addin in blogdown, I decided to give it a go and try...

Read more »

R FOR HYDROLOGISTS – Correlation and Information Theory Measurements: Part 3: Exercises

April 3, 2018
By
R FOR HYDROLOGISTS – Correlation and Information Theory Measurements: Part 3: Exercises

R FOR HYDROLOGISTS CORRELATION AND INFORMATION THEORY MEASUREMENTS – PART 3 Before we begin, if you don’t have the data, first get it from the first tutorial here. You will also need to Install and load the ggplot2 and reshape2 packages. if(!require(ggplot2)){install.packages(ggplot2, dep=T)} if(!require(reshape2)){install.packages(reshape2, dep=T)} Answers to these exercises are available here. The mutual information quantifies Related exercise sets:R FOR...

Read more »

An overview of keyword extraction techniques

April 3, 2018
By
An overview of keyword extraction techniques

In this blogpost, we will show 6 keyword extraction techniques which allow to find keywords in plain text. Keywords are frequently occuring words which occur somehow together in plain text. Common examples are New York, Monte Carlo, Mixed Models, Brussels Hoofdstedelijk Gewest, Public Transport, Central Station, p-values, ... If you master these techniques, it will allow you to easily step...

Read more »

Why you should master the small multiple chart

April 3, 2018
By
Why you should master the small multiple chart

I’ve often said that the foundation of data science is data analysis. That’s not to say that data analysis and data science are 100% synonymous (they’re not), but rather that you need to master the basic tools of data analysis before you can get into the more advanced skills that make up data science. Said The post Why you...

Read more »

Reliability chapter added to “Empirical software engineering using R”

April 3, 2018
By

The Reliability chapter of my Empirical software engineering book has been added to the draft pdf (download here). I have been working on this draft for four months and it still needs lots of work; time to move on and let it stew for a while. Part of the problem is lack of public data;

Read more »

Data on energy consumption and CO2 emissions

Data on energy consumption and CO2 emissions

I like to start my courses in energy economics and environmental economics by asking some question like Which energy source, e.g. gas, coal, nuclear, oil, renewables had the largest absolute increase in world wide energy consumption between the years 2000 und 2016? And then show some data. Below you can see an updated version of a googleVis visualization of...

Read more »

Ridgelines in bayesplot 1.5.0

April 2, 2018
By
Ridgelines in bayesplot 1.5.0

At the end of March, Jonah Gabry and I released bayesplot 1.5.0. The major additions to the package were visualizations using ridgelines and a new plot for PIT diagnostics from LOO validation. I don’t know what that LOO PIT thing is yet, so I’ll ta...

Read more »

R and TensorFlow Presentations

April 2, 2018
By

In early March, the Bay Area useR Group was able to hold an R and TensorFlow mini-conference on Google’s new Sunnyvale campus. Pete Mohanty, a Stanford researcher and frequent BARUG speaker, lead off with a talk on his recent kerasformula package, which allows R users to call a keras-based neural net with R formula objects. Pete’s slides show an...

Read more »

ὕδωρ + σκοπῶ = water + observe

ὕδωρ + σκοπῶ = water + observe

Hydrology is a concept to unify statistics, data analysis and numerical models in order to understand and analyze the endless circulation of water between the earth and its atmosphere. That’s a lot alike Data Science, isn’t it? Hydrologic Processes evolve in space and time, are extremely complex and we may never comprehend them. For this reason Hydrologists use models where...

Read more »

Fun with M&M’s – April 3, 2018

April 2, 2018
By
Fun with M&M’s – April 3, 2018

In this post we’re going to explore the Chi Squared Goodness of Fit test using M&M’s as our subject material. From there we’ll take a look at simultaneous confidence intervals a.k.a. multiple comparisons. On the R side of things we’ll make use of some old friends like ggplot2 and dplyr but we’ll also make use of two packages that...

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



Quantide: statistical consulting and training

ODSC2 west

ODSC1_jobs

datasociety

http://www.eoda.de

max kuhn

CRC R books series







Six Sigma Online Training



mljar.com

datazar.com



Contact us if you wish to help support R-bloggers, and place your banner here.